OATAO - Open Archive Toulouse Archive Ouverte

K-means improvement by dynamic pre-aggregates

El Malki, Nabil and Ravat, Franck and Teste, Olivier. K-means improvement by dynamic pre-aggregates. (2019) In: 21st International Conference on Enterprise Information Systems (ICEIS 2019), 3 May 2019 - 5 May 2019 (Heraklion, Crete, Greece).

(Document in English)

PDF (Author's version), 590kB

Official URL: https://doi.org/10.5220/0007675201330140

Abstract

The k-means algorithm is one of the well-known clustering algorithms. It requires iterative, repeated accesses to the data, to the point of performing the same calculations several times on the same data. However, intermediate results, which are difficult to predict at the beginning of the k-means process, are not recorded, so some calculations are redone in subsequent iterations. These repeated calculations can be costly, especially when clustering massive data. In this article, we propose to extend the k-means algorithm by introducing pre-aggregates. These aggregates can then be reused to avoid redundant calculations during successive iterations. We demonstrate the value of the approach through several experiments, which show that the larger the volume of data, the more the pre-aggregations speed up the algorithm.
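
The abstract does not describe how the pre-aggregates are built, so the following is only a generic sketch of the underlying idea (keeping aggregates between iterations instead of recomputing them), not the authors' algorithm. It assumes the aggregates are per-cluster coordinate sums and counts, which are adjusted only for points that change cluster. The function name `kmeans_with_aggregates` and its parameters are hypothetical.

```python
def sq_dist(a, b):
    """Squared Euclidean distance between two points."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def nearest(point, centroids):
    """Index of the centroid closest to the given point."""
    return min(range(len(centroids)), key=lambda j: sq_dist(point, centroids[j]))


def kmeans_with_aggregates(points, centroids, max_iter=100):
    """Batch k-means that reuses per-cluster sums and counts across iterations.

    Illustrative assumption: the stored aggregates are the coordinate sums
    and the sizes of each cluster. This is not taken from the paper.
    """
    k, dim = len(centroids), len(points[0])
    centroids = [list(c) for c in centroids]
    sums = [[0.0] * dim for _ in range(k)]   # aggregate: coordinate sums per cluster
    counts = [0] * k                         # aggregate: number of points per cluster
    labels = [None] * len(points)

    for _ in range(max_iter):
        moved = 0
        for i, p in enumerate(points):
            new, old = nearest(p, centroids), labels[i]
            if new == old:
                continue  # stored aggregates already account for this point
            if old is not None:
                # Remove the point's contribution from its previous cluster.
                counts[old] -= 1
                for d in range(dim):
                    sums[old][d] -= p[d]
            # Add the point's contribution to its new cluster.
            counts[new] += 1
            for d in range(dim):
                sums[new][d] += p[d]
            labels[i] = new
            moved += 1
        if moved == 0:
            break  # assignments are stable, so the centroids are final
        # Centroids come straight from the aggregates, without a pass over the data.
        for j in range(k):
            if counts[j] > 0:
                centroids[j] = [s / counts[j] for s in sums[j]]
    return centroids, labels
```

Calling `kmeans_with_aggregates(points, initial_centroids)` with a list of coordinate tuples returns the final centroids and one cluster label per point. In this sketch the saving is in the centroid update, which costs work proportional to the number of points that moved rather than to the size of the data set; the distance computations in the assignment step are not reduced.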

Item Type: Conference or Workshop Item (Paper)
Additional Information: Thanks to SCITEPRESS (Science and Technology Publications) editor. The definitive version is available at http://www.scitepress.org. This paper appears in Proceedings of the 21st International Conference on Enterprise Information Systems - Volume 2: ICEIS, ISBN: 978-989-758-372-8. The original PDF is available at: http://www.scitepress.org/DigitalLibrary/Link.aspx?doi=10.5220/0007675201330140
HAL Id: hal-02493880
Audience (conference): National conference proceedings
Uncontrolled Keywords:
Institution: French research institutions > Centre National de la Recherche Scientifique - CNRS (FRANCE)
Université de Toulouse > Institut National Polytechnique de Toulouse - Toulouse INP (FRANCE)
Université de Toulouse > Université Toulouse III - Paul Sabatier - UT3 (FRANCE)
Université de Toulouse > Université Toulouse - Jean Jaurès - UT2J (FRANCE)
Université de Toulouse > Université Toulouse 1 Capitole - UT1 (FRANCE)
Other partners > Capgemini (FRANCE)
Laboratory name:
Deposited On: 10 Feb 2020 14:04
