Density-based clustering validation

Davoud Moulavi, Pablo A. Jaskowiak, Ricardo J.G.B. Campello, Arthur Zimek, Jorg Sander

Publication: Contribution to book/anthology/report/conference proceeding; Conference contribution in proceedings; Research; Peer-reviewed


One of the most challenging aspects of clustering is validation, which is the objective and quantitative assessment of clustering results. A number of different relative validity criteria have been proposed for the validation of globular clusters. Not all data, however, are composed of globular clusters. Density-based clustering algorithms seek partitions with high-density areas of points (clusters, not necessarily globular) separated by low-density areas, possibly containing noise objects. In these cases, relative validity indices proposed for globular cluster validation may fail. In this paper we propose a relative validation index for density-based, arbitrarily shaped clusters. The index assesses clustering quality based on the relative density connection between pairs of objects. Our index is formulated on the basis of a new kernel density function, which is used to compute the density of objects and to evaluate the within- and between-cluster density connectedness of clustering results. Experiments on synthetic and real-world data show the effectiveness of our approach for the evaluation and selection of clustering algorithms and their respective appropriate parameters.

Title: Proceedings of the 2014 SIAM International Conference on Data Mining
Editors: Mohammed Zaki, Zoran Obradovic, Pang Ning-Tan, Arindam Banerjee, Chandrika Kamath, Srinivasan Parthasarathy
Publisher: Society for Industrial and Applied Mathematics Publications
ISBN (Electronic): 978-1-61197-344-0
Status: Published - 2014
Published externally: Yes
Event: 14th SIAM International Conference on Data Mining - Philadelphia, USA
Duration: 24 Apr 2014 to 26 Apr 2014


Conference: 14th SIAM International Conference on Data Mining
Sponsor: American Statistical Association



Moulavi, D., Jaskowiak, P. A., Campello, R. J. G. B., Zimek, A., & Sander, J. (2014). Density-based clustering validation. In M. Zaki, Z. Obradovic, P. Ning-Tan, A. Banerjee, C. Kamath, & S. Parthasarathy (Eds.), Proceedings of the 2014 SIAM International Conference on Data Mining (pp. 839-847). Society for Industrial and Applied Mathematics Publications.