On the comparison of relative clustering validity criteria

Lucas Vendramin*, Ricardo J.G.B. Campello, Eduardo R. Hruschka

*Corresponding author

Publication: Chapter in book/report/conference proceeding › Conference contribution in proceedings › Research › Peer-reviewed

Abstract

Many relative clustering validity criteria exist that are very useful in practice as quantitative measures of the quality of data partitions, and new criteria continue to be proposed. Each criterion has particular features that may allow it to outperform the others on specific classes of problems. Consequently, it is hard for the user to choose a specific criterion when faced with such a variety of possibilities. For this reason, a relevant issue in cluster analysis is comparing the performance of existing validity criteria and, possibly, that of a newly proposed criterion. However, the comparison paradigm traditionally adopted in the literature has some conceptual flaws. This paper presents an alternative methodology for comparing clustering validity criteria and uses it to make an extensive comparison of the performance of 4 well-known validity criteria and 20 variants of them over a collection of 142,560 partitions of 324 different data sets of a given class of interest.

Original language: English
Title: Society for Industrial and Applied Mathematics - 9th SIAM International Conference on Data Mining 2009, Proceedings in Applied Mathematics 133
Publication date: 2009
Pages: 729-740
ISBN (Print): 9781615671090
Status: Published - 2009
Published externally: Yes
Event: 9th SIAM International Conference on Data Mining 2009, SDM 2009 - Sparks, NV, USA
Duration: 30 Apr 2009 to 2 May 2009

Conference

Conference: 9th SIAM International Conference on Data Mining 2009, SDM 2009
Country/Territory: USA
City: Sparks, NV
Period: 30/04/2009 to 02/05/2009
