Clustering refinement

Félix Iglesias*, Tanja Zseby, Arthur Zimek

*Kontaktforfatter for dette arbejde

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningpeer review


Advanced validation of cluster analysis is expected to increase confidence and allow reliable implementations. In this work, we describe and test CluReAL, an algorithm for refining clustering irrespective of the method used in the first place. Moreover, we present ideograms that enable summarizing and properly interpreting problem spaces that have been clustered. The presented techniques are built on absolute cluster validity indices. Experiments cover a wide variety of scenarios and six of the most popular clustering techniques. Results show the potential of CluReAL for enhancing clustering and the suitability of ideograms to understand the context of the data through the lens of the cluster analysis. Refinement and interpretability are both crucial to reduce failure and increase performance control and operational awareness in unsupervised analysis.

TidsskriftInternational Journal of Data Science and Analytics
Udgave nummer4
Sider (fra-til)333-353
StatusUdgivet - okt. 2021

Bibliografisk note

Funding Information:
This work was partly supported by the project MALware cOmmunication in cRitical Infrastructures (MALORI), funded by the Austrian security research programme KIRAS of the Federal Ministry for Agriculture, Regions and Tourism (BMLRT) under grant no. 873511.

Publisher Copyright:
© 2021, The Author(s).


Dyk ned i forskningsemnerne om 'Clustering refinement'. Sammen danner de et unikt fingeraftryk.