Interpreting and unifying outlier scores

Hans Peter Kriegel, Peer Kröger, Erich Schubert, Arthur Zimek

Research output: Chapter in Book/Report/Conference proceedingArticle in proceedingsResearchpeer-review

Abstract

Outlier scores provided by different outlier models differ widely in their meaning, range, and contrast between different outlier models and, hence, are not easily comparable or interpretable. We propose a unification of outlier scores provided by various outlier models and a translation of the arbitrary "outlier factors" to values in the range [0, 1] interpretable as values describing the probability of a data object of being an outlier. As an application, we show that this unification facilitates enhanced ensembles for outlier detection.

Original languageEnglish
Title of host publicationProceedings of the 11th SIAM International Conference on Data Mining, SDM 2011
EditorsBing Liu, Huan Liu, Chris Clifton, Takashi Washio, Chandrika Kamath
PublisherSociety for Industrial and Applied Mathematics
Publication dateDec 2011
Pages13-24
ISBN (Print)978-0-89871-992-5
ISBN (Electronic)978-1-61197-281-8
DOIs
Publication statusPublished - Dec 2011
Externally publishedYes
Event11th SIAM International Conference on Data Mining - Mesa, United States
Duration: 28. Apr 201130. Apr 2011

Conference

Conference11th SIAM International Conference on Data Mining
CountryUnited States
CityMesa
Period28/04/201130/04/2011
SponsorAmerican Statistical Association

Cite this

Kriegel, H. P., Kröger, P., Schubert, E., & Zimek, A. (2011). Interpreting and unifying outlier scores. In B. Liu, H. Liu, C. Clifton, T. Washio, & C. Kamath (Eds.), Proceedings of the 11th SIAM International Conference on Data Mining, SDM 2011 (pp. 13-24). Society for Industrial and Applied Mathematics. https://doi.org/10.1137/1.9781611972818.2
Kriegel, Hans Peter ; Kröger, Peer ; Schubert, Erich ; Zimek, Arthur. / Interpreting and unifying outlier scores. Proceedings of the 11th SIAM International Conference on Data Mining, SDM 2011. editor / Bing Liu ; Huan Liu ; Chris Clifton ; Takashi Washio ; Chandrika Kamath. Society for Industrial and Applied Mathematics, 2011. pp. 13-24
@inproceedings{0537201fd87848c6907d262948ff2255,
title = "Interpreting and unifying outlier scores",
abstract = "Outlier scores provided by different outlier models differ widely in their meaning, range, and contrast between different outlier models and, hence, are not easily comparable or interpretable. We propose a unification of outlier scores provided by various outlier models and a translation of the arbitrary {"}outlier factors{"} to values in the range [0, 1] interpretable as values describing the probability of a data object of being an outlier. As an application, we show that this unification facilitates enhanced ensembles for outlier detection.",
author = "Kriegel, {Hans Peter} and Peer Kr{\"o}ger and Erich Schubert and Arthur Zimek",
year = "2011",
month = "12",
doi = "10.1137/1.9781611972818.2",
language = "English",
isbn = "978-0-89871-992-5",
pages = "13--24",
editor = "Bing Liu and Huan Liu and Chris Clifton and Takashi Washio and Chandrika Kamath",
booktitle = "Proceedings of the 11th SIAM International Conference on Data Mining, SDM 2011",
publisher = "Society for Industrial and Applied Mathematics",
address = "United States",

}

Kriegel, HP, Kröger, P, Schubert, E & Zimek, A 2011, Interpreting and unifying outlier scores. in B Liu, H Liu, C Clifton, T Washio & C Kamath (eds), Proceedings of the 11th SIAM International Conference on Data Mining, SDM 2011. Society for Industrial and Applied Mathematics, pp. 13-24, 11th SIAM International Conference on Data Mining, Mesa, United States, 28/04/2011. https://doi.org/10.1137/1.9781611972818.2

Interpreting and unifying outlier scores. / Kriegel, Hans Peter; Kröger, Peer; Schubert, Erich; Zimek, Arthur.

Proceedings of the 11th SIAM International Conference on Data Mining, SDM 2011. ed. / Bing Liu; Huan Liu; Chris Clifton; Takashi Washio; Chandrika Kamath. Society for Industrial and Applied Mathematics, 2011. p. 13-24.

Research output: Chapter in Book/Report/Conference proceedingArticle in proceedingsResearchpeer-review

TY - GEN

T1 - Interpreting and unifying outlier scores

AU - Kriegel, Hans Peter

AU - Kröger, Peer

AU - Schubert, Erich

AU - Zimek, Arthur

PY - 2011/12

Y1 - 2011/12

N2 - Outlier scores provided by different outlier models differ widely in their meaning, range, and contrast between different outlier models and, hence, are not easily comparable or interpretable. We propose a unification of outlier scores provided by various outlier models and a translation of the arbitrary "outlier factors" to values in the range [0, 1] interpretable as values describing the probability of a data object of being an outlier. As an application, we show that this unification facilitates enhanced ensembles for outlier detection.

AB - Outlier scores provided by different outlier models differ widely in their meaning, range, and contrast between different outlier models and, hence, are not easily comparable or interpretable. We propose a unification of outlier scores provided by various outlier models and a translation of the arbitrary "outlier factors" to values in the range [0, 1] interpretable as values describing the probability of a data object of being an outlier. As an application, we show that this unification facilitates enhanced ensembles for outlier detection.

U2 - 10.1137/1.9781611972818.2

DO - 10.1137/1.9781611972818.2

M3 - Article in proceedings

AN - SCOPUS:84864227933

SN - 978-0-89871-992-5

SP - 13

EP - 24

BT - Proceedings of the 11th SIAM International Conference on Data Mining, SDM 2011

A2 - Liu, Bing

A2 - Liu, Huan

A2 - Clifton, Chris

A2 - Washio, Takashi

A2 - Kamath, Chandrika

PB - Society for Industrial and Applied Mathematics

ER -

Kriegel HP, Kröger P, Schubert E, Zimek A. Interpreting and unifying outlier scores. In Liu B, Liu H, Clifton C, Washio T, Kamath C, editors, Proceedings of the 11th SIAM International Conference on Data Mining, SDM 2011. Society for Industrial and Applied Mathematics. 2011. p. 13-24 https://doi.org/10.1137/1.9781611972818.2