Abstract
Outlier detection has been a topic in statistics for centuries. Over mainly the last two decades, there has been also an increasing interest in the database and data mining community to develop scalable methods for outlier detection. Initially based on statistical reasoning, however, these methods soon lost the direct probabilistic interpretability of the derived outlier scores. Here, we detail from a joint point of view of data mining and statistics the roots and the path of development of statistical outlier detection and of database‐related data mining methods for outlier detection. We discuss their inherent meaning, review approaches to again find a statistically meaningful interpretation of outlier scores, and sketch related current research topics.
| Original language | English |
|---|---|
| Article number | e1280 |
| Journal | Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery |
| Volume | 8 |
| Issue number | 6 |
| Number of pages | 26 |
| ISSN | 1942-4787 |
| DOIs | |
| Publication status | Published - 1. Nov 2018 |
Keywords
- anomaly detection
- outlier detection
- outier model
- statistics and data mining
Fingerprint
Dive into the research topics of 'There and back again: Outlier detection between statistical reasoning and data mining algorithms'. Together they form a unique fingerprint.Related Press/Media
-
There and Back Again: Outlier Detection Between Statistical Reasoning And Data Mining Algorithms
14/09/2018
1 item of Media coverage
Press/Media: Press / Media