How to find a unicorn: a novel model-free, unsupervised anomaly detection method for time series

Benkő, Zsigmond; Bábel, Tamás; Somogyvári, Zoltán

doi:10.1038/s41598-021-03526-y

Computer Science > Machine Learning

arXiv:2004.11468 (cs)

[Submitted on 23 Apr 2020 (v1), last revised 15 Jun 2021 (this version, v3)]

Title:How to find a unicorn: a novel model-free, unsupervised anomaly detection method for time series

Authors:Zsigmond Benkő, Tamás Bábel, Zoltán Somogyvári

View PDF

Abstract:Recognition of anomalous events is a challenging but critical task in many scientific and industrial fields, especially when the properties of anomalies are unknown. In this paper, we introduce a new anomaly concept called "unicorn" or unique event and present a new, model-free, unsupervised detection algorithm to detect unicorns. The key component of the new algorithm is the Temporal Outlier Factor (TOF) to measure the uniqueness of events in continuous data sets from dynamic systems. The concept of unique events differs significantly from traditional outliers in many aspects: while repetitive outliers are no longer unique events, a unique event is not necessarily an outlier; it does not necessarily fall out from the distribution of normal activity. The performance of our algorithm was examined in recognizing unique events on different types of simulated data sets with anomalies and it was compared with the Local Outlier Factor (LOF) and discord discovery algorithms. TOF had superior performance compared to LOF and discord algorithms even in recognizing traditional outliers and it also recognized unique events that those did not. The benefits of the unicorn concept and the new detection method were illustrated by example data sets from very different scientific fields. Our algorithm successfully recognized unique events in those cases where they were already known such as the gravitational waves of a binary black hole merger on LIGO detector data and the signs of respiratory failure on ECG data series. Furthermore, unique events were found on the LIBOR data set of the last 30 years.

Subjects:	Machine Learning (cs.LG); Signal Processing (eess.SP); Data Analysis, Statistics and Probability (physics.data-an); Machine Learning (stat.ML)
Cite as:	arXiv:2004.11468 [cs.LG]
	(or arXiv:2004.11468v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2004.11468
Related DOI:	https://doi.org/10.1038/s41598-021-03526-y

Submission history

From: Zsigmond Benkő [view email]
[v1] Thu, 23 Apr 2020 21:38:38 UTC (2,909 KB)
[v2] Tue, 5 May 2020 14:58:17 UTC (3,578 KB)
[v3] Tue, 15 Jun 2021 09:08:02 UTC (58,700 KB)

Computer Science > Machine Learning

Title:How to find a unicorn: a novel model-free, unsupervised anomaly detection method for time series

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:How to find a unicorn: a novel model-free, unsupervised anomaly detection method for time series

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators