Machine learning and extremes for anomaly detection

Abstract : Anomaly detection is not only a useful preprocessing step for training machine learning algorithms. It is also a crucial component of many real-world applications, from various fields like finance, insurance, telecommunication, computational biology, health or environmental sciences. Anomaly detection is also more and more relevant in the modern world, as an increasing number of autonomous systems need to be monitored and diagnosed. Important research areas in anomaly detection include the design of efficient algorithms and their theoretical study but also the evaluation of such algorithms, in particular when no labeled data is available -- as in lots of industrial setups. In other words, model design and study, and model selection. In this thesis, we focus on both of these aspects. We first propose a criterion for measuring the performance of any anomaly detection algorithm. Then we focus on extreme regions, which are of particular interest in anomaly detection, to obtain lower false alarm rates. Eventually, two heuristic methods are proposed, the first one to evaluate anomaly detection algorithms in the case of high dimensional data, the other to extend the use of random forests to the one-class setting.
Document type :
Theses
Complete list of metadatas

Cited literature [160 references]  Display  Hide  Download

https://pastel.archives-ouvertes.fr/tel-01658623
Contributor : Abes Star <>
Submitted on : Thursday, December 7, 2017 - 5:24:40 PM
Last modification on : Thursday, October 17, 2019 - 12:36:10 PM

File

theseGoix.pdf
Version validated by the jury (STAR)

Identifiers

  • HAL Id : tel-01658623, version 1

Collections

Citation

Nicolas Goix. Machine learning and extremes for anomaly detection. Machine Learning [stat.ML]. Télécom ParisTech, 2016. English. ⟨NNT : 2016ENST0072⟩. ⟨tel-01658623⟩

Share

Metrics

Record views

1011

Files downloads

882