Autoencoders for unsupervised anomaly detection in high energy physics

Abstract Autoencoders are widely used in machine learning applications, in particular for anomaly detection. Hence, they have been introduced in high energy physics as a promising tool for model-independent new physics searches. We scrutinize the usage of autoencoders for unsupervised anomaly detect...

Full description

Bibliographic Details
Main Authors: Thorben Finke, Michael Krämer, Alessandro Morandini, Alexander Mück, Ivan Oleksiyuk
Format: Article
Language:English
Published: SpringerOpen 2021-06-01
Series:Journal of High Energy Physics
Subjects:
Online Access:https://doi.org/10.1007/JHEP06(2021)161
id doaj-ba89d8f495c24cc188cd5a0ad38edcf7
record_format Article
spelling doaj-ba89d8f495c24cc188cd5a0ad38edcf72021-07-04T11:52:01ZengSpringerOpenJournal of High Energy Physics1029-84792021-06-012021613210.1007/JHEP06(2021)161Autoencoders for unsupervised anomaly detection in high energy physicsThorben Finke0Michael Krämer1Alessandro Morandini2Alexander Mück3Ivan Oleksiyuk4Institute for Theoretical Particle Physics and Cosmology (TTK), RWTH Aachen UniversityInstitute for Theoretical Particle Physics and Cosmology (TTK), RWTH Aachen UniversityInstitute for Theoretical Particle Physics and Cosmology (TTK), RWTH Aachen UniversityInstitute for Theoretical Particle Physics and Cosmology (TTK), RWTH Aachen UniversityInstitute for Theoretical Particle Physics and Cosmology (TTK), RWTH Aachen UniversityAbstract Autoencoders are widely used in machine learning applications, in particular for anomaly detection. Hence, they have been introduced in high energy physics as a promising tool for model-independent new physics searches. We scrutinize the usage of autoencoders for unsupervised anomaly detection based on reconstruction loss to show their capabilities, but also their limitations. As a particle physics benchmark scenario, we study the tagging of top jet images in a background of QCD jet images. Although we reproduce the positive results from the literature, we show that the standard autoencoder setup cannot be considered as a model-independent anomaly tagger by inverting the task: due to the sparsity and the specific structure of the jet images, the autoencoder fails to tag QCD jets if it is trained on top jets even in a semi-supervised setup. Since the same autoencoder architecture can be a good tagger for a specific example of an anomaly and a bad tagger for a different example, we suggest improved performance measures for the task of model-independent anomaly detection. We also improve the capability of the autoencoder to learn non-trivial features of the jet images, such that it is able to achieve both top jet tagging and the inverse task of QCD jet tagging with the same setup. However, we want to stress that a truly model-independent and powerful autoencoder-based unsupervised jet tagger still needs to be developed.https://doi.org/10.1007/JHEP06(2021)161JetsQCD Phenomenology
collection DOAJ
language English
format Article
sources DOAJ
author Thorben Finke
Michael Krämer
Alessandro Morandini
Alexander Mück
Ivan Oleksiyuk
spellingShingle Thorben Finke
Michael Krämer
Alessandro Morandini
Alexander Mück
Ivan Oleksiyuk
Autoencoders for unsupervised anomaly detection in high energy physics
Journal of High Energy Physics
Jets
QCD Phenomenology
author_facet Thorben Finke
Michael Krämer
Alessandro Morandini
Alexander Mück
Ivan Oleksiyuk
author_sort Thorben Finke
title Autoencoders for unsupervised anomaly detection in high energy physics
title_short Autoencoders for unsupervised anomaly detection in high energy physics
title_full Autoencoders for unsupervised anomaly detection in high energy physics
title_fullStr Autoencoders for unsupervised anomaly detection in high energy physics
title_full_unstemmed Autoencoders for unsupervised anomaly detection in high energy physics
title_sort autoencoders for unsupervised anomaly detection in high energy physics
publisher SpringerOpen
series Journal of High Energy Physics
issn 1029-8479
publishDate 2021-06-01
description Abstract Autoencoders are widely used in machine learning applications, in particular for anomaly detection. Hence, they have been introduced in high energy physics as a promising tool for model-independent new physics searches. We scrutinize the usage of autoencoders for unsupervised anomaly detection based on reconstruction loss to show their capabilities, but also their limitations. As a particle physics benchmark scenario, we study the tagging of top jet images in a background of QCD jet images. Although we reproduce the positive results from the literature, we show that the standard autoencoder setup cannot be considered as a model-independent anomaly tagger by inverting the task: due to the sparsity and the specific structure of the jet images, the autoencoder fails to tag QCD jets if it is trained on top jets even in a semi-supervised setup. Since the same autoencoder architecture can be a good tagger for a specific example of an anomaly and a bad tagger for a different example, we suggest improved performance measures for the task of model-independent anomaly detection. We also improve the capability of the autoencoder to learn non-trivial features of the jet images, such that it is able to achieve both top jet tagging and the inverse task of QCD jet tagging with the same setup. However, we want to stress that a truly model-independent and powerful autoencoder-based unsupervised jet tagger still needs to be developed.
topic Jets
QCD Phenomenology
url https://doi.org/10.1007/JHEP06(2021)161
work_keys_str_mv AT thorbenfinke autoencodersforunsupervisedanomalydetectioninhighenergyphysics
AT michaelkramer autoencodersforunsupervisedanomalydetectioninhighenergyphysics
AT alessandromorandini autoencodersforunsupervisedanomalydetectioninhighenergyphysics
AT alexandermuck autoencodersforunsupervisedanomalydetectioninhighenergyphysics
AT ivanoleksiyuk autoencodersforunsupervisedanomalydetectioninhighenergyphysics
_version_ 1721319876930306048