Search CORE

110 research outputs found

Variational Autoencoders for New Physics Mining at the Large Hadron Collider

Author: Cerri Olmo
Nguyen Thong Q.
Pierini Maurizio
Spiropulu Maria
Vlimant Jean-Roch
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 26/11/2018
Field of study

Using variational autoencoders trained on known physics processes, we develop a one-sided threshold test to isolate previously unseen processes as outlier events. Since the autoencoder training does not depend on any specific new physics signature, the proposed procedure doesn't make specific assumptions on the nature of new physics. An event selection based on this algorithm would be complementary to classic LHC searches, typically based on model-dependent hypothesis testing. Such an algorithm would deliver a list of anomalous events, that the experimental collaborations could further scrutinize and even release as a catalog, similarly to what is typically done in other scientific domains. Event topologies repeating in this dataset could inspire new-physics model building and new experimental searches. Running in the trigger system of the LHC experiments, such an application could identify anomalous events that would be otherwise lost, extending the scientific reach of the LHC.Comment: 29 pages, 12 figures, 5 table

arXiv.org e-Print Archive

Directory of Open Access Journals

Caltech Authors

CERN Document Server

Variational Autoencoders for New Physics Mining at the Large Hadron Collider

Author: Cerri Olmo
Nguyen Thong Q.
Pierini Maurizio
Spiropulu Maria
Vlimant Jean-Roch
Publication venue: 'Springer Fachmedien Wiesbaden GmbH'
Publication date: 01/05/2019
Field of study

Using variational autoencoders trained on known physics processes, we develop a one-sided threshold test to isolate previously unseen processes as outlier events. Since the autoencoder training does not depend on any specific new physics signature, the proposed procedure doesn’t make specific assumptions on the nature of new physics. An event selection based on this algorithm would be complementary to classic LHC searches, typically based on model-dependent hypothesis testing. Such an algorithm would deliver a list of anomalous events, that the experimental collaborations could further scrutinize and even release as a catalog, similarly to what is typically done in other scientific domains. Event topologies repeating in this dataset could inspire new-physics model building and new experimental searches. Running in the trigger system of the LHC experiments, such an application could identify anomalous events that would be otherwise lost, extending the scientific reach of the LHC

Adversarially Learned Anomaly Detection on CMS Open Data: re-discovering the top quark

Author: Cerri Olmo
Dissertori Guenther
Knapp Oliver
Nguyen Thong Q.
Pierini Maurizio
Vlimant Jean-Roch
Publication venue
Publication date: 04/05/2020
Field of study

We apply an Adversarially Learned Anomaly Detection (ALAD) algorithm to the problem of detecting new physics processes in proton-proton collisions at the Large Hadron Collider. Anomaly detection based on ALAD matches performances reached by Variational Autoencoders, with a substantial improvement in some cases. Training the ALAD algorithm on 4.4 fb-1 of 8 TeV CMS Open Data, we show how a data-driven anomaly detection and characterization would work in real life, re-discovering the top quark by identifying the main features of the t-tbar experimental signature at the LHC.Comment: 16 pages, 9 figure

arXiv.org e-Print Archive

CERN Document Server

Adversarially Learned Anomaly Detection on CMS open data: re-discovering the top quark

Author: Cerri O.
Dissertori G.
Knapp O.
Nguyen T. Q.
Pierini M.
Vlimant J. R.
Publication venue: 'Springer Fachmedien Wiesbaden GmbH'
Publication date: 01/01/2021
Field of study

We apply an Adversarially Learned Anomaly Detection (ALAD) algorithm to the problem of detecting new physics processes in proton–proton collisions at the Large Hadron Collider. Anomaly detection based on ALAD matches performances reached by Variational Autoencoders, with a substantial improvement in some cases. Training the ALAD algorithm on 4.4 fb⁻¹ of 8 TeV CMS Open Data, we show how a data-driven anomaly detection and characterization would work in real life, re-discovering the top quark by identifying the main features of the tt̄ experimental signature at the LHC

Repository for Publications and Research Data

Caltech Authors

Nanosecond anomaly detection with decision trees for high energy physics and real-time application to exotic Higgs decays

Author: Bayer Quincy
Carlson Benjamin
Hong Tae Min
Ouligian William
Roche Stephen
Serhiayenka Pavel
Stelzer Joerg
Publication venue
Publication date: 07/04/2023
Field of study

We present a novel implementation of the artificial intelligence autoencoding algorithm, used as an ultrafast and ultraefficient anomaly detector, built with a forest of deep decision trees on FPGA, field programmable gate arrays. Scenarios at the Large Hadron Collider at CERN are considered, for which the autoencoder is trained using known physical processes of the Standard Model. The design is then deployed in real-time trigger systems for anomaly detection of new unknown physical processes, such as the detection of exotic Higgs decays, on events that fail conventional threshold-based algorithms. The inference is made within a latency value of 25 ns, the time between successive collisions at the Large Hadron Collider, at percent-level resource usage. Our method offers anomaly detection at the lowest latency values for edge AI users with tight resource constraints.Comment: 26 pages, 9 figures, 1 tabl

arXiv.org e-Print Archive

Anomaly Detection in Collider Physics via Factorized Observables

Author: Metodiev Eric M.
Thaler Jesse
Wynne Raymond
Publication venue
Publication date: 30/11/2023
Field of study

To maximize the discovery potential of high-energy colliders, experimental searches should be sensitive to unforeseen new physics scenarios. This goal has motivated the use of machine learning for unsupervised anomaly detection. In this paper, we introduce a new anomaly detection strategy called FORCE: factorized observables for regressing conditional expectations. Our approach is based on the inductive bias of factorization, which is the idea that the physics governing different energy scales can be treated as approximately independent. Assuming factorization holds separately for signal and background processes, the appearance of non-trivial correlations between low- and high-energy observables is a robust indicator of new physics. Under the most restrictive form of factorization, a machine-learned model trained to identify such correlations will in fact converge to the optimal new physics classifier. We test FORCE on a benchmark anomaly detection task for the Large Hadron Collider involving collimated sprays of particles called jets. By teasing out correlations between the kinematics and substructure of jets, our method can reliably extract percent-level signal fractions. This strategy for uncovering new physics adds to the growing toolbox of anomaly detection methods for collider physics with a complementary set of assumptions.Comment: 5+11 pages, 3+7 figure

arXiv.org e-Print Archive

Benchmark data and model independent event classification for the large hadron collider

Author: Aarrestad T
Bona M
Boveia A
Caron S
Davies J
de Austri RR
De Simone A
Doglioni C
Duarte JM
Farbin A
Gupta H
Heinrich LA
Hendriks L
Howarth J
Jawahar P
Jueid A
Lastow J
Leinweber A
Mamuzic J
Merenyi E
Morandini A
Moskvitina P
Nellist C
Ngadiuba J
Ostdiek B
Pierini M
Ravina B
Sekmen S
Touranakou M
van Beekveld M
Vaskeviciute M
Verheyen R
Vilalta R
Vlimant J-R
Wallin E
White M
Wozniak KA
Wulff E
Zhang Z
Publication venue: 'Stichting SciPost'
Publication date: 28/05/2021
Field of study

We describe the outcome of a data challenge conducted as part of the Dark Machines (https://www.darkmachines.org) initiative and the Les Houches 2019 workshop on Physics at TeV colliders. The challenged aims to detect signals of new physics at the Large Hadron Collider (LHC) using unsupervised machine learning algorithms. First, we propose how an anomaly score could be implemented to define model-independent signal regions in LHC searches. We define and describe a large benchmark dataset, consisting of > 1 billion simulated LHC events corresponding to 10 fb−1 of proton-proton collisions at a center-of-mass energy of 13 TeV. We then review a wide range of anomaly detection and density estimation algorithms, developed in the context of the data challenge, and we measure their performance in a set of realistic analysis environments. We draw a number of useful conclusions that will aid the development of unsupervised new physics searches during the third run of the LHC, and provide our benchmark dataset for future studies at https://www.phenoMLdata.org. Code to reproduce the analysis is provided at https://github.com/bostdiek/DarkMachines-UnsupervisedChallenge

Lund University Publications

UCL Discovery

Queen Mary Research Online

CERN Document Server

Anomalies, representations, and self-supervision

Author: Dillon Barry
Favaro Luigi
Feiden Friedrich
Modak Tanmoy
Plehn Tilman
Publication venue
Publication date: 16/08/2024
Field of study

We develop a self-supervised method for density-based anomaly detection using contrastive learning, and test it using event-level anomaly data from CMS ADC2021. The AnomalyCLR technique is data-driven and uses augmentations of the background data to mimic non-Standard-Model events in a model-agnostic way. It uses a permutation-invariant Transformer Encoder architecture to map the objects measured in a collider event to the representation space, where the data augmentations define a representation space which is sensitive to potential anomalous features. An AutoEncoder trained on background representations then computes anomaly scores for a variety of signals in the representation space. With AnomalyCLR we find significant improvements on performance metrics for all signals when compared to the raw data baseline

Ulster University's Research Portal