110 research outputs found
Variational Autoencoders for New Physics Mining at the Large Hadron Collider
Using variational autoencoders trained on known physics processes, we develop
a one-sided threshold test to isolate previously unseen processes as outlier
events. Since the autoencoder training does not depend on any specific new
physics signature, the proposed procedure doesn't make specific assumptions on
the nature of new physics. An event selection based on this algorithm would be
complementary to classic LHC searches, typically based on model-dependent
hypothesis testing. Such an algorithm would deliver a list of anomalous events,
that the experimental collaborations could further scrutinize and even release
as a catalog, similarly to what is typically done in other scientific domains.
Event topologies repeating in this dataset could inspire new-physics model
building and new experimental searches. Running in the trigger system of the
LHC experiments, such an application could identify anomalous events that would
be otherwise lost, extending the scientific reach of the LHC.Comment: 29 pages, 12 figures, 5 table
Variational Autoencoders for New Physics Mining at the Large Hadron Collider
Using variational autoencoders trained on known physics processes, we develop a one-sided threshold test to isolate previously unseen processes as outlier events. Since the autoencoder training does not depend on any specific new physics signature, the proposed procedure doesnât make specific assumptions on the nature of new physics. An event selection based on this algorithm would be complementary to classic LHC searches, typically based on model-dependent hypothesis testing. Such an algorithm would deliver a list of anomalous events, that the experimental collaborations could further scrutinize and even release as a catalog, similarly to what is typically done in other scientific domains. Event topologies repeating in this dataset could inspire new-physics model building and new experimental searches. Running in the trigger system of the LHC experiments, such an application could identify anomalous events that would be otherwise lost, extending the scientific reach of the LHC
Adversarially Learned Anomaly Detection on CMS Open Data: re-discovering the top quark
We apply an Adversarially Learned Anomaly Detection (ALAD) algorithm to the
problem of detecting new physics processes in proton-proton collisions at the
Large Hadron Collider. Anomaly detection based on ALAD matches performances
reached by Variational Autoencoders, with a substantial improvement in some
cases. Training the ALAD algorithm on 4.4 fb-1 of 8 TeV CMS Open Data, we show
how a data-driven anomaly detection and characterization would work in real
life, re-discovering the top quark by identifying the main features of the
t-tbar experimental signature at the LHC.Comment: 16 pages, 9 figure
Adversarially Learned Anomaly Detection on CMS open data: re-discovering the top quark
We apply an Adversarially Learned Anomaly Detection (ALAD) algorithm to the problem of detecting new physics processes in protonâproton collisions at the Large Hadron Collider. Anomaly detection based on ALAD matches performances reached by Variational Autoencoders, with a substantial improvement in some cases. Training the ALAD algorithm on 4.4 fbâ»Âč of 8 TeV CMS Open Data, we show how a data-driven anomaly detection and characterization would work in real life, re-discovering the top quark by identifying the main features of the ttÌ experimental signature at the LHC
Nanosecond anomaly detection with decision trees for high energy physics and real-time application to exotic Higgs decays
We present a novel implementation of the artificial intelligence autoencoding
algorithm, used as an ultrafast and ultraefficient anomaly detector, built with
a forest of deep decision trees on FPGA, field programmable gate arrays.
Scenarios at the Large Hadron Collider at CERN are considered, for which the
autoencoder is trained using known physical processes of the Standard Model.
The design is then deployed in real-time trigger systems for anomaly detection
of new unknown physical processes, such as the detection of exotic Higgs
decays, on events that fail conventional threshold-based algorithms. The
inference is made within a latency value of 25 ns, the time between successive
collisions at the Large Hadron Collider, at percent-level resource usage. Our
method offers anomaly detection at the lowest latency values for edge AI users
with tight resource constraints.Comment: 26 pages, 9 figures, 1 tabl
Anomaly Detection in Collider Physics via Factorized Observables
To maximize the discovery potential of high-energy colliders, experimental
searches should be sensitive to unforeseen new physics scenarios. This goal has
motivated the use of machine learning for unsupervised anomaly detection. In
this paper, we introduce a new anomaly detection strategy called FORCE:
factorized observables for regressing conditional expectations. Our approach is
based on the inductive bias of factorization, which is the idea that the
physics governing different energy scales can be treated as approximately
independent. Assuming factorization holds separately for signal and background
processes, the appearance of non-trivial correlations between low- and
high-energy observables is a robust indicator of new physics. Under the most
restrictive form of factorization, a machine-learned model trained to identify
such correlations will in fact converge to the optimal new physics classifier.
We test FORCE on a benchmark anomaly detection task for the Large Hadron
Collider involving collimated sprays of particles called jets. By teasing out
correlations between the kinematics and substructure of jets, our method can
reliably extract percent-level signal fractions. This strategy for uncovering
new physics adds to the growing toolbox of anomaly detection methods for
collider physics with a complementary set of assumptions.Comment: 5+11 pages, 3+7 figure
Benchmark data and model independent event classification for the large hadron collider
We describe the outcome of a data challenge conducted as part of the Dark Machines (https://www.darkmachines.org) initiative and the Les Houches 2019 workshop on Physics at TeV colliders. The challenged aims to detect signals of new physics at the Large Hadron Collider (LHC) using unsupervised machine learning algorithms. First, we propose how an anomaly score could be implemented to define model-independent signal regions in LHC searches. We define and describe a large benchmark dataset, consisting of > 1 billion simulated LHC events corresponding to 10 fbâ1 of proton-proton collisions at a center-of-mass energy of 13 TeV. We then review a wide range of anomaly detection and density estimation algorithms, developed in the context of the data challenge, and we measure their performance in a set of realistic analysis environments. We draw a number of useful conclusions that will aid the development of unsupervised new physics searches during the third run of the LHC, and provide our benchmark dataset for future studies at https://www.phenoMLdata.org. Code to reproduce the analysis is provided at https://github.com/bostdiek/DarkMachines-UnsupervisedChallenge
Anomalies, representations, and self-supervision
We develop a self-supervised method for density-based anomaly detection using contrastive learning, and test it using event-level anomaly data from CMS ADC2021. The AnomalyCLR technique is data-driven and uses augmentations of the background data to mimic non-Standard-Model events in a model-agnostic way. It uses a permutation-invariant Transformer Encoder architecture to map the objects measured in a collider event to the representation space, where the data augmentations define a representation space which is sensitive to potential anomalous features. An AutoEncoder trained on background representations then computes anomaly scores for a variety of signals in the representation space. With AnomalyCLR we find significant improvements on performance metrics for all signals when compared to the raw data baseline
- âŠ