1,374 research outputs found
Using LSTM recurrent neural networks for monitoring the LHC superconducting magnets
The superconducting LHC magnets are coupled with an electronic monitoring
system which records and analyses voltage time series reflecting their
performance. A currently used system is based on a range of preprogrammed
triggers which launches protection procedures when a misbehavior of the magnets
is detected. All the procedures used in the protection equipment were designed
and implemented according to known working scenarios of the system and are
updated and monitored by human operators.
This paper proposes a novel approach to monitoring and fault protection of
the Large Hadron Collider (LHC) superconducting magnets which employs
state-of-the-art Deep Learning algorithms. Consequently, the authors of the
paper decided to examine the performance of LSTM recurrent neural networks for
modeling of voltage time series of the magnets. In order to address this
challenging task different network architectures and hyper-parameters were used
to achieve the best possible performance of the solution. The regression
results were measured in terms of RMSE for different number of future steps and
history length taken into account for the prediction. The best result of
RMSE=0.00104 was obtained for a network of 128 LSTM cells within the internal
layer and 16 steps history buffer
Learning to Classify from Impure Samples with High-Dimensional Data
A persistent challenge in practical classification tasks is that labeled
training sets are not always available. In particle physics, this challenge is
surmounted by the use of simulations. These simulations accurately reproduce
most features of data, but cannot be trusted to capture all of the complex
correlations exploitable by modern machine learning methods. Recent work in
weakly supervised learning has shown that simple, low-dimensional classifiers
can be trained using only the impure mixtures present in data. Here, we
demonstrate that complex, high-dimensional classifiers can also be trained on
impure mixtures using weak supervision techniques, with performance comparable
to what could be achieved with pure samples. Using weak supervision will
therefore allow us to avoid relying exclusively on simulations for
high-dimensional classification. This work opens the door to a new regime
whereby complex models are trained directly on data, providing direct access to
probe the underlying physics.Comment: 6 pages, 2 tables, 2 figures. v2: updated to match PRD versio
The model of an anomaly detector for HiLumi LHC magnets based on Recurrent Neural Networks and adaptive quantization
This paper focuses on an examination of an applicability of Recurrent Neural
Network models for detecting anomalous behavior of the CERN superconducting
magnets. In order to conduct the experiments, the authors designed and
implemented an adaptive signal quantization algorithm and a custom GRU-based
detector and developed a method for the detector parameters selection. Three
different datasets were used for testing the detector. Two artificially
generated datasets were used to assess the raw performance of the system
whereas the 231 MB dataset composed of the signals acquired from HiLumi magnets
was intended for real-life experiments and model training. Several different
setups of the developed anomaly detection system were evaluated and compared
with state-of-the-art OC-SVM reference model operating on the same data. The
OC-SVM model was equipped with a rich set of feature extractors accounting for
a range of the input signal properties. It was determined in the course of the
experiments that the detector, along with its supporting design methodology,
reaches F1 equal or very close to 1 for almost all test sets. Due to the
profile of the data, the best_length setup of the detector turned out to
perform the best among all five tested configuration schemes of the detection
system. The quantization parameters have the biggest impact on the overall
performance of the detector with the best values of input/output grid equal to
16 and 8, respectively. The proposed solution of the detection significantly
outperformed OC-SVM-based detector in most of the cases, with much more stable
performance across all the datasets.Comment: Related to arXiv:1702.0083
Pulling Out All the Tops with Computer Vision and Deep Learning
We apply computer vision with deep learning -- in the form of a convolutional
neural network (CNN) -- to build a highly effective boosted top tagger.
Previous work (the "DeepTop" tagger of Kasieczka et al) has shown that a
CNN-based top tagger can achieve comparable performance to state-of-the-art
conventional top taggers based on high-level inputs. Here, we introduce a
number of improvements to the DeepTop tagger, including architecture, training,
image preprocessing, sample size and color pixels. Our final CNN top tagger
outperforms BDTs based on high-level inputs by a factor of --3 or more
in background rejection, over a wide range of tagging efficiencies and fiducial
jet selections. As reference points, we achieve a QCD background rejection
factor of 500 (60) at 50\% top tagging efficiency for fully-merged (non-merged)
top jets with in the 800--900 GeV (350--450 GeV) range. Our CNN can also
be straightforwardly extended to the classification of other types of jets, and
the lessons learned here may be useful to others designing their own deep NNs
for LHC applications.Comment: 33 pages, 11 figure
The Machine Learning Landscape of Top Taggers
Based on the established task of identifying boosted, hadronically decaying
top quarks, we compare a wide range of modern machine learning approaches.
Unlike most established methods they rely on low-level input, for instance
calorimeter output. While their network architectures are vastly different,
their performance is comparatively similar. In general, we find that these new
approaches are extremely powerful and great fun.Comment: Yet another tagger included
Identifying WIMP dark matter from particle and astroparticle data
One of the most promising strategies to identify the nature of dark matter
consists in the search for new particles at accelerators and with so-called
direct detection experiments. Working within the framework of simplified
models, and making use of machine learning tools to speed up statistical
inference, we address the question of what we can learn about dark matter from
a detection at the LHC and a forthcoming direct detection experiment. We show
that with a combination of accelerator and direct detection data, it is
possible to identify newly discovered particles as dark matter, by
reconstructing their relic density assuming they are weakly interacting massive
particles (WIMPs) thermally produced in the early Universe, and demonstrating
that it is consistent with the measured dark matter abundance. An inconsistency
between these two quantities would instead point either towards additional
physics in the dark sector, or towards a non-standard cosmology, with a thermal
history substantially different from that of the standard cosmological model.Comment: 24 pages (+21 pages of appendices and references) and 14 figures. v2:
Updated to match JCAP version; includes minor clarifications in text and
updated reference
Study of energy deposition patterns in hadron calorimeter for prompt and displaced jets using convolutional neural network
Sophisticated machine learning techniques have promising potential in search
for physics beyond Standard Model in Large Hadron Collider (LHC). Convolutional
neural networks (CNN) can provide powerful tools for differentiating between
patterns of calorimeter energy deposits by prompt particles of Standard Model
and long-lived particles predicted in various models beyond the Standard Model.
We demonstrate the usefulness of CNN by using a couple of physics examples from
well motivated BSM scenarios predicting long-lived particles giving rise to
displaced jets. Our work suggests that modern machine-learning techniques have
potential to discriminate between energy deposition patterns of prompt and
long-lived particles, and thus, they can be useful tools in such searches.Comment: 32 pages, 17 figures; version accepted for publication in JHE
- …