Search CORE

1,374 research outputs found

Using LSTM recurrent neural networks for monitoring the LHC superconducting magnets

Author: Mertik Matej
Skoczeń Andrzej
Wielgosz Maciej
Publication venue: 'Elsevier BV'
Publication date: 18/11/2016
Field of study

The superconducting LHC magnets are coupled with an electronic monitoring system which records and analyses voltage time series reflecting their performance. A currently used system is based on a range of preprogrammed triggers which launches protection procedures when a misbehavior of the magnets is detected. All the procedures used in the protection equipment were designed and implemented according to known working scenarios of the system and are updated and monitored by human operators. This paper proposes a novel approach to monitoring and fault protection of the Large Hadron Collider (LHC) superconducting magnets which employs state-of-the-art Deep Learning algorithms. Consequently, the authors of the paper decided to examine the performance of LSTM recurrent neural networks for modeling of voltage time series of the magnets. In order to address this challenging task different network architectures and hyper-parameters were used to achieve the best possible performance of the solution. The regression results were measured in terms of RMSE for different number of future steps and history length taken into account for the prediction. The best result of RMSE=0.00104 was obtained for a network of 128 LSTM cells within the internal layer and 16 steps history buffer

arXiv.org e-Print Archive

CERN Document Server

Learning to Classify from Impure Samples with High-Dimensional Data

Author: Komiske Patrick T.
Metodiev Eric M.
Nachman Benjamin
Schwartz Matthew D.
Publication venue: 'American Physical Society (APS)'
Publication date: 01/07/2018
Field of study

A persistent challenge in practical classification tasks is that labeled training sets are not always available. In particle physics, this challenge is surmounted by the use of simulations. These simulations accurately reproduce most features of data, but cannot be trusted to capture all of the complex correlations exploitable by modern machine learning methods. Recent work in weakly supervised learning has shown that simple, low-dimensional classifiers can be trained using only the impure mixtures present in data. Here, we demonstrate that complex, high-dimensional classifiers can also be trained on impure mixtures using weak supervision techniques, with performance comparable to what could be achieved with pure samples. Using weak supervision will therefore allow us to avoid relying exclusively on simulations for high-dimensional classification. This work opens the door to a new regime whereby complex models are trained directly on data, providing direct access to probe the underlying physics.Comment: 6 pages, 2 tables, 2 figures. v2: updated to match PRD versio

arXiv.org e-Print Archive

DSpace@MIT

eScholarship - University of California

The model of an anomaly detector for HiLumi LHC magnets based on Recurrent Neural Networks and adaptive quantization

Author: De Matteis Ernesto
Mertik Matej
Skoczeń Andrzej
Wielgosz Maciej
Publication venue: 'Elsevier BV'
Publication date: 28/09/2017
Field of study

This paper focuses on an examination of an applicability of Recurrent Neural Network models for detecting anomalous behavior of the CERN superconducting magnets. In order to conduct the experiments, the authors designed and implemented an adaptive signal quantization algorithm and a custom GRU-based detector and developed a method for the detector parameters selection. Three different datasets were used for testing the detector. Two artificially generated datasets were used to assess the raw performance of the system whereas the 231 MB dataset composed of the signals acquired from HiLumi magnets was intended for real-life experiments and model training. Several different setups of the developed anomaly detection system were evaluated and compared with state-of-the-art OC-SVM reference model operating on the same data. The OC-SVM model was equipped with a rich set of feature extractors accounting for a range of the input signal properties. It was determined in the course of the experiments that the detector, along with its supporting design methodology, reaches F1 equal or very close to 1 for almost all test sets. Due to the profile of the data, the best_length setup of the detector turned out to perform the best among all five tested configuration schemes of the detection system. The quantization parameters have the biggest impact on the overall performance of the detector with the best values of input/output grid equal to 16 and 8, respectively. The proposed solution of the detection significantly outperformed OC-SVM-based detector in most of the cases, with much more stable performance across all the datasets.Comment: Related to arXiv:1702.0083

arXiv.org e-Print Archive

CERN Document Server

Pulling Out All the Tops with Computer Vision and Deep Learning

Author: Macaluso Sebastian
Shih David
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 28/02/2018
Field of study

We apply computer vision with deep learning -- in the form of a convolutional neural network (CNN) -- to build a highly effective boosted top tagger. Previous work (the "DeepTop" tagger of Kasieczka et al) has shown that a CNN-based top tagger can achieve comparable performance to state-of-the-art conventional top taggers based on high-level inputs. Here, we introduce a number of improvements to the DeepTop tagger, including architecture, training, image preprocessing, sample size and color pixels. Our final CNN top tagger outperforms BDTs based on high-level inputs by a factor of

\sim 2

--3 or more in background rejection, over a wide range of tagging efficiencies and fiducial jet selections. As reference points, we achieve a QCD background rejection factor of 500 (60) at 50\% top tagging efficiency for fully-merged (non-merged) top jets with

p_T

in the 800--900 GeV (350--450 GeV) range. Our CNN can also be straightforwardly extended to the classification of other types of jets, and the lessons learned here may be useful to others designing their own deep NNs for LHC applications.Comment: 33 pages, 11 figure

arXiv.org e-Print Archive

Directory of Open Access Journals

The Machine Learning Landscape of Top Taggers

Author: Butter A.
Cranmer K.
Debnath D.
Dillon B. M.
Fairbairn M.
Faroughy D. A.
Fedorko W.
Gay C.
Gouskos L.
Kamenik J. F.
Kasieczka G.
Komiske P. T.
Leiss S.
Lister A.
Macaluso S.
Metodiev E. M.
Moore L.
Nachman B.
Nordstrom K.
Pearkes J.
Plehn T.
Qu H.
Rath Y.
Rieger M.
Shih D.
Thompson J. M.
Varma S.
Publication venue: 'Stichting SciPost'
Publication date: 01/01/2019
Field of study

Based on the established task of identifying boosted, hadronically decaying top quarks, we compare a wide range of modern machine learning approaches. Unlike most established methods they rely on low-level input, for instance calorimeter output. While their network architectures are vastly different, their performance is comparatively similar. In general, we find that these new approaches are extremely powerful and great fun.Comment: Yet another tagger included

arXiv.org e-Print Archive

HAL Descartes

Publikationsserver der RWTH Aachen University

Hal-Diderot

Identifying WIMP dark matter from particle and astroparticle data

Author: Bertone Gianfranco
Bozorgnia Nassim
de Austri Roberto Ruiz
Kim Jong Soo
Liem Sebastian
McCabe Christopher
Otten Sydney
Publication venue: 'IOP Publishing'
Publication date: 01/01/2018
Field of study

One of the most promising strategies to identify the nature of dark matter consists in the search for new particles at accelerators and with so-called direct detection experiments. Working within the framework of simplified models, and making use of machine learning tools to speed up statistical inference, we address the question of what we can learn about dark matter from a detection at the LHC and a forthcoming direct detection experiment. We show that with a combination of accelerator and direct detection data, it is possible to identify newly discovered particles as dark matter, by reconstructing their relic density assuming they are weakly interacting massive particles (WIMPs) thermally produced in the early Universe, and demonstrating that it is consistent with the measured dark matter abundance. An inconsistency between these two quantities would instead point either towards additional physics in the dark sector, or towards a non-standard cosmology, with a thermal history substantially different from that of the standard cosmological model.Comment: 24 pages (+21 pages of appendices and references) and 14 figures. v2: Updated to match JCAP version; includes minor clarifications in text and updated reference

arXiv.org e-Print Archive

Publikationsserver der RWTH Aachen University

King's Research Portal

International Migration, Integration and Social Cohesion online publications

UvA-DARE

Study of energy deposition patterns in hadron calorimeter for prompt and displaced jets using convolutional neural network

Author: Bhattacherjee Biplob
Mukherjee Swagata
Sengupta Rhitaja
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2019
Field of study

Sophisticated machine learning techniques have promising potential in search for physics beyond Standard Model in Large Hadron Collider (LHC). Convolutional neural networks (CNN) can provide powerful tools for differentiating between patterns of calorimeter energy deposits by prompt particles of Standard Model and long-lived particles predicted in various models beyond the Standard Model. We demonstrate the usefulness of CNN by using a couple of physics examples from well motivated BSM scenarios predicting long-lived particles giving rise to displaced jets. Our work suggests that modern machine-learning techniques have potential to discriminate between energy deposition patterns of prompt and long-lived particles, and thus, they can be useful tools in such searches.Comment: 32 pages, 17 figures; version accepted for publication in JHE

arXiv.org e-Print Archive

Open Access Repository of IISc Research Publications

Publikationsserver der RWTH Aachen University