Search CORE

7 research outputs found

Particle-based Fast Jet Simulation at the LHC with Variational Autoencoders

Author: Chernyavskaya Nadezda
Duarte Javier
Gunopulos Dimitrios
Kansal Raghav
Orzari Breno
Pierini Maurizio
Tomei Thiago
Touranakou Mary
Vlimant Jean-Roch
Publication venue: 'IOP Publishing'
Publication date: 01/03/2022
Field of study

We study how to use Deep Variational Autoencoders for a fast simulation of jets of particles at the LHC. We represent jets as a list of constituents, characterized by their momenta. Starting from a simulation of the jet before detector effects, we train a Deep Variational Autoencoder to return the corresponding list of constituents after detection. Doing so, we bypass both the time-consuming detector simulation and the collision reconstruction steps of a traditional processing chain, speeding up significantly the events generation workflow. Through model optimization and hyperparameter tuning, we achieve state-of-the-art precision on the jet four-momentum, while providing an accurate description of the constituents momenta, and an inference time comparable to that of a rule-based fast simulation.Comment: 11 pages, 8 figure

arXiv.org e-Print Archive

CERN Document Server

Sparse Data Generation for Particle-Based Simulation of Hadronic Jets in the LHC

Author: Duarte Javier
Gunopulos Dimitrios
Kansal Raghav
Orzari Breno
Pierini Maurizio
Tomei Thiago
Touranakou Mary
Vlimant Jean-Roch
Publication venue
Publication date: 30/09/2021
Field of study

We develop a generative neural network for the generation of sparse data in particle physics using a permutation-invariant and physics-informed loss function. The input dataset used in this study consists of the particle constituents of hadronic jets due to its sparsity and the possibility of evaluating the network's ability to accurately describe the particles and jets properties. A variational autoencoder composed of convolutional layers in the encoder and decoder is used as the generator. The loss function consists of a reconstruction error term and the Kullback-Leibler divergence between the output of the encoder and the latent vector variables. The permutation-invariant loss on the particles' properties is combined with two mean-squared error terms that measure the difference between input and output jets mass and transverse momentum, which improves the network's generation capability as it imposes physics constraints, allowing the model to learn the kinematics of the jets

arXiv.org e-Print Archive

CERN Document Server

Graph Generative Adversarial Networks for Sparse Data Generation in High Energy Physics

Author: Duarte Javier
Gunopulos Dimitrios
Kansal Raghav
Orzari Breno
Pierini Maurizio
Tomei Thiago
Touranakou Mary
Vlimant Jean-Roch
Publication venue
Publication date: 30/11/2020
Field of study

We develop a graph generative adversarial network to generate sparse data sets like those produced at the CERN Large Hadron Collider (LHC). We demonstrate this approach by training on and generating sparse representations of MNIST handwritten digit images and jets of particles in proton-proton collisions like those at the LHC. We find the model successfully generates sparse MNIST digits and particle jet data. We quantify agreement between real and generated data with a graph-based Fr\'echet Inception distance, and the particle and jet feature-level 1-Wasserstein distance for the MNIST and jet datasets respectively

arXiv.org e-Print Archive

CERN Document Server

LHC hadronic jet generation using convolutional variational autoencoders with normalizing flows

Author: Breno Orzari
Dimitrios Gunopulos
Javier Duarte
Jefferson Fialho
Mary Touranakou
Maurizio Pierini
Nadezda Chernyavskaya
Raghav Kansal
Raphael Cobe
Thiago Tomei
Publication venue: IOP Publishing
Publication date: 01/01/2023
Field of study

In high energy physics, one of the most important processes for collider data analysis is the comparison of collected and simulated data. Nowadays the state-of-the-art for data generation is in the form of Monte Carlo (MC) generators. However, because of the upcoming high-luminosity upgrade of the Large Hadron Collider (LHC), there will not be enough computational power or time to match the amount of needed simulated data using MC methods. An alternative approach under study is the usage of machine learning generative methods to fulfill that task. Since the most common final-state objects of high-energy proton collisions are hadronic jets, which are collections of particles collimated in a given region of space, this work aims to develop a convolutional variational autoencoder (ConVAE) for the generation of particle-based LHC hadronic jets. Given the ConVAE’s limitations, a normalizing flow (NF) network is coupled to it in a two-step training process, which shows improvements on the results for the generated jets. The ConVAE+NF network is capable of generating a jet in

18.30 \pm 0.04\,\,{\mu\text{s}}

, making it one of the fastest methods for this task up to now

Directory of Open Access Journals

CERN Document Server

Benchmark data and model independent event classification for the large hadron collider

We describe the outcome of a data challenge conducted as part of the Dark Machines (https://www.darkmachines.org) initiative and the Les Houches 2019 workshop on Physics at TeV colliders. The challenged aims to detect signals of new physics at the Large Hadron Collider (LHC) using unsupervised machine learning algorithms. First, we propose how an anomaly score could be implemented to define model-independent signal regions in LHC searches. We define and describe a large benchmark dataset, consisting of > 1 billion simulated LHC events corresponding to 10 fb−1 of proton-proton collisions at a center-of-mass energy of 13 TeV. We then review a wide range of anomaly detection and density estimation algorithms, developed in the context of the data challenge, and we measure their performance in a set of realistic analysis environments. We draw a number of useful conclusions that will aid the development of unsupervised new physics searches during the third run of the LHC, and provide our benchmark dataset for future studies at https://www.phenoMLdata.org. Code to reproduce the analysis is provided at https://github.com/bostdiek/DarkMachines-UnsupervisedChallenge

Lund University Publications