Cloud native approach for Machine Learning as a Service for High Energy Physics

Bonacorsi, Daniele; Giommi, Luca; Kuznetsov, Valentin; Paladino, Mattia; Spiga, Daniele

Publication date: 1 January 2022

Abstract

Machine Learning (ML) techniques are now widely adopted in many areas of High Energy Physics (HEP) and will certainly play a significant role in the upcoming High-Luminosity LHC (HL-LHC) upgrade foreseen at CERN. The LHC will produce a huge amount of data to be collected by the experiments, posing challenges at the exascale. Here, we present a Machine Learning as a Service solution for HEP (MLaaS4HEP) that performs an entire ML pipeline (reading data, processing data, training ML models, and serving predictions) in a completely model-agnostic fashion, directly using ROOT files of arbitrary size from local or distributed data sources. The new version of the MLaaS4HEP code, based on uproot4, provides new features that improve users’ experience with the framework and their workflows; for example, users can supply preprocessing operations to be applied to ROOT data before the ML pipeline starts. We then extend our approach to use local and cloud resources via an HTTP proxy, which allows physicists to submit their workflows using the HTTP protocol. We discuss how this pipeline could be enabled in the INFN Cloud Provider and what the final architecture could be.

Available Versions

Archivio istituzionale della ricerca - Alma Mater Studiorum Università di Bologna

oai:cris.unibo.it:11585/915095

Last time updated on 25/07/2023