5 research outputs found
Distributed statistical inference with pyhf enabled through funcX
In High Energy Physics facilities that provide High Performance Computing
environments provide an opportunity to efficiently perform the statistical
inference required for analysis of data from the Large Hadron Collider, but can
pose problems with orchestration and efficient scheduling. The compute
architectures at these facilities do not easily support the Python compute
model, and the configuration scheduling of batch jobs for physics often
requires expertise in multiple job scheduling services. The combination of the
pure-Python libraries pyhf and funcX reduces the common problem in HEP analyses
of performing statistical inference with binned models, that would
traditionally take multiple hours and bespoke scheduling, to an on-demand
(fitting) "function as a service" that can scalably execute across workers in
just a few minutes, offering reduced time to insight and inference. We
demonstrate execution of a scalable workflow using funcX to simultaneously fit
125 signal hypotheses from a published ATLAS search for new physics using pyhf
with a wall time of under 3 minutes. We additionally show performance
comparisons for other physics analyses with openly published probability models
and argue for a blueprint of fitting as a service systems at HPC centers.Comment: 9 pages, 1 figure, 2 listings, 1 table, submitted to the 25th
International Conference on Computing in High Energy & Nuclear Physic
Towards Real-World Applications of ServiceX, an Analysis Data Transformation System
One of the biggest challenges in the High-Luminosity LHC (HL- LHC) era will
be the significantly increased data size to be recorded and analyzed from the
collisions at the ATLAS and CMS experiments. ServiceX is a software R&D project
in the area of Data Organization, Management and Access of the IRIS- HEP to
investigate new computational models for the HL- LHC era. ServiceX is an
experiment-agnostic service to enable on-demand data delivery specifically
tailored for nearly-interactive vectorized analyses. It is capable of
retrieving data from grid sites, on-the-fly data transformation, and delivering
user-selected data in a variety of different formats. New features will be
presented that make the service ready for public use. An ongoing effort to
integrate ServiceX with a popular statistical analysis framework in ATLAS will
be described with an emphasis of a practical implementation of ServiceX into
the physics analysis pipeline.Comment: 8 pages, 3 figures, 2 listings, 1 table, submitted to the 25th
International Conference on Computing in High Energy & Nuclear Physic
Learning from the Pandemic: the Future of Meetings in HEP and Beyond
International audienceThe COVID-19 pandemic has by-and-large prevented in-person meetings since March 2020. While the increasing deployment of effective vaccines around the world is a very positive development, the timeline and pathway to "normality" is uncertain and the "new normal" we will settle into is anyone's guess. Particle physics, like many other scientific fields, has more than a year of experience in holding virtual meetings, workshops, and conferences. A great deal of experimentation and innovation to explore how to execute these meetings effectively has occurred. Therefore, it is an appropriate time to take stock of what we as a community learned from running virtual meetings and discuss possible strategies for the future. Continuing to develop effective strategies for meetings with a virtual component is likely to be important for reducing the carbon footprint of our research activities, while also enabling greater diversity and inclusion for participation. This report summarizes a virtual two-day workshop on Virtual Meetings held May 5-6, 2021 which brought together experts from both inside and outside of high-energy physics to share their experiences and practices with organizing and executing virtual workshops, and to develop possible strategies for future meetings as we begin to emerge from the COVID-19 pandemic. This report outlines some of the practices and tools that have worked well which we hope will serve as a valuable resource for future virtual meeting organizers in all scientific fields
Second Analysis Ecosystem Workshop Report
International audienceThe second workshop on the HEP Analysis Ecosystem took place 23-25 May 2022 at IJCLab in Orsay, to look at progress and continuing challenges in scaling up HEP analysis to meet the needs of HL-LHC and DUNE, as well as the very pressing needs of LHC Run 3 analysis. The workshop was themed around six particular topics, which were felt to capture key questions, opportunities and challenges. Each topic arranged a plenary session introduction, often with speakers summarising the state-of-the art and the next steps for analysis. This was then followed by parallel sessions, which were much more discussion focused, and where attendees could grapple with the challenges and propose solutions that could be tried. Where there was significant overlap between topics, a joint discussion between them was arranged. In the weeks following the workshop the session conveners wrote this document, which is a summary of the main discussions, the key points raised and the conclusions and outcomes. The document was circulated amongst the participants for comments before being finalised here
Second Analysis Ecosystem Workshop Report
International audienceThe second workshop on the HEP Analysis Ecosystem took place 23-25 May 2022 at IJCLab in Orsay, to look at progress and continuing challenges in scaling up HEP analysis to meet the needs of HL-LHC and DUNE, as well as the very pressing needs of LHC Run 3 analysis. The workshop was themed around six particular topics, which were felt to capture key questions, opportunities and challenges. Each topic arranged a plenary session introduction, often with speakers summarising the state-of-the art and the next steps for analysis. This was then followed by parallel sessions, which were much more discussion focused, and where attendees could grapple with the challenges and propose solutions that could be tried. Where there was significant overlap between topics, a joint discussion between them was arranged. In the weeks following the workshop the session conveners wrote this document, which is a summary of the main discussions, the key points raised and the conclusions and outcomes. The document was circulated amongst the participants for comments before being finalised here