Search CORE

10 research outputs found

Improving efficiency of analysis jobs in CMS

Author: Balcas Justas
Belforte Stefano
Bockelman Brian Paul
Ciangottini Diego
Cristella Leonardo
Davila Foyo Diego
Hernández José M.
Hurtado Anampa Kenyi
Ivanov Todor Trendafilov
Letts James
Mascheroni Marco
Pérez-Calero Yzquierdo Antonio
Wolf Matthias
Woodard Anna Elizabeth
Publication venue: 'EDP Sciences'
Publication date: 17/09/2019
Field of study

Hundreds of physicists analyze data collected by the Compact Muon Solenoid (CMS) experiment at the Large Hadron Collider using the CMS Remote Analysis Builder and the CMS global pool to exploit the resources of the Worldwide LHC Computing Grid. Efficient use of such an extensive and expensive resource is crucial. At the same time, the CMS collaboration is committed to minimizing time to insight for every scientist, by pushing for fewer possible access restrictions to the full data sample and supports the free choice of applications to run on the computing resources. Supporting such variety of workflows while preserving efficient resource usage poses special challenges. In this paper we report on three complementary approaches adopted in CMS to improve the scheduling efficiency of user analysis jobs: automatic job splitting, automated run time estimates and automated site selection for jobs

Improving the Scheduling Efficiency of a Global Multi-Core HTCondor Pool in CMS

Author: Bockelman Brian Paul
Foyo Diego Davila
Hurtado Anampa Kenyi
Ivanov Todor Trendafilov
Khan Farrukh Aftab
Kotobi Amjad
Larson Krista
Letts James
Mascheroni Marco
Mason David
Pérez-Calero Yzquierdo Antonio
Publication venue: 'EDP Sciences'
Publication date: 01/01/2019
Field of study

Scheduling multi-core workflows in a global HTCondor pool is a multi-dimensional problem whose solution depends on the requirements of the job payloads, the characteristics of available resources, and the boundary conditions such as fair share and prioritization imposed on the job matching to resources. Within the context of a dedicated task force, CMS has increased significantly the scheduling efficiency of workflows in reusable multi-core pilots by various improvements to the limitations of the GlideinWMS pilots, accuracy of resource requests, efficiency and speed of the HTCondor infrastructure, and job matching algorithms

Directory of Open Access Journals

Producing Madgraph5_aMC@NLO gridpacks and using TensorFlow GPU resources in the CMS HTCondor Global Pool

Author: Antonio Perez-Calero Yzquierdo
Brian Paul Bockelman
David Mason
Diego Davila Foyo
Edgar Fajardo Hernandez
Farrukh Aftab Khan
James Letts
Kenyi Hurtado Anampa
Krista Larson
Marco Mascheroni
Todor Trendafilovz Ivanov
Publication venue: 'EDP Sciences'
Publication date: 17/09/2019
Field of study

The CMS experiment has an HTCondor Global Pool, composed of more than 200K CPU cores available for Monte Carlo production and the analysis of da.The submission of user jobs to this pool is handled by either CRAB, the standard workflow management tool used by CMS users to submit analysis jobs requiring event processing of large amounts of data, or by CMS Connect, a service focused on final stage condor-like analysis jobs and applications that already have a workflow job manager in place. The latest scenario canbring cases in which workflows need further adjustments in order to efficiently work in a globally distributed pool of resources. For instance, the generation of matrix elements for high energy physics processes via Madgraph5_aMC@NLO and the usage of tools not (yet) fully supported by the CMS software, such as Ten-sorFlow with GPUsupport, are tasks with particular requirements. A special adaption, either at the pool factory level (advertising GPU resources) or at the execute level (e.g: to handle special parameters that describe certain needs for the remote execute nodes during submission) is needed in order to adequately work in the CMS global pool. This contribution describes the challenges and efforts performed towards adaptingsuch workflows so they can properly profit from the Global Pool via CMS Connect

EDP Sciences OAI-PMH repository (1.2.0)

Improving efficiency of analysis jobs in CMS

Author: Anna Elizabeth Woodard
Antonio Pérez-Calero Yzquierdo
Brian Paul Bockelman
Diego Ciangottini
Diego Davila Foyo
James Letts
José M. Hernández
Justas Balcas
Kenyi Hurtado Anampa
Leonardo Cristella
Marco Mascheroni
Matthias Wolf
Stefano Belforte
Todor Trendafilov Ivanov
Publication venue: 'EDP Sciences'
Publication date: 17/09/2019
Field of study

EDP Sciences OAI-PMH repository (1.2.0)

Caltech Authors

Producing Madgraph5_aMC@NLO gridpacks and using TensorFlow GPU resources in the CMS HTCondor Global Pool

Author: Aftab Khan Farrukh
Bockelman Brian Paul
Davila Foyo Diego
Fajardo Hernandez Edgar
Hurtado Anampa Kenyi
Larson Krista
Letts James
Mascheroni Marco
Mason David
Perez-Calero Yzquierdo Antonio
Trendafilovz Ivanov Todor
Publication venue: 'EDP Sciences'
Publication date: 01/01/2019
Field of study

Directory of Open Access Journals

Improving efficiency of analysis jobs in CMS

Author: Anampa Kenyi Hurtado
Balcas Justas
Belforte Stefano
Bockelman Brian Paul
Ciangottini Diego
Cristella Leonardo
Foyo Diego Davila
Hernández José M.
Ivanov Todor Trendafilov
Letts James
Mascheroni Marco
Wolf Matthias
Woodard Anna Elizabeth
Yzquierdo Antonio Pérez-Calero
Publication venue: 'EDP Sciences'
Publication date: 01/01/2019
Field of study

Directory of Open Access Journals

Evolution of the CMS Global Submission Infrastructure for the HL-LHC Era

Author: Antonio Pérez-Calero Yzquierdo
David Mason
Diego Davila Foyo
Edita Kizinevič
Farrukh Aftab Khan
James Letts
Kenyi Hurtado Anampa
Krista Larson
Marco Mascheroni
Maria Acosta Flechas
Saqib Haleem
Todor Trendafilov Ivanov
Publication venue: 'EDP Sciences'
Publication date: 10/02/2020
Field of study

Efforts in distributed computing of the CMS experiment at the LHC at CERN are now focusing on the functionality required to fulfill the projected needs for the HL-LHC era. Cloud and HPC resources are expected to be dominant relative to resources provided by traditional Grid sites, being also much more diverse and heterogeneous. Handling their special capabilities or limitations and maintaining global flexibility and efficiency, while also operating at scales much higher than the current capacity, are the major challenges being addressed by the CMS Submission Infrastructure team. These proceedings discuss the risks to the stability and scalability of the CMS HTCondor infrastructure extrapolated to such a scenario, thought to be derived mostly from its growing complexity, with multiple Negotiators and schedulers flocking work to multiple federated pools. New mechanisms for enhanced customization and control over resource allocation and usage, mandatory in this future scenario, are also described

EDP Sciences OAI-PMH repository (1.2.0)

CERN Document Server

Evolution of the CMS Global Submission Infrastructure for the HL-LHC Era

Author: Acosta Flechas Maria
Davila Foyo Diego
Haleem Saqib
Hurtado Anampa Kenyi
Ivanov Todor Trendafilov
Khan Farrukh Aftab
Kizinevič Edita
Larson Krista
Letts James
Mascheroni Marco
Mason David
Pérez-Calero Yzquierdo Antonio
Publication venue: 'EDP Sciences'
Publication date: 01/01/2020
Field of study

Directory of Open Access Journals