49 research outputs found

    DIRAC - Distributed Infrastructure with Remote Agent Control

    Full text link
    This paper describes DIRAC, the LHCb Monte Carlo production system. DIRAC has a client/server architecture based on: Compute elements distributed among the collaborating institutes; Databases for production management, bookkeeping (the metadata catalogue) and software configuration; Monitoring and cataloguing services for updating and accessing the databases. Locally installed software agents implemented in Python monitor the local batch queue, interrogate the production database for any outstanding production requests using the XML-RPC protocol and initiate the job submission. The agent checks and, if necessary, installs any required software automatically. After the job has processed the events, the agent transfers the output data and updates the metadata catalogue. DIRAC has been successfully installed at 18 collaborating institutes, including the DataGRID, and has been used in recent Physics Data Challenges. In the near- to medium-term future we must use a mixed environment with different types of grid middleware or no middleware. We describe how this flexibility has been achieved and how ubiquitously available grid middleware would improve DIRAC. Comment: Talk from the 2003 Computing in High Energy and Nuclear Physics (CHEP03), La Jolla, Ca, USA, March 2003, 8 pages, Word, 5 figures. PSN TUAT00
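
    As a rough illustration of the agent cycle summarized above (watch the local batch queue, interrogate the production service over XML-RPC, install missing software, submit the job), the Python sketch below shows only the polling loop; the service endpoint, the remote method name (getProductionRequest) and the qsub-based submission are hypothetical stand-ins, not the actual DIRAC interfaces.

    # Minimal sketch of a DIRAC-style production agent. All endpoint and method
    # names are illustrative assumptions, not the real DIRAC production service API.
    import subprocess
    import time
    import xmlrpc.client

    PRODUCTION_SERVICE = "https://production.example.org:8443/RPC"  # hypothetical endpoint

    def count_free_slots():
        # Placeholder: a real agent would parse the local batch system status here.
        return 1

    def ensure_software(application, version):
        # Placeholder: check the shared software area and install the release if missing.
        pass

    def run_agent(poll_interval=300):
        proxy = xmlrpc.client.ServerProxy(PRODUCTION_SERVICE)
        while True:
            free_slots = count_free_slots()
            if free_slots > 0:
                # Ask the production database for an outstanding request sized to the free slots.
                request = proxy.getProductionRequest(free_slots)
                if request:
                    ensure_software(request["application"], request["version"])
                    # Hand the payload to the local batch system (PBS-style submission assumed).
                    subprocess.run(["qsub", request["job_script"]], check=True)
            time.sleep(poll_interval)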

    DIRAC framework evaluation for the Fermi-LAT and CTA experiments

    Full text link
    DIRAC (Distributed Infrastructure with Remote Agent Control) is a general framework for the management of tasks over distributed heterogeneous computing environments. It was originally developed to support the production activities of the LHCb (Large Hadron Collider Beauty) experiment and today is extensively used by several particle physics and biology communities. Current (Fermi Large Area Telescope -- LAT) and planned (Cherenkov Telescope Array -- CTA) new generation astrophysical/cosmological experiments, with very large processing and storage needs, are currently investigating the usability of DIRAC in this context. Each of these use cases has some peculiarities: Fermi-LAT will interface DIRAC to its own workflow system to allow access to the grid resources, while CTA is using DIRAC as the workflow management system for Monte Carlo production and analysis on the grid. We describe the prototype effort that we led toward deploying a DIRAC solution for some aspects of Fermi-LAT and CTA needs. Comment: proceedings to CHEP 2013 conference: http://www.chep2013.org
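
    To illustrate the kind of interfacing discussed above, the sketch below describes and submits a simple job through DIRAC's Python job API, the usual entry point for an experiment workflow system handing tasks to the grid; the executable, sandbox contents and CPU time are placeholders, and initialization and method names can differ between DIRAC releases.

    # Minimal job submission sketch using the DIRAC job API (details may vary by release).
    from DIRAC.Core.Base import Script
    Script.parseCommandLine()  # initialize the DIRAC environment (release-dependent)

    from DIRAC.Interfaces.API.Dirac import Dirac
    from DIRAC.Interfaces.API.Job import Job

    job = Job()
    job.setName("mc_test_job")                                    # placeholder job name
    job.setCPUTime(3600)                                          # requested CPU time in seconds
    job.setExecutable("simulate.sh", arguments="--events 1000")   # hypothetical payload script
    job.setInputSandbox(["simulate.sh"])
    job.setOutputSandbox(["std.out", "std.err"])

    result = Dirac().submitJob(job)   # older releases expose this as Dirac().submit(job)
    print(result)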

    Cherenkov Telescope Array Data Management

    Get PDF
    Very High Energy gamma-ray astronomy with the Cherenkov Telescope Array (CTA) is evolving towards the model of a public observatory. Handling, processing and archiving the large amount of data generated by the CTA instruments and delivering scientific products are some of the challenges in designing the CTA Data Management. The participation of scientists from within the CTA Consortium and from the greater worldwide scientific community necessitates a sophisticated scientific analysis system capable of providing unified and efficient user access to data, software and computing resources. Data Management is designed to respond to three main issues: (i) the treatment and flow of data from remote telescopes; (ii) "big-data" archiving and processing; and (iii) open data access. In this communication the overall technical design of the CTA Data Management, current major developments and prototypes are presented. Comment: 8 pages, 2 figures, in Proceedings of the 34th International Cosmic Ray Conference (ICRC2015), The Hague, The Netherlands. All CTA contributions at arXiv:1508.0589
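
    As a simplified example of the first two issues (data flow from remote sites and archiving), the sketch below uploads one raw data file to a grid storage element and registers it in the file catalogue using a DIRAC-style DataManager client; the logical file name, local path and storage element name are invented for illustration, and the client module path may differ between DIRAC releases.

    # Sketch of archiving one telescope data file on the grid; all names are placeholders.
    from DIRAC.Core.Base import Script
    Script.parseCommandLine()  # initialize the DIRAC environment (release-dependent)

    from DIRAC.DataManagementSystem.Client.DataManager import DataManager

    lfn = "/cta/raw/2015/run00042/events.fits"          # hypothetical logical file name
    local_path = "/data/onsite/run00042/events.fits"    # file produced at the telescope site
    storage_element = "CTA-ARCHIVE-SE"                  # hypothetical archive storage element

    result = DataManager().putAndRegister(lfn, local_path, storage_element)
    if result["OK"]:
        print("archived and registered:", lfn)
    else:
        print("transfer failed:", result["Message"])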

    Cloud flexibility using DIRAC interware

    Get PDF
    Communities of different locations are running their computing jobs on dedicated infrastructures without the need to worry about software, hardware or even the site where their programs are going to be executed. Nevertheless, this usually implies that they are restricted to certain types or versions of an operating system, because either their software needs a specific version of a system library or a specific platform is required by the collaboration to which they belong. In this scenario, if a data center wants to serve software to incompatible communities, it has to split its physical resources among those communities. This splitting inevitably leads to an underuse of resources, because the data center is bound to have periods where one or more of its subclusters are idle. It is in this situation that cloud computing provides the flexibility and reduction in computational cost that data centers are searching for. This paper describes a set of realistic tests that we ran on one such implementation. The tests comprise software from three different HEP communities (Auger, LHCb and QCD phenomenologists) and the Parsec benchmark suite, running on one or more of three Linux flavors (SL5, Ubuntu 10.04 and Fedora 13). The implemented infrastructure has, at the cloud level, CloudStack, which manages the virtual machines (VMs) and the hosts on which they run, and, at the user level, the DIRAC framework along with a VM extension that submits, monitors and keeps track of the user jobs and also requests CloudStack to start or stop the necessary VMs. In this infrastructure the community software is distributed via CernVM-FS, which has been proven to be a reliable and scalable software distribution system. With the resulting infrastructure, users can send their jobs transparently to the data center. The main purpose of this system is the creation of a flexible, multi-platform cluster with a scalable method of software distribution for several VOs. Users from different communities do not need to care about the installation of the standard software that is available at the nodes, nor about the operating system of the host machine, which is transparent to the user. This work was supported by projects FPA2007-66437-C02-01/02, assigned to UB, FPA2010-21885-C02-01/02 and TIN 2010-17541, assigned to USC, and FPA2007-66152-C02-01/02 and FPA2010-21816-C02-01/02, assigned to PICS.
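
    The start/stop logic that the DIRAC VM extension applies on top of CloudStack can be summarized, very schematically, as the loop below; the cloud and dirac_queue objects and their methods (waiting_jobs, running_vms, idle_vms, start_vm, stop_vm) are hypothetical placeholders for the real CloudStack and DIRAC interfaces.

    # Schematic elastic-scaling loop: grow the VM pool while jobs wait, shrink it when VMs idle.
    import time

    MAX_VMS = 50          # cap on simultaneously running virtual machines (illustrative)
    POLL_SECONDS = 120    # how often the loop re-evaluates the queue (illustrative)

    def elastic_loop(cloud, dirac_queue):
        while True:
            waiting = dirac_queue.waiting_jobs()      # jobs queued in the DIRAC WMS
            running = cloud.running_vms()
            if waiting > 0 and running < MAX_VMS:
                # Boot contextualized VMs (e.g. an SL5 image with CernVM-FS) for waiting payloads.
                for _ in range(min(waiting, MAX_VMS - running)):
                    cloud.start_vm(image="SL5-cernvmfs")   # image name is a placeholder
            for vm in cloud.idle_vms():
                # VMs whose job agents have drained are returned to the data center.
                cloud.stop_vm(vm)
            time.sleep(POLL_SECONDS)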

    Control and Monitoring of the Online Computer Farm for Offline Processing in LHCb

    No full text
    ISBN 978-3-95450-139-7 - http://accelconf.web.cern.ch/AccelConf/ICALEPCS2013/papers/tuppc063.pdf
    LHCb, one of the 4 experiments at the LHC accelerator at CERN, uses approximately 1500 PCs (averaging 12 cores each) for processing the High Level Trigger (HLT) during physics data taking. During periods when data acquisition is not required, most of these PCs are idle. In these periods it is possible to profit from the unused processing capacity to run offline jobs, such as Monte Carlo simulation. The LHCb offline computing environment is based on LHCbDIRAC (Distributed Infrastructure with Remote Agent Control). In LHCbDIRAC, job agents are started on Worker Nodes, pull waiting tasks from the central WMS (Workload Management System) and process them on the available resources. A Control System was developed which is able to launch, control and monitor the job agents for the offline data processing on the HLT Farm. This control system is based on the existing Online System Control infrastructure, the PVSS SCADA and the FSM toolkit. It has been used extensively, launching and monitoring 22,000+ agents simultaneously, and more than 850,000 jobs have already been processed in the HLT Farm. This paper describes the deployment of and experience with the Control System in the LHCb experiment.
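
    The real control system is built on the PVSS SCADA and the FSM toolkit rather than on Python, but the launch/drain cycle it applies to each farm node can be pictured with the toy finite-state machine below; the states, transitions and node name are simplified assumptions for illustration only.

    # Toy finite-state machine for one HLT node's offline job agent (illustrative only).
    from enum import Enum

    class AgentState(Enum):
        IDLE = "IDLE"          # node reserved for the HLT, no offline agent running
        RUNNING = "RUNNING"    # LHCbDIRAC job agent processing offline payloads
        STOPPING = "STOPPING"  # agent draining its current payload before release

    class NodeAgentFSM:
        def __init__(self, node):
            self.node = node
            self.state = AgentState.IDLE

        def start(self):
            # Allowed only while data taking does not need the node.
            if self.state is AgentState.IDLE:
                print(f"{self.node}: launching LHCbDIRAC job agent")
                self.state = AgentState.RUNNING

        def stop(self):
            # Graceful drain: finish the current payload, then hand the node back to the HLT.
            if self.state is AgentState.RUNNING:
                print(f"{self.node}: draining job agent")
                self.state = AgentState.STOPPING

        def on_agent_exit(self):
            if self.state is AgentState.STOPPING:
                self.state = AgentState.IDLE

    # Example: fsm = NodeAgentFSM("hltnode0042"); fsm.start(); fsm.stop(); fsm.on_agent_exit()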