Data handling in KLOE
Abstract: The KLOE experiment will acquire and manage petabytes of data. An efficient and easy-to-use system is essential to cope with this amount of data. In this paper a general overview of the approach chosen at KLOE is presented.
Measuring gravitational lensing of the cosmic microwave background using cross correlation with large scale structure
We cross correlate the gravitational lensing map extracted from cosmic microwave background measurements by the Wilkinson Microwave Anisotropy Probe (WMAP) with the radio galaxy distribution from the NRAO VLA Sky Survey (NVSS) using a quadratic estimator technique. We use the full covariance matrix to filter the data and calculate the cross-power spectra for the lensing-galaxy correlation. We explore the impact of changing the values of cosmological parameters on the lensing reconstruction and obtain the corresponding statistical detection significances. The results of all cross correlations pass the curl null test as well as a complementary diagnostic test using the NVSS data in equatorial coordinates. We forecast the potential for Planck and NVSS to constrain the lensing-galaxy cross correlation as well as the galaxy bias. The lensing-galaxy cross-power spectra are found to be Gaussian distributed. Comment: 16 pages, 10 figures
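As a rough illustration of the cross-power estimate at the heart of this analysis, the following is a minimal sketch, assuming a reconstructed lensing map and an NVSS galaxy overdensity map are already available as HEALPix maps; the file names, mask, and binning are placeholders, and the simple fsky correction stands in for the full covariance-matrix filtering described above.

```python
import numpy as np
import healpy as hp

# Hypothetical inputs: a CMB lensing reconstruction and a galaxy overdensity
# map, both HEALPix maps at the same resolution (file names are placeholders).
kappa = hp.read_map("lensing_reconstruction.fits")
galaxies = hp.read_map("nvss_overdensity.fits")
mask = hp.read_map("joint_mask.fits")

fsky = mask.mean()  # retained sky fraction

# Cross-power spectrum C_l^{kappa g}; dividing by fsky is only a crude
# correction for masking, not the covariance-matrix filtering used in the paper.
cl_cross = hp.anafast(kappa * mask, galaxies * mask, lmax=400) / fsky

# Band powers for plotting or a simple signal-to-noise estimate.
edges = np.arange(2, 401, 40)
band_powers = [cl_cross[lo:hi].mean() for lo, hi in zip(edges[:-1], edges[1:])]
print(band_powers)
```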
Flexible Session Management in a Distributed Environment
Many secure communication libraries used by distributed systems, such as SSL, TLS, and Kerberos, fail to make a clear distinction between the authentication, session, and communication layers. In this paper we introduce CEDAR, the secure communication library used by the Condor High Throughput Computing software, and present the advantages to a distributed computing system resulting from CEDAR's separation of these layers. Regardless of the authentication method used, CEDAR establishes a secure session key, which has the flexibility to be used for multiple capabilities. We demonstrate how a layered approach to security sessions can avoid round-trips and latency inherent in network authentication. The creation of a distinct session management layer allows for optimizations to improve scalability by way of delegating sessions to other components in the system. This session delegation creates a chain of trust that reduces the overhead of establishing secure connections and enables centralized enforcement of system-wide security policies. Additionally, secure channels based upon UDP datagrams are often overlooked by existing libraries; we show how CEDAR's structure accommodates this as well. As an example of the utility of this work, we show how the use of delegated security sessions and other techniques inherent in CEDAR's architecture enables US CMS to meet their scalability requirements in deploying Condor over large-scale, wide-area grid systems.
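To make the idea of session delegation concrete, here is a minimal sketch of how one component might hand an established session key to another component so that the recipient can reuse the secure session without repeating the authentication round-trips. The class and field names are illustrative assumptions, not CEDAR's actual API.

```python
import os
import hmac
import hashlib
from dataclasses import dataclass

@dataclass
class Session:
    """Illustrative session record; not CEDAR's actual data structure."""
    session_id: str
    key: bytes          # symmetric key established after authentication
    policy: str         # e.g. "integrity+encryption"
    lifetime_s: int

def establish_session(policy: str = "integrity+encryption") -> Session:
    # In a real system the key would come out of the authentication handshake
    # (Kerberos, GSI, password, ...); here we simply generate a random key.
    return Session(session_id=os.urandom(8).hex(),
                   key=os.urandom(32),
                   policy=policy,
                   lifetime_s=3600)

def delegate_session(session: Session) -> dict:
    # Serialize only what the delegate needs; the delegate can then reuse the
    # session key and skip authentication entirely.
    return {"session_id": session.session_id,
            "key": session.key.hex(),
            "policy": session.policy,
            "lifetime_s": session.lifetime_s}

def sign_message(session: Session, payload: bytes) -> bytes:
    # Any holder of the delegated key can authenticate messages on the session.
    return hmac.new(session.key, payload, hashlib.sha256).digest()

# Usage: a submit-side component establishes the session, then delegates it to
# another component that talks to the same remote peer on its behalf.
sess = establish_session()
token = delegate_session(sess)
mac = sign_message(sess, b"status update")
```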
Pseudo-interactive monitoring in distributed computing
Distributed computing, and in particular Grid computing, enables physicists to use thousands of CPU days worth of computing every day by submitting thousands of compute jobs. Unfortunately, a small fraction of such jobs regularly fail; the reasons vary from disk and network problems to bugs in the user code. A subset of these failures results in jobs being stuck for long periods of time. In order to debug such failures, interactive monitoring is highly desirable; users need to browse through the job log files and check the status of the running processes. Batch systems typically do not provide such services; at best, users get job logs at job termination, and even this may not be possible if the job is stuck in an infinite loop. In this paper we present a novel approach that uses the regular batch system capabilities of Condor to give users access to the logs and processes of any running job. This does not provide true interactive access, so commands like vi are not viable, but it does allow operations like ls, cat, top, ps, lsof, netstat and dumping the stack of any process owned by the user; we call this pseudo-interactive monitoring. It is worth noting that the same method can be used to monitor Grid jobs in a glidein-based environment. We further believe that the same mechanism could be applied to many other batch systems.
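The following is a minimal sketch of the kind of read-only diagnostic probe that could be shipped to a worker node through ordinary batch-submission mechanisms to realize this style of pseudo-interactive monitoring; the command list and output handling are assumptions for illustration, not the actual Condor tooling described in the paper.

```python
import os
import subprocess

# Read-only diagnostic commands of the kind listed in the abstract; since the
# probe runs as an ordinary batch job under the user's own identity, it can
# only inspect that user's processes and files.
COMMANDS = [
    ["ls", "-l", "."],
    ["ps", "-f", "-u", str(os.getuid())],
    ["top", "-b", "-n", "1"],
    ["netstat", "-tn"],
]

def run_probe() -> str:
    report = []
    for cmd in COMMANDS:
        try:
            proc = subprocess.run(cmd, capture_output=True, text=True, timeout=30)
            report.append(f"$ {' '.join(cmd)}\n{proc.stdout}{proc.stderr}")
        except (OSError, subprocess.TimeoutExpired) as exc:
            report.append(f"$ {' '.join(cmd)}\nfailed: {exc}")
    return "\n".join(report)

if __name__ == "__main__":
    # Writing to stdout lets the batch system return the snapshot in the job's
    # output file, which is what makes the monitoring "pseudo-interactive".
    print(run_probe())
```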
Use of glide-ins in CMS for production and analysis
With the evolution of various grid federations, Condor glide-ins represent a key feature in providing a homogeneous pool of resources using late-binding technology. The CMS collaboration uses the glide-in based Workload Management System, glideinWMS, for production (ProdAgent) and distributed analysis (CRAB) of the data. The Condor glide-in daemons, submitted via Condor-G, travel to the worker nodes. Once activated, they preserve the master-worker relationship: the glide-in first validates the execution environment on the worker node and then pulls jobs sequentially until its lifetime expires. The combination of late binding and validation significantly reduces the overall failure rate visible to CMS physicists. We discuss the extensive use of glideinWMS since the computing challenge CCRC-08 in order to prepare for the forthcoming LHC data-taking period. The key features essential to the success of large-scale production and analysis on CMS resources across major grid federations, including EGEE, OSG and NorduGrid, are outlined. Use of glide-ins via the CRAB server mechanism and ProdAgent, as well as first-hand experience of using the next-generation CREAM computing element within the CMS framework, is discussed.
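As a rough illustration of the late-binding pattern described above (not the actual glideinWMS implementation), a pilot process might first validate its execution environment and then pull user jobs until its lifetime expires; the checks and the queue interface below are placeholders.

```python
import shutil
import time

PILOT_LIFETIME_S = 6 * 3600   # illustrative pilot lifetime

def validate_environment() -> bool:
    # Example checks only; a real pilot would verify site-specific software,
    # scratch space, outbound connectivity, and so on.
    return shutil.which("python3") is not None and shutil.disk_usage(".").free > 1e9

def pull_next_job():
    # Placeholder for "ask the central queue for a matching user job";
    # returns None when no work is available.
    return None

def run_pilot():
    if not validate_environment():
        # Late binding: a bad worker node is rejected before any user job
        # lands on it, so the failure stays invisible to the physicist.
        return
    deadline = time.time() + PILOT_LIFETIME_S
    while time.time() < deadline:
        job = pull_next_job()
        if job is None:
            time.sleep(60)          # idle until new work appears
            continue
        job.run()                   # execute the user payloads sequentially
```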
CDF experience with Monte Carlo production using the LCG Grid
The upgrades of the Tevatron collider and the CDF detector have considerably increased the demand on computing resources, in particular for Monte Carlo production. This has forced the collaboration to move beyond the usage of dedicated resources and start exploiting the Grid. The CDF Analysis Farm (CAF) model has been reimplemented as LcgCAF in order to access Grid resources using the LCG/EGEE middleware. Many sites in Italy and in Europe are accessed through this portal by CDF users, mainly to produce Monte Carlo data but also for other analysis jobs. We review here the setup used to submit jobs to Grid sites and retrieve the output, including the CDF-specific configuration of some Grid components. We also describe the batch and interactive monitoring tools developed to allow users to verify job status during the jobs' lifetime in the Grid environment. Finally, we analyze the efficiency and typical failure modes of the current Grid infrastructure, reporting the performance of the different parts of the system.
CMS@home: Integrating the Volunteer Cloud and High-Throughput Computing
Volunteer computing has the potential to provide significant additional computing capacity for the LHC experiments. Initiatives such as the CMS@home project aim to integrate volunteer computing resources into the experiment's computational frameworks to support their scientific workloads. This is especially important, as over the next few years the demands on computing capacity will increase beyond what can be supported by general technology trends. This paper describes how a volunteer computing project that uses virtualization to run high energy physics simulations can integrate those resources into its computing infrastructure. The concept of the volunteer cloud is introduced, and how this model can simplify the integration is described. An architecture for implementing the volunteer cloud model is presented, along with an implementation for the CMS@home project. Finally, the submission of real CMS workloads to this volunteer cloud is compared to identical workloads submitted to the grid.
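A rough sketch of the volunteer-cloud idea, under the assumption that each volunteer machine boots a preconfigured virtual machine whose agent fetches simulation tasks from an experiment-side queue and uploads the results; the endpoints and task format below are invented for illustration and are not the CMS@home interfaces.

```python
import time
import urllib.request
from typing import Optional

# Hypothetical experiment-side endpoints; the real CMS@home infrastructure has
# its own task queue and data bridge, not these URLs.
TASK_QUEUE_URL = "https://example.org/volunteer/tasks/next"
UPLOAD_URL = "https://example.org/volunteer/results"

def fetch_task() -> Optional[bytes]:
    try:
        with urllib.request.urlopen(TASK_QUEUE_URL, timeout=30) as resp:
            return resp.read() or None
    except OSError:
        return None   # volunteer hosts come and go; failures are expected

def run_simulation(task: bytes) -> bytes:
    # Placeholder for running the packaged simulation inside the volunteer VM.
    return b"result for " + task

def upload_result(result: bytes) -> None:
    req = urllib.request.Request(UPLOAD_URL, data=result, method="POST")
    try:
        urllib.request.urlopen(req, timeout=60).close()
    except OSError:
        pass   # a real agent would retry or stage the result locally

def agent_loop():
    # The agent runs for as long as the volunteer donates the virtual machine.
    while True:
        task = fetch_task()
        if task is None:
            time.sleep(300)
            continue
        upload_result(run_simulation(task))
```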
FermiGrid - experience and future plans
Fermilab supports a scientific program that includes experiments and scientists located across the globe. In order to better serve this community, Fermilab has placed its production computing resources in a Campus Grid infrastructure called FermiGrid. The FermiGrid infrastructure allows the large experiments at Fermilab to have priority access to their own resources, enables sharing of these resources in an opportunistic fashion, and supports movement of work (jobs, data) between the Campus Grid and national grids such as the Open Science Grid and the WLCG. FermiGrid resources support multiple Virtual Organizations (VOs), including VOs from the Open Science Grid (OSG), EGEE and the Worldwide LHC Computing Grid Collaboration (WLCG). Fermilab also makes leading contributions to the Open Science Grid in the areas of accounting, batch computing, grid security, job management, resource selection, site infrastructure, storage management, and VO services. Through the FermiGrid interfaces, authenticated and authorized VOs and individuals may access our core grid services, the 10,000+ Fermilab-resident CPUs, near-petabyte (including CMS) online disk pools and the multi-petabyte Fermilab Mass Storage System. These core grid services include a site-wide Globus gatekeeper, VO management services for several VOs, Fermilab site authorization services, grid user mapping services, as well as job accounting and monitoring, resource selection and data movement services. Access to these services is via standard and well-supported grid interfaces. We report on the experience of using the FermiGrid campus infrastructure interfaced to a national cyberinfrastructure: the successes and the problems.
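As a toy illustration of the grid user mapping and site authorization mentioned above, the sketch below maps an authenticated certificate identity plus VO membership to a local pool account, with a site-level veto applied first; the table entries and account names are invented, and FermiGrid's actual services implement far richer policy.

```python
from typing import Optional

# Invented VO-to-account mapping; a real site maps identities to pools of
# accounts with per-role and per-group policy.
VO_ACCOUNT_POOL = {
    "cms": "cmsusr",
    "dzero": "d0usr",
    "fermilab": "fnalusr",
}

BANNED_DNS = {"/DC=org/DC=example/CN=revoked user"}   # illustrative ban list

def authorize_and_map(dn: str, vo: str) -> Optional[str]:
    """Return a local account for an authenticated grid identity, or None."""
    if dn in BANNED_DNS:
        return None                     # site authorization veto
    return VO_ACCOUNT_POOL.get(vo)      # VO-based pool-account mapping

print(authorize_and_map("/DC=org/DC=example/CN=some physicist", "cms"))  # -> cmsusr
```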