
    Analyzing high energy physics data using database computing: Preliminary report

    A proof-of-concept system is described for analyzing high energy physics (HEP) data using database computing. The system is designed to scale up to the size required for HEP experiments at the Superconducting Super Collider (SSC) laboratory. These experiments will require collecting and analyzing approximately 10 to 100 million 'events' per year from proton colliding-beam collisions. Each 'event' consists of a set of vectors with a total length of approximately one megabyte. This represents an increase of approximately 2 to 3 orders of magnitude over the amount of data accumulated by present HEP experiments. The system is called the HEPDBC System (High Energy Physics Database Computing System). At present, the Mark 0 HEPDBC System is complete and can analyze HEP experimental data approximately an order of magnitude faster than current production software on data sets of approximately 1 GB. The Mark 1 HEPDBC System is currently undergoing testing and is designed to analyze data sets 10 to 100 times larger.
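
    For context, a minimal back-of-envelope sketch of the raw data volume these figures imply, using only the numbers quoted in the abstract (10 to 100 million events per year at roughly one megabyte per event); the function and variable names below are illustrative, not part of the HEPDBC system:

    ```python
    # Rough yearly data-volume estimate from the abstract's figures:
    # 10-100 million events per year, each roughly 1 MB.

    MB = 10**6  # bytes per megabyte (decimal convention)
    TB = 10**12  # bytes per terabyte

    def yearly_volume_bytes(events_per_year: float, event_size_bytes: float = 1 * MB) -> float:
        """Total raw data produced per year at the given event rate and size."""
        return events_per_year * event_size_bytes

    for events in (10e6, 100e6):
        volume_tb = yearly_volume_bytes(events) / TB
        print(f"{events:.0e} events/year -> ~{volume_tb:.0f} TB of raw data per year")
    # Prints roughly 10 TB at the low end and 100 TB at the high end.
    ```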

    Three Case Studies of Large-Scale Data Flows

    We survey three examples of large-scale scientific workflows that we are working with at Cornell: the Arecibo sky survey, the CLEO high-energy particle physics experiment, and the Web Lab project for enabling social science studies of the Internet. All three projects face the same general challenges: massive amounts of raw data, expensive processing steps, and the requirement to make raw data or data products available to users nation- or world-wide. However, there are several differences that prevent a one-size-fits-all approach to handling their data flows. Instead, current implementations are heavily tuned by domain and data management experts. We describe the three projects, and we outline research issues and opportunities to integrate Grid technology into these workflows.