Search CORE

2,743 research outputs found

High-Throughput Computing on High-Performance Platforms: A Case Study

Author: Angius Alessio
De Kaushik
Jha Shantenu
Klimentov Alexei
Oleynik Danila
Oral Sarp H.
Panitkin Sergey
Turilli Matteo
Wells Jack C.
Publication venue
Publication date: 27/10/2017
Field of study

The computing systems used by LHC experiments has historically consisted of the federation of hundreds to thousands of distributed resources, ranging from small to mid-size resource. In spite of the impressive scale of the existing distributed computing solutions, the federation of small to mid-size resources will be insufficient to meet projected future demands. This paper is a case study of how the ATLAS experiment has embraced Titan---a DOE leadership facility in conjunction with traditional distributed high- throughput computing to reach sustained production scales of approximately 52M core-hours a years. The three main contributions of this paper are: (i) a critical evaluation of design and operational considerations to support the sustained, scalable and production usage of Titan; (ii) a preliminary characterization of a next generation executor for PanDA to support new workloads and advanced execution modes; and (iii) early lessons for how current and future experimental and observational systems can be integrated with production supercomputers and other platforms in a general and extensible manner

arXiv.org e-Print Archive

Crossref

Technology assessment of advanced automation for space missions

Author
Publication venue
Publication date: 01/11/1982
Field of study

Six general classes of technology requirements derived during the mission definition phase of the study were identified as having maximum importance and urgency, including autonomous world model based information systems, learning and hypothesis formation, natural language and other man-machine communication, space manufacturing, teleoperators and robot systems, and computer science and technology

NASA Technical Reports Server

Reusable Centaur study. Volume 1: Executive summary

Author: Heald D. A.
Publication venue
Publication date
Field of study

A study of the Reusable Centaur for use as an initial upper stage with the space shuttle was conducted. The currently operative Centaur stage, with modifications for space shuttle orbiter compatibility and for improved performance, represents a cost effective development solution. The performance needs and available development funds are discussed. The main features of three Reusable Centaur configurations with increasing capability at increasing development costs are summarized

NASA Technical Reports Server

Analyzing Resource Utilization in an HPC System: A Case Study of NERSC Perlmutter

Author: Chen Yong
Cook Brandon
Cooray Dulanya
Li Jie
Michelogiannakis George
Publication venue
Publication date: 12/03/2023
Field of study

Resource demands of HPC applications vary significantly. However, it is common for HPC systems to primarily assign resources on a per-node basis to prevent interference from co-located workloads. This gap between the coarse-grained resource allocation and the varying resource demands can lead to HPC resources being not fully utilized. In this study, we analyze the resource usage and application behavior of NERSC's Perlmutter, a state-of-the-art open-science HPC system with both CPU-only and GPU-accelerated nodes. Our one-month usage analysis reveals that CPUs are commonly not fully utilized, especially for GPU-enabled jobs. Also, around 64% of both CPU and GPU-enabled jobs used 50% or less of the available host memory capacity. Additionally, about 50% of GPU-enabled jobs used up to 25% of the GPU memory, and the memory capacity was not fully utilized in some ways for all jobs. While our study comes early in Perlmutter's lifetime thus policies and application workload may change, it provides valuable insights on performance characterization, application behavior, and motivates systems with more fine-grain resource allocation

arXiv.org e-Print Archive

Power Bounded Computing on Current & Emerging HPC Systems

Author: Zou Pengfei
Publication venue: Clemson University Libraries
Publication date: 01/05/2020
Field of study

Power has become a critical constraint for the evolution of large scale High Performance Computing (HPC) systems and commercial data centers. This constraint spans almost every level of computing technologies, from IC chips all the way up to data centers due to physical, technical, and economic reasons. To cope with this reality, it is necessary to understand how available or permissible power impacts the design and performance of emergent computer systems. For this reason, we propose power bounded computing and corresponding technologies to optimize performance on HPC systems with limited power budgets. We have multiple research objectives in this dissertation. They center on the understanding of the interaction between performance, power bounds, and a hierarchical power management strategy. First, we develop heuristics and application aware power allocation methods to improve application performance on a single node. Second, we develop algorithms to coordinate power across nodes and components based on application characteristic and power budget on a cluster. Third, we investigate performance interference induced by hardware and power contentions, and propose a contention aware job scheduling to maximize system throughput under given power budgets for node sharing system. Fourth, we extend to GPU-accelerated systems and workloads and develop an online dynamic performance & power approach to meet both performance requirement and power efficiency. Power bounded computing improves performance scalability and power efficiency and decreases operation costs of HPC systems and data centers. This dissertation opens up several new ways for research in power bounded computing to address the power challenges in HPC systems. The proposed power and resource management techniques provide new directions and guidelines to green exscale computing and other computing systems

Clemson University: TigerPrints

Biospheres and solar system exploration

Author: Paine Thomas O.
Publication venue
Publication date
Field of study

The implications of biosphere technology is briefly examined. The exploration status and prospects of each world in the solar system is briefly reviewed, including the asteroid belt, the moon, and comets. Five program elements are listed as particularly critical for future interplanetary operations during the coming extraterrestrial century. They include the following: (1) a highway to Space (earth orbits); (2) Orbital Spaceports to support spacecraft assembly, storage, repair, maintenance, refueling, launch, and recovery; (3) a Bridge Between Worlds to transport cargo and crews to the moon and beyond to Mars; (4) Prospecting and Resource Utilization Systems to map and characterize the resources of planets, moons, and asteroids; and (5) Closed Ecology Biospheres. The progress in these five field is reviewed

NASA Technical Reports Server

Aerospace Medicine and Biology: A continuing bibliography with indexes (supplement 290)

Author
Publication venue
Publication date
Field of study

This bibliography lists 125 reports, articles and other documents introduced into the NASA scientific and technical information system in October 1986

NASA Technical Reports Server

The 1990 progress report and future plans

Author: Compton Michael
Friedland Peter
Zweben Monte
Publication venue
Publication date
Field of study

This document describes the progress and plans of the Artificial Intelligence Research Branch (RIA) at ARC in 1990. Activities span a range from basic scientific research to engineering development and to fielded NASA applications, particularly those applications that are enabled by basic research carried out at RIA. Work is conducted in-house and through collaborative partners in academia and industry. Our major focus is on a limited number of research themes with a dual commitment to technical excellence and proven applicability to NASA short, medium, and long-term problems. RIA acts as the Agency's lead organization for research aspects of artificial intelligence, working closely with a second research laboratory at JPL and AI applications groups at all NASA centers

NASA Technical Reports Server