Search CORE

18 research outputs found

Performance Evaluation of Adaptive Scientific Applications using TAU

Author: Alan Morris A
Allen D. Malony A
Germain B
J. Davison De St
Sameer Shende A
Steven Parker B
Publication venue
Publication date: 02/04/2008
Field of study

Fueled by increasing processor speeds and high speed interconnection networks, advances in high performance computer architectures have allowed the development of increasingly complex large scale parallel systems. For computational scientists, programming these systems efficiently is a challenging task. Understanding the performance of their parallel applications i

CiteSeerX

Recommended from our members

Performance measurement and modeling of component applications in a high performance computing environment : a case study.

Author: Armstrong Robert C.
Malony A. (University of Oregon, Eugene, OR)
Ray Jaideep
Shende Sameer (University of Oregon, Eugene, OR)
Trebon Nicholas D.
Publication venue: 'Office of Scientific and Technical Information (OSTI)'
Publication date: 01/11/2003
Field of study

We present a case study of performance measurement and modeling of a CCA (Common Component Architecture) component-based application in a high performance computing environment. We explore issues peculiar to component-based HPC applications and propose a performance measurement infrastructure for HPC based loosely on recent work done for Grid environments. A prototypical implementation of the infrastructure is used to collect data for a three components in a scientific application and construct performance models for two of them. Both computational and message-passing performance are addressed

UNT Digital Library

Performance Anomaly Detection and Bottleneck Identification

Author: Alpaydin E.
Barham Paul
Berkhin Pavel
Bodík Peter
Brey Jack
Burke Shaun
Chung Hsin
Cohen Ira
Dean Daniel J.
Fodor Imola K.
Frank
Fu Song
Fu Song
Gregg Brendan
Guan Qiang
Gunther Neil J.
Huang Su-Yun
Igor
Jeffrey
John
Kang Hui
Kelly Terence
Kotsiantis S. B.
Lee Han Bok
Lee Wenke
Lilja David J.
Malkowski Simon
McHugh Andrew
Oakland John S.
Panourgias Iakovos
Reiss Charles
Reynolds Douglas
Sambasivan Raja R.
Shallahamer Craig A.
Shende Sameer
Tan Yongmin
Tarby Jean-Claude
Trubin Igor
Wang Chengwei
Wang Haichuan
Wang Tao
Wilder John
Yu Minlan
Zhang Qi
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date
Field of study

Crossref

Scalable, Automated Performance Analysis with TAU and PerfExplorer

Author: Huck Kevin A.
Malony Allen D.
Shende Sameer
Publication venue: John von Neumann Institute for Computing
Publication date: 01/01/2007
Field of study

Juelich Shared Electronic Resources

GPAW - massively parallel electronic structure calculations with Python-based software

Author: Enkovaara Jussi
Mortensen Jens Jørgen
Romero Nichols A.
Shende Sameer
Publication venue: 'Elsevier BV'
Publication date: 01/01/2011
Field of study

AbstractElectronic structure calculations are a widely used tool in materials science and large consumer of supercomputing resources. Traditionally, the software packages for these kind of simulations have been implemented in compiled languages, where Fortran in its different versions has been the most popular choice. While dynamic, interpreted languages, such as Python, can increase the effciency of programmer, they cannot compete directly with the raw performance of compiled languages. However, by using an interpreted language together with a compiled language, it is possible to have most of the productivity enhancing features together with a good numerical performance. We have used this approach in implementing an electronic structure simulation software GPAW using the combination of Python and C programming languages. While the chosen approach works well in standard workstations and Unix environments, massively parallel supercomputing systems can present some challenges in porting, debugging and profiling the software. In this paper we describe some details of the implementation and discuss the advantages and challenges of the combined Python/C approach. We show that despite the challenges it is possible to obtain good numerical performance and good parallel scalability with Python based software

Elsevier - Publisher Connector

Online Research Database In Technology

Scalable, Automated Performance Analysis with TAU and PerfExplorer

Author: Alan Morris
Allen D. Malony
Kevin A. Huck
Sameer Shende
Publication venue
Publication date: 01/01/2007
Field of study

Scalable performance analysis is a challenge for parallel development tools. The potential size of data sets and the need to compare results from multiple experiments presents a challenge to manage and process the information, and to characterize the performance of parallel applications running on potentially hundreds of thousands of processor cores. In addition, many exploratory analysis processes represent potentially repeatable processes which can and should be automated. In this paper, we will discuss the current version of PerfExplorer, a performance analysis framework which provides dimension reduction, clustering and correlation analysis of individual trails of large dimensions, and can perform relative performance analysis between multiple application executions. PerfExplorer analysis processes can be captured in the form of Python scripts, automating what would otherwise be time-consuming tasks. We will give examples of large-scale analysis results, and discuss the future development of the framework, including the encoding and processing of expert performance rules, and the increasing use of performance metadata.

CiteSeerX

Knowledge Support and Automation for Performance Analysis with PerfExplorer 2.0

Author: Alan Morris
Allen D. Malony
Kevin A. Huck
Sameer Shende
Publication venue: Hindawi Limited
Publication date: 01/01/2008
Field of study

The integration of scalable performance analysis in parallel development tools is difficult. The potential size of data sets and the need to compare results from multiple experiments presents a challenge to manage and process the information. Simply to characterize the performance of parallel applications running on potentially hundreds of thousands of processor cores requires new scalable analysis techniques. Furthermore, many exploratory analysis processes are repeatable and could be automated, but are now implemented as manual procedures. In this paper, we will discuss the current version of PerfExplorer, a performance analysis framework which provides dimension reduction, clustering and correlation analysis of individual trails of large dimensions, and can perform relative performance analysis between multiple application executions. PerfExplorer analysis processes can be captured in the form of Python scripts, automating what would otherwise be time-consuming tasks. We will give examples of large-scale analysis results, and discuss the future development of the framework, including the encoding and processing of expert performance rules, and the increasing use of performance metadata

Directory of Open Access Journals