Search CORE

12 research outputs found

A visual Analytics System for Optimizing Communications in Massively Parallel Applications

Author: Fujiwara Takanori
Ma Kwan-Liu
Malakar Preeti
Papka Michael E.
Reda Khairi
Vishwanath Venkatram
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2017
Field of study

Current and future supercomputers have tens of thousands of compute nodes interconnected with high-dimensional networks and complex network topologies for improved performance. Application developers are required to write scalable parallel programs in order to achieve high throughput on these machines. Application performance is largely determined by efficient inter-process communication. A common way to analyze and optimize performance is through profiling parallel codes to identify communication bottlenecks. However, understanding gigabytes of profile data is not a trivial task. In this paper, we present a visual analytics system for identifying the scalability bottlenecks and improving the communication efficiency of massively parallel applications. Visualization methods used in this system are designed to comprehend large-scale and varied communication patterns on thousands of nodes in complex networks such as the 5D torus and the dragonfly. We also present efficient rerouting and remapping algorithms that can be coupled with our interactive visual analytics design for performance optimization. We demonstrate the utility of our system with several case studies using three benchmark applications on two leading supercomputers. The mapping suggestion from our system led to 38% improvement in hop-bytes for MiniAMR application on 4,096 MPI processes.This research has been sponsored in part by the U.S. National Science Foundation through grant IIS-1320229, and the U.S. Department of Energy through grants DE-SC0012610 and DE-SC0014917. This research has been funded in part and used resources of the Argonne Leadership Computing Facility at Argonne National Lab- oratory, which is supported by the Office of Science of the U.S. Department of Energy under contract no. DE-AC02-06CH11357. This work was supported in part by the DOE Office of Science, ASCR, under award numbers 57L38, 57L32, 57L11, 57K50, and 508050

Crossref

IUPUIScholarWorks

Topology-Aware Data Aggregation for Intensive I/O on Large-Scale Supercomputers

Author: Isaila Florin
Jeannot Emmanuel
Malakar Preeti
Tessier François
Vishwanath Venkatram
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 18/11/2016
Field of study

International audienceReading and writing data efficiently from storage systems is critical for high performance data-centric applications. These I/O systems are being increasingly characterized by complex topologies and deeper memory hierarchies. Effective parallel I/O solutions are needed to scale applications on current and future supercomputers. Data aggregation is an efficient approach consisting of electing some processes in charge of aggregating data from a set of neighbors and writing the aggregated data into storage. Thus, the bandwidth use can be optimized while the contention is reduced. In this work, we take into account the network topology for mapping aggregators and we propose an optimized buffering system in order to reduce the aggregation cost. We validate our approach using micro-benchmarks and the I/O kernel of a large-scale cosmology simulation. We show improvements up to 15× faster for I/O operations compared to a standard implementation of MPI I/O

INRIA a CCSD electronic archive server

Integrated parallelization of computations and visualization for large-scale applications

Author: Malakar Preeti
Natarajan Vijay
Vadhiyar Sathish S
Publication venue: IEEE
Publication date
Field of study

Critical applications like cyclone tracking and earthquake modeling require simultaneous high-performance simulations and online visualization for timely analysis. Faster simulations and simultaneous visualization enable scientists provide real-time guidance to decision makers. In this work, we have developed an integrated user-driven and automated steering framework that simultaneously performs numerical simulations and efficient online remote visualization of critical weather applications in resource-constrained environments. It considers application dynamics like the criticality of the application and resource dynamics like the storage space, network bandwidth and available number of processors to adapt various application and resource parameters like simulation resolution, simulation rate and the frequency of visualization. We formulate the problem of finding an optimal set of simulation parameters as a linear programming problem. This leads to 30% higher simulation rate and 25-50% lesser storage consumption than a naive greedy approach. The framework also provides the user control over various application parameters like region of interest and simulation resolution. We have also devised an adaptive algorithm to reduce the lag between the simulation and visualization times. Using experiments with different network bandwidths, we find that our adaptive algorithm is able to reduce lag as well as visualize the most representative frames

Open Access Repository of IISc Research Publications

An Integrated Simulation and Visualization Framework for Tracking Cyclone Aila

Author: Malakar Preeti
Nanjundiah Ravi
Natarajan Vijay
Vadhiyar Sathish
Publication venue
Publication date
Field of study

Open Access Repository of IISc Research Publications

InSt: An Integrated Steering Framework for Critical Weather Applications

Author: Preeti Malakar A
Sathish S. Vadhiyar B
Vijay Natarajan A
Publication venue
Publication date: 01/01/2011
Field of study

Online remote visualization and steering of critical weather applications like cyclone tracking are essential for effective and timely analysis by geographically distributed climate science community. A steering framework for controlling the high-performance simulations of critical weather events needs to take into account both the steering inputs of the scientists and the criticality needs of the application including minimum progress rate of simulations and continuous visualization of significant events. In this work, we have developed an integrated user-driven and automated steering framework InSt for simulations, online remote visualization, and analysis for critical weather applications. InSt provides the user control over various application parameters including region of interest, resolution of simulation, and frequency of data for visualization. Unlike existing efforts, our framework considers both the steering inputs and the criticality of the application, namely, the minimum progress rate needed for the application, and various resource constraints including storage space and network bandwidth to decide the best possible parameter values for simulations and visualization

CiteSeerX

Elsevier - Publisher Connector

Open Access Repository of IISc Research Publications

Hierarchical Read–Write Optimizations for Scientific Applications with Multi-variable Structured Datasets

Author: JW Hurrell
Preeti Malakar
R Haring
R Thakur
Venkatram Vishwanath
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

A Diffusion-Based Processor Reallocation Strategy for Tracking Multiple Dynamically Varying Weather Phenomena

Author: Preeti Malakar
Ravi S. Nanjundiah
Sathish S. Vadhiyar
Vijay Natarajan
Publication venue
Publication date
Field of study

Abstract—Many meteorological phenomena occur at different locations simultaneously. These phenomena vary temporally and spatially. It is essential to track these multiple phenomena for accurate weather prediction. Efficient analysis require highresolution simulations which can be conducted by introducing finer resolution nested simulations, nests at the locations of these phenomena. Simultaneous tracking of these multiple weather phenomena requires simultaneous execution of the nests on different subsets of the maximum number of processors for the main weather simulation. Dynamic variation in the number of these nests require efficient processor reallocation strategies. In this paper, we have developed strategies for efficient partitioning and repartitioning of the nests among the processors. As a case study, we consider an application of tracking multiple organized cloud clusters in tropical weather systems. We first present a parallel data analysis algorithm to detect such clouds. We have developed a tree-based hierarchical diffusion method which reallocates processors for the nests such that the redistribution cost is less. We achieve this by a novel tree reorganization approach. We show that our approach exhibits up to 25 % lower redistribution cost and 53 % lesser hop-bytes than the processor reallocation strategy that does not consider the existing processor allocation. Index Terms—redistribution; processor reallocation; data analysis; cloud tracking I

CiteSeerX

Crossref

Open Access Repository of IISc Research Publications

A Divide and Conquer Strategy for Scaling Weather Simulations with Multiple Regions of Interest

Author: Preeti Malakar
Rashmi Mittal
Sameer Kumar
Sathish S. Vadhiyar
Thomas George
Vaibhav Saxena
Vijay Natarajan
Yogish Sabharwal
Publication venue
Publication date: 01/01/2013
Field of study

Abstract—Accurate and timely prediction of weather phenomena, such as hurricanes and flash floods, require highfidelity compute intensive simulations of multiple finer regions of interest within a coarse simulation domain. Current weather applications execute these nested simulations sequentially using all the available processors, which is sub-optimal due to their sublinear scalability. In this work, we present a strategy for parallel execution of multiple nested domain simulations based on partitioning the 2-D processor grid into disjoint rectangular regions associated with each domain. We propose a novel combination of performance prediction, processor allocation methods and topology-aware mapping of the regions on torus interconnects. Experiments on IBM Blue Gene systems using WRF show that the proposed strategies result in performance improvement of up to 33 % with topology-oblivious mapping and up to additional 7 % with topology-aware mapping over the default sequential strategy. Index Terms—weather simulation; performance modeling; processor allocation; topology-aware mapping; I

CiteSeerX

Crossref

Directory of Open Access Journals

Open Access Repository of IISc Research Publications

Recommended from our members

A terminology for in situ visualization and analysis systems

Author: Ahern Sean D.
Ahrens James
Bauer Andrew C.
Bennett Janine
Bethel E. Wes
Bremer Peer-Timo
Brugger Eric
Childs Hank
Cottam Joseph
Dorier Matthieu
Dutta Soumya
Favre Jean M.
Fogal Thomas
Frey Steffen
Garth Christoph
Geveci Berk
Godoy William F.
Hansen Charles D.
Harrison Cyrus
Hentschel Bernd
Insley Joseph
Johnson Chris R.
Klasky Scott
Knoll Aaron
Kress James
Larsen Matthew
Lofstead Jay
Ma Kwan-Liu
Malakar Preeti
Meredith Jeremy
Moreland Kenneth
Navrátil Paul
O’Leary Patrick
Parashar Manish
Pascucci Valerio
Patchett John
Peterka Tom
Petruzza Steve
Podhorszki Norbert
Pugmire David
Rasquin Michel
Rizzi Silvio
Rogers David H.
Sane Sudhanshu
Sauer Franz
Shen Han-Wei
Sisneros Robert
Usher Will
Vickery Rhonda
Vishwanath Venkatram
Wald Ingo
Wang Ruonan
Weber Gunther H.
Whitlock Brad
Wolf Matthew
Yu Hongfeng
Ziegeler Sean B.
Publication venue: 'SAGE Publications'
Publication date: 01/01/2020
Field of study

The term “in situ processing” has evolved over the last decade to mean both a specific strategy for visualizing and analyzing data and an umbrella term for a processing paradigm. The resulting confusion makes it difficult for visualization and analysis scientists to communicate with each other and with their stakeholders. To address this problem, a group of over 50 experts convened with the goal of standardizing terminology. This paper summarizes their findings and proposes a new terminology for describing in situ systems. An important finding from this group was that in situ systems are best described via multiple, distinct axes: integration type, proximity, access, division of execution, operation controls, and output type. This paper discusses these axes, evaluates existing systems within the axes, and explores how currently used terms relate to the axes

eScholarship - University of California

Publikationsserver der RWTH Aachen University