Search CORE

10 research outputs found

Construction and Evaluation of Coordinated Performance Skeletons

Author: J. Ziv
L.P. Cordella
M. Crochemore
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2008
Field of study

Performance prediction is particularly challenging for dynamic foreign environments that cannot be modeled well, such as those involving resource sharing or foreign system components. Our approach is based on the concept of a performance skeleton which is a short running program whose execution time in any scenario reflects the estimated execution time of the application it represents. The fundamental technical challenge is automatic construction of performance skeletons for parallel MPI programs. The steps are 1) generation of process execution traces and conversion to a single coordinated logical program trace, 2) compression of the logical program trace, and 3) conversion to an executable parallel skeleton program. Results are presented to validate the construction methodology and prediction power of performance skeletons. The execution scenarios analyzed involve network sharing, different architectures and different MPI libraries. The emphasis is on identifying the strength and limitations of this approach to performanc

CiteSeerX

Crossref

Loop-based Modeling of Parallel Communication Traces

Author: Clauss Philippe
Genaud Stéphane
Ketterlin Alain
Kuhn Matthieu
Publication venue: HAL CCSD
Publication date: 23/07/2014
Field of study

This paper describes an algorithm that takes a trace of a distributed program and builds a model of all communications of the program. The model is a set of nested loops representing repeated patterns. Loop bodies collect events representing communication actions performed by the various processes, like sending or receiving messages, and participating in collective operations. The model can be used for compact visualization of full executions, for program understanding and debugging, and also for building statistical analyzes of various quantitative aspects of the program's behavior. The construction of the communication model is performed in two phases. First, a local model is built for each process, capturing local regularities; this phase is incremental and fast, and can be done on-line, during the execution. The second phase is a reduction process that collects, aligns, and finally merges all local models into a global, system-wide model. This global model is a compact representation of all communications of the original program, capturing patterns across groups of processes. It can be visualized directly and, because it takes the form of a sequence of loop nests, can be used to replay the original program's communication actions. Because the model is based on communication events only, it completely ignores other quantitative aspects like timestamps or messages sizes. Including such data would in most case break regularities, reducing the usefulness of trace-based modeling. Instead, the paper shows how one can efficiently access quantitative data kept in the original trace(s), by annotating the model and compiling data scanners automatically.Ce rapport de recherche décrit un algorithme qui prend en entrée la trace d'un programme distribué, et construit un modèle de l'ensemble des communications du programme. Le modèle prend la forme d'un ensemble de boucles imbriquées représentant la répétition de motifs de communication. Le corps des boucles regroupe des événements représentant les actions de communication réalisées par les différents processus impliqués, tels que l'envoi et la réception de messages, ou encore la participation à des opérations collectives. Le modèle peut servir à la visualisation compact d'exécutions complètes, à la compréhension de programme et au debugging, mais également à la construction d'analyses statistiques de divers aspects quantitatifs du comportement du programme. La construction du modèle de communication s'effectue en deux phases. Premièrement, un modèle local est construit au sein de chaque processus, capturant les régularités locales~; cette phase est incrémentale et rapide, et peut être réalisée au cours de l'exécution. La seconde phase est un processus de réduction qui rassemble, aligne, et finalement fusionne tous les modèles locaux en un modèle global décrivant la totalité du système. Ce modèle global est une représentation compacte de toutes les communications du programme original, représentant des motifs de communication entre groupes de processus. Il peut être visualisé directement et, puisqu'il prend la forme d'un ensemble de nids de boucles, peut servir à rejouer les opérations de communication du programme initial. Puisque le modèle construit se base uniquement sur les opérations de communication, il ignore complètement d'autres données quantitatives, telles que les informations chronologiques, ou les tailles de messages. L'inclusion de telles données briserait dans la plupart des cas les régularités topologiques, réduisant l'efficacité de la modélisation par boucles. Nous préférons, dans ce rapport, montrer comment, grâce au modèle construit, il est possible d'accéder efficacement aux données quantitatives si celles-ci sont conservées dans les traces individuelles, en annotant le modèle et en l'utilisant pour compiler automatiquement des programmes d'accès aux données

INRIA a CCSD electronic archive server

Techniques To Facilitate the Understanding of Inter-process Communication Traces

Author: Alawneh Lu'ay
Publication venue
Publication date: 12/04/2012
Field of study

High Performance Computing (HPC) systems play an important role in today’s heavily digitized world, which is in a constant demand for higher speed of calculation and performance. HPC applications are used in multiple domains such as telecommunication, health, scientific research, and more. With the emergence of multi-core and cloud computing platforms, the HPC paradigm is quickly becoming the design of choice of many service providers. HPC systems are also known to be complex to debug and analyze due to the large number of processes they involve and the way these processes communicate with each other to perform specific tasks. As a result, software engineers must spend extensive amount of time understanding the complex interactions among a system’s processes. This is usually done through the analysis of execution traces generated from running the system at hand. Traces, however, are very difficult to work with due to the overwhelming size of typical traces. The objective of this research is to present a set of techniques that facilitates the understanding of the behaviour of HPC applications through the analysis of system traces. The first technique consists of building an exchange format called MTF (MPI Trace Format) for representing and exchanging traces generated from HPC applications based on the MPI (Message Passing Interface) standard, which is a de facto standard for inter-process communication for high performance computing systems. The design of MTF is validated against well-known requirements for a standard exchange format. The second technique aims to facilitate the understanding of large traces of inter-process communication by automatically extracting communication patterns that characterize their main behaviour. Two algorithms are presented. The first one permits the recognition of repeating patterns in traces of MPI (Message Passing Interaction) applications whereas the second algorithm searches if a given communication pattern occurs in a trace. Both algorithms are based on the n-gram extraction technique used in natural language processing. Finally, we developed a technique to abstract MPI traces by detecting the different execution phases in a program based on concepts from information theory. Using this approach, software engineers can examine the trace as a sequence of high-level computational phases instead of a mere flow of low-level events. The techniques presented in this thesis have been tested on traces generated from real HPC programs. The results from several case studies demonstrate the usefulness and effectiveness of our techniques

Concordia University Research Repository

Automatic Energy Saving Schemes for Parallel Applications

Author: Sundriyal Vaibhav
Publication venue: Iowa State University Digital Repository
Publication date: 01/01/2013
Field of study

Although high-performance computing traditionally focuses on the efficient execution of large-scale applications, both energy and power have become critical concerns when approaching exascale. Drastic increases in the power consumption of supercomputers affect significantly their operating costs and failure rates. In modern microprocessor architectures, equipped with dynamic voltage and frequency scaling (DVFS) and CPU clock modulation (throttling), the power consumption may be controlled in software. Additionally, network interconnect, such as Infiniband, may be exploited to maximize energy savings while the application performance loss and frequency switching overheads must be carefully balanced. This work first studies two important collective communication operations, all-to-all and allgather and proposes energy saving strategies on the per-call basis. Next, it targets point-to-point communications to group them into phases and apply frequency scaling to them to save energy by exploiting the architectural and communication stalls. Finally, it proposes an automatic runtime system which combines both collective and point-to-point communications into phases, and applies throttling to them apart from DVFS to maximize energy savings. The experimental results are presented for NAS parallel benchmark problems as well as for the realistic parallel electronic structure calculations performed by the widely used quantum chemistry package GAMESS. Close to the maximum energy savings were obtained with a substantially low performance loss on the given platform

Digital Repository @ Iowa State University (ISU)

British Musical Modernism Defended against its Devotees

Author: Forkert Annika
Publication venue
Publication date: 01/01/2014
Field of study

Royal Holloway - Pure

Cartographies of Copyright: Crisis & Propertization

Author: Flanders Paul
Publication venue: fi=Turun yliopisto|en=University of Turku|
Publication date: 15/08/2017
Field of study

Cartographies of Copyright is a cultural history of copyright that maps out various contradictions and tensions that give shape to the crisis of copyright and its relations to US music industries. More specifically, this work charts the radical dissension of copyright in recent history and argues for an understanding of the crisis as an internal transformative process. This formulation shifts the analytic approach from an abstract conceptual-legal perspective to a series of discrete points within a lived history, culture and materiality of copyright thought, audio technologies and neoliberal capitalism. Seen in terms of territorialization, the expansion of copyright via the music industries gives unique insight into the particular ways music and its media formats effect the evolution of copyright culture and law. When framed this way, an investigation into music and copyright leads to recognizing new forms of control, changing modes of administering access and contemporary relations of power. To chart the effects of these transformations I draw upon the tensions and problematics that constitute the Wu-Tang Clan’s 2015 one-of-a-kind release: Once Upon A Time In Shaolin. After accessing contentious foci within traditional copyright paradigms, I argue that classical models lack explanatory power, pose problems for understanding the transformations of copyright throughout history and fail to provide a comprehensive account of the present crisis. Taking inspiration from the reoccupation thesis presented by Hans Blumenberg and recent research in copyright I propose an alternative model rooted at the dialectic crux of property metaphors, audio technologies and formats, and neocapitalist commodity logic––all of which give shape to an internal transformation within copyright law and culture I term propertization. Cartographies of Copyright is not a legal treatise, per se; rather, the work sees law as text set within a larger social milieu liable to change and evolution. Drawing from legal theory, cultural studies and media studies the work engages copyright from the perspectives of critical history, rhetoric, media-materiality and speculative economics. The multidisciplinary approach also stakes out new formations to the problematics faced by music historically and to draw connections between the music industries and the broader social contexts they are situated in. Cartographies argues for a new lexicon to begin imagining alternative models to account for copyright’s transformations––ones better suited to imagining viable alternatives. It calls attention to the urgent need of public discussion concerning the role of copyright in contemporary music and culture and provides modest suggestions for thinkingSiirretty Doriast

UTUPub

Theology of the Pain of God: An Analysis and Evaluation of Kazoh Kitamori\u27s (1916- ) Work in Japanese Protestantism

Author: Hashimoto Akio
Publication venue: Scholarly Resources from Concordia Seminary
Publication date: 01/05/1992
Field of study

The overall negative appraisal of Kitamori must be considered as a critical reaction to various aspects of his own theology. Perhaps not one or two aspects of his theology alone are responsible for this. The reason for the negative responses is surely of a composite nature. However, it is not the main concern of this study to find answers individually to the questions raised above. The intention of this study is to analyze and assess Kitamori\u27s theology as a whole. But to try to find the answers to the questions is useful for the purpose here: they provide methodological clues

Concordia Seminary, St. Louis: Scholarly Resources

Neurath Reconsidered: New Sources and Perspectives

Author
Publication venue: 'Springer Fachmedien Wiesbaden GmbH'
Publication date: 01/01/2019
Field of study

Repository of the Academy's Library

Logicalization of Communication Traces from Parallel Execution

Author: Jaspal Subhlok
Qiang Xu
Rong Zheng
Sara Voss
Publication venue
Publication date: 24/03/2010
Field of study

Abstract—Communication traces are integral to performance modeling and analysis of parallel programs. However, execution on a large number of nodes results in a large trace volume that is cumbersome and expensive to analyze. This paper presents an automatic framework to convert all process traces corresponding to the parallel execution of an SPMD MPI program into a single logical trace. First, the application communication matrix is generated from process traces. Next, topology identification is performed based on the underlying communication structure and independent of the way ranks (or numbers) are assigned to processes. Finally, message exchanges between physical processes are converted into logical message exchanges that represent similar message exchanges across all processes, resulting in a trace volume reduction approximately equal to the number of processes executing the application. This logicalization framework has been implemented and the results report on its performance and effectiveness. 1 I

CiteSeerX

Crossref