
    An Analysis for Evaluating the Cost/Profit Effectiveness of Parallel Systems

    A new domain of commercial applications demands the development of inexpensive parallel computing platforms to lower the cost of operations and increase business profit. Calculating the return on an IT investment is now important to justify the decision to upgrade or replace parallel systems. This thesis presents a framework of the performance and economic factors that are considered when evaluating a parallel system. We introduce the cost/profit effectiveness metric, which measures the effectiveness of a parallel system in terms of performance, cost and profit. The metric describes the profit obtained from performance in three scaling domains: speed-up, throughput and/or scale-up. Cost is measured by the actual costs of the parallel system. We present two case studies to demonstrate the application of this metric and analyze the results to support the evaluation of the parallel system in each case.
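    The abstract does not reproduce the metric's formula, but the rough idea of relating profit from a performance gain to system cost can be sketched as follows. The function name, the multiplicative profit model and the numbers below are illustrative assumptions, not the thesis's definition.

```python
# Hypothetical sketch of a cost/profit effectiveness calculation.
# The profit model and formula below are illustrative assumptions,
# not the metric defined in the thesis.

def cost_profit_effectiveness(perf_gain, profit_per_unit_perf, total_cost):
    """Profit attributable to a performance gain, divided by system cost.

    perf_gain            -- e.g. measured speed-up, throughput or scale-up factor
    profit_per_unit_perf -- business profit attributed to one unit of that gain
    total_cost           -- actual cost of the parallel system (purchase + operation)
    """
    profit = perf_gain * profit_per_unit_perf
    return profit / total_cost

# Example: a 4x throughput gain worth $50k per unit on a $120k system.
print(cost_profit_effectiveness(4.0, 50_000, 120_000))  # ~1.67
```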

    Holistic Hardware Counter Performance Analysis of Parallel Programs

    The KOJAK toolkit has been augmented with refined hardware performance counter support, including more convenient measurement specification, additional metric derivations and hierarchical structuring, and an extended algebra for integrating multiple experiments. Comprehensive automated analysis of a hybrid OpenMP/MPI parallel program, the ASC Purple sPPM benchmark, is demonstrated with performance experiments on equisized systems: a POWER4-II-based IBM Regatta p690+ cluster, an Opteron-based Cray XD1 cluster, and an UltraSPARC-IV-based Sun Fire E25000. Automatically assessed communication and synchronisation performance properties, combined with a rich set of measured and derived counter metrics, provide a holistic analysis context and facilitate multi-platform comparison.
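    The kind of metric derivation mentioned in the abstract can be illustrated with a small sketch. The counter names follow the PAPI convention, but the specific derivations, values and flat "hierarchy" below are illustrative assumptions, not KOJAK's actual metric tree.

```python
# Illustrative derivation of higher-level metrics from raw hardware counters.
# Counter names follow the PAPI convention; the derivations and values are
# examples only, not KOJAK's actual metric definitions.

raw_counters = {
    "PAPI_TOT_CYC": 1_200_000_000,  # total cycles
    "PAPI_TOT_INS": 1_800_000_000,  # completed instructions
    "PAPI_L1_DCM":      9_000_000,  # level-1 data cache misses
    "PAPI_LD_INS":    450_000_000,  # load instructions
}

derived = {
    # instructions completed per cycle
    "IPC": raw_counters["PAPI_TOT_INS"] / raw_counters["PAPI_TOT_CYC"],
    # level-1 data cache misses per load
    "L1_miss_rate": raw_counters["PAPI_L1_DCM"] / raw_counters["PAPI_LD_INS"],
}

for name, value in derived.items():
    print(f"{name}: {value:.4f}")
```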


    Effects of Communication Protocol Stack Offload on Parallel Performance in Clusters

    The primary research objective of this dissertation is to demonstrate that the effects of communication protocol stack offload (CPSO) on application execution time can be attributed to two complementary sources. First, application-specific computation may be executed concurrently with the asynchronous communication performed by the communication protocol stack offload engine. Second, the protocol stack processing can be accelerated or decelerated by the offload engine. These two types of performance effects can be quantified with the degree of overlapping D_o and the degree of acceleration D_accs. The composite communication speedup metric S_comm(D_o, D_accs) can be used to quantify the combined effects of the protocol stack offload. The thesis of this dissertation is validated empirically: the degree of overlapping D_o, the degree of acceleration D_accs, and the communication speedup S_comm characteristic of the system configurations under test are derived in the course of experiments performed for the system configurations of interest. It is shown that the proposed metrics adequately describe the effects of the protocol stack offload on application execution time. Additionally, a set of analytical models of the networking subsystem of a PC-based cluster node is developed. As a result of the modeling, the metrics D_o, D_accs, and S_comm are obtained. The models are evaluated as to their complexity and precision by comparing the modeling results with the measured values of D_o, D_accs, and S_comm. The primary contributions of this dissertation research are as follows. First, the metrics D_accs and S_comm are introduced to complement the D_o metric in its use for evaluating the effects of optimizations in the networking subsystem on parallel performance in clusters; the metrics are shown to adequately describe CPSO performance effects. Second, a method for assessing the effects of CPSO scenarios on application performance is developed and presented. Third, a set of analytical models of cluster node networking subsystems with CPSO capability is developed and characterised as to their complexity and the precision with which they predict the D_o and D_accs metrics.
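    The abstract does not give formulas for D_o, D_accs and S_comm, but their general flavour can be sketched from measured times. The operational definitions below (ratios of overlapped communication time, host versus offloaded stack processing time, and baseline versus offloaded execution time) are assumptions for illustration; the dissertation's exact definitions may differ.

```python
# Hedged sketch of overlap, acceleration and speedup metrics computed from
# measured times. These operational definitions are illustrative assumptions,
# not the dissertation's exact formulas.

def degree_of_overlapping(t_comm_overlapped, t_comm_total):
    """Fraction of communication time hidden behind computation (D_o)."""
    return t_comm_overlapped / t_comm_total

def degree_of_acceleration(t_stack_host, t_stack_offload):
    """How much faster (>1) or slower (<1) the offload engine runs the stack (D_accs)."""
    return t_stack_host / t_stack_offload

def communication_speedup(t_exec_baseline, t_exec_offload):
    """Composite effect of offload on application execution time (S_comm)."""
    return t_exec_baseline / t_exec_offload

# Example with hypothetical measurements (seconds):
d_o    = degree_of_overlapping(t_comm_overlapped=6.0, t_comm_total=10.0)    # 0.6
d_accs = degree_of_acceleration(t_stack_host=4.0, t_stack_offload=2.5)      # 1.6
s_comm = communication_speedup(t_exec_baseline=100.0, t_exec_offload=88.0)  # ~1.14
print(d_o, d_accs, s_comm)
```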

    Exploring Scheduling for On-demand File Systems and Data Management within HPC Environments


    Simulation of networks of spiking neurons: A review of tools and strategies

    We review different aspects of the simulation of spiking neural networks. We start by reviewing the different types of simulation strategies and algorithms that are currently implemented. We next review the precision of those simulation strategies, in particular in cases where plasticity depends on the exact timing of the spikes. We overview the different simulators and simulation environments presently available (restricted to those that are freely available, open source and documented). For each simulation tool, its advantages and pitfalls are reviewed, with the aim of allowing the reader to identify which simulator is appropriate for a given task. Finally, we provide a series of benchmark simulations of different types of networks of spiking neurons, including Hodgkin-Huxley-type and integrate-and-fire models interacting through current-based or conductance-based synapses, using clock-driven or event-driven integration strategies. The same set of models is implemented on the different simulators, and the codes are made available. The ultimate goal of this review is to provide a resource that facilitates identifying the appropriate integration strategy and simulation tool to use for a given modeling problem related to spiking neural networks.
    Comment: 49 pages, 24 figures, 1 table; review article, Journal of Computational Neuroscience, in press (2007)
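    As a concrete illustration of the clock-driven integration strategy discussed in the review, here is a minimal leaky integrate-and-fire network with current-based synapses in plain Python/NumPy. The parameters, connectivity and forward-Euler scheme are arbitrary illustrative choices, not one of the paper's benchmark models.

```python
# Minimal clock-driven simulation of leaky integrate-and-fire neurons with
# current-based synapses. Parameters and connectivity are arbitrary
# illustrative choices, not one of the review's benchmark networks.
import numpy as np

rng = np.random.default_rng(0)

n, dt, t_end = 100, 0.1e-3, 0.5          # neurons, time step (s), duration (s)
tau_m, v_rest, v_thresh, v_reset = 20e-3, -70e-3, -50e-3, -60e-3
w = 0.5e-3 * (rng.random((n, n)) < 0.1)  # sparse current-based weights (V per spike)
i_ext = 21e-3                            # constant external drive (V equivalent)

v = np.full(n, v_rest)
spike_times, spike_ids = [], []

for step in range(int(t_end / dt)):
    # Forward-Euler update of dv/dt = (v_rest - v + i_ext) / tau_m
    v += dt / tau_m * (v_rest - v + i_ext)
    spiked = v >= v_thresh
    if spiked.any():
        # Instantaneous current-based synapses: add weights from presynaptic spikes
        v += w[:, spiked].sum(axis=1)
        v[spiked] = v_reset
        t = step * dt
        spike_times.extend([t] * int(spiked.sum()))
        spike_ids.extend(np.flatnonzero(spiked).tolist())

print(f"{len(spike_times)} spikes from {n} neurons in {t_end * 1e3:.0f} ms")
```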