4,746 research outputs found

Multidisciplinary computational aerosciences

Author: Kutler Paul

As the challenges of single-discipline computational physics are met (computational fluid dynamics, computational structural mechanics, computational propulsion, computational aeroacoustics, computational electromagnetics, etc.), scientists have begun investigating the combination of these single disciplines into what is being called multidisciplinary computational aerosciences (MCAS). The combination of several disciplines offers not only simulation realism but also formidable computational challenges. The solution of such problems will require computers orders of magnitude more powerful than those currently available. Such computer power can only be supplied by massively parallel machines because of the current speed-of-light limitation of conventional serial systems. Even with such machines, MCAS problems will require hundreds of hours for their solution. To utilize such a machine efficiently, research is required in three areas: parallel architectures, systems software, and applications software. The main emphasis of this paper is the applications software element. Examples that demonstrate application software for multidisciplinary problems currently being solved at NASA Ames Research Center are presented. Pacing items for MCAS, such as solution methodology, physical modeling, computer power, and multidisciplinary validation experiments, are discussed.

NASA Technical Reports Server

Policy-based techniques for self-managing parallel applications

Author: Anthony Richard
Publication venue: Cambridge University Press (CUP)
Publication date: 01/09/2006

This paper presents an empirical investigation of policy-based self-management techniques for parallel applications executing in loosely-coupled environments. The dynamic and heterogeneous nature of these environments is discussed and the special considerations for parallel applications are identified. An adaptive strategy for the run-time deployment of tasks of parallel applications is presented. The strategy is based on embedding numerous policies which are informed by contextual and environmental inputs. The policies govern various aspects of behaviour, enhancing flexibility so that the goals of efficiency and performance are achieved despite high levels of environmental variability. A prototype self-managing parallel application is used as a vehicle to explore the feasibility and benefits of the strategy. In particular, several aspects of stability are investigated. The implementation and behaviour of three policies are discussed and sample results are examined.

Crossref

Greenwich Academic Literature Archive

Adjusting process count on demand for petascale global optimization

Author: Haftka Rafael T.
Radcliffe Nicholas R.
Sosonkina Masha
Trosset Michael W.
Watson Layne T.
Publication date: 01/01/2011

There are many challenges that need to be met before efficient and reliable computation at the petascale is possible. Many scientific and engineering codes running at the petascale are likely to be memory intensive, which makes thrashing a serious problem for many petascale applications. One way to overcome this challenge is to use a dynamic number of processes, so that the total amount of memory available for the computation can be increased on demand. This paper describes modifications made to the massively parallel global optimization code pVTdirect in order to allow for a dynamic number of processes. In particular, the modified version of the code monitors memory use and spawns new processes if the amount of available memory is determined to be insufficient. The primary design challenges are discussed, and performance results are presented and analyzed.

Computer Science Technical Reports @Virginia Tech
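
The memory-driven spawning idea in the abstract above can be illustrated with a small sketch. This is not the pVTdirect implementation: the function name `processes_to_spawn`, its parameters, and the assumption that every process contributes the same fixed amount of memory are all hypothetical, chosen only to show the decision "spawn more processes when pooled memory is insufficient".

```python
import math


def processes_to_spawn(required_bytes, per_process_bytes,
                       current_processes, max_processes):
    """Return how many extra processes to spawn so pooled memory covers demand.

    Hypothetical model: each process contributes `per_process_bytes` of memory,
    and the total process count may never exceed `max_processes`.
    """
    if per_process_bytes <= 0:
        raise ValueError("per_process_bytes must be positive")

    available = current_processes * per_process_bytes
    if required_bytes <= available:
        return 0  # enough memory already, nothing to spawn

    shortfall = required_bytes - available
    needed = math.ceil(shortfall / per_process_bytes)

    # Respect the process budget; never return a negative count.
    headroom = max(0, max_processes - current_processes)
    return min(needed, headroom)


if __name__ == "__main__":
    # 4 processes with 2 units each hold 8 units; 10 units are required.
    print(processes_to_spawn(10, 2, 4, 16))
```

In a real MPI code the returned count would feed a dynamic process-creation call; the sketch only covers the sizing decision.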

Distributed evolutionary algorithms and their models: A survey of the state-of-the-art

Author: Jing-Jing Li
Jun Zhang
Qingfu Zhang
Wei-Neng Chen
Yue-Jiao Gong
Yun Li
Zhi-Hui Zhan
Publication venue: Elsevier BV
Publication date: 11/05/2015

The increasing complexity of real-world optimization problems raises new challenges to evolutionary computation. Responding to these challenges, distributed evolutionary computation has received considerable attention over the past decade. This article provides a comprehensive survey of the state-of-the-art distributed evolutionary algorithms and models, which have been classified into two groups according to their task division mechanism. Population-distributed models are presented with master-slave, island, cellular, hierarchical, and pool architectures, which parallelize an evolution task at population, individual, or operation levels. Dimension-distributed models include coevolution and multi-agent models, which focus on dimension reduction. Insights into the models, such as synchronization, homogeneity, communication, topology, speedup, advantages and disadvantages are also presented and discussed. The study of these models helps guide future development of different and/or improved algorithms. Also highlighted are recent hotspots in this area, including the cloud and MapReduce-based implementations, GPU and CUDA-based implementations, distributed evolutionary multiobjective optimization, and real-world applications. Further, a number of future research directions have been discussed, with a conclusion that the development of distributed evolutionary computation will continue to flourish.

University of Essex Research Repository

Crossref

Enlighten
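
Of the population-distributed architectures named in the survey abstract above, the island model is the easiest to sketch. The example below is a minimal illustration, not code from the survey: the sphere objective, the elitist mutation step, the ring migration rule, and all function names and parameter values are assumptions. The islands are also evolved one after another in a single process, whereas a distributed implementation would run each island on its own worker.

```python
import random


def sphere(x):
    """Toy objective to minimize (assumed for illustration)."""
    return sum(v * v for v in x)


def evolve_island(pop, rng, sigma=0.1):
    """One generation: mutate each individual, keep the better of parent/child."""
    new_pop = []
    for ind in pop:
        child = [v + rng.gauss(0.0, sigma) for v in ind]
        new_pop.append(child if sphere(child) < sphere(ind) else ind)
    return new_pop


def migrate_ring(islands):
    """Copy each island's best individual over the worst of the next island."""
    bests = [min(pop, key=sphere) for pop in islands]
    for i, pop in enumerate(islands):
        incoming = bests[(i - 1) % len(islands)]
        worst = max(range(len(pop)), key=lambda j: sphere(pop[j]))
        pop[worst] = list(incoming)


def island_model(n_islands=4, pop_size=10, dim=5,
                 generations=50, interval=10, seed=0):
    """Run the islands and return (initial best fitness, final best fitness)."""
    rng = random.Random(seed)
    islands = [[[rng.uniform(-5, 5) for _ in range(dim)]
                for _ in range(pop_size)]
               for _ in range(n_islands)]
    initial_best = min(sphere(ind) for pop in islands for ind in pop)
    for gen in range(1, generations + 1):
        islands = [evolve_island(pop, rng) for pop in islands]
        if gen % interval == 0:  # periodic migration between islands
            migrate_ring(islands)
    final_best = min(sphere(ind) for pop in islands for ind in pop)
    return initial_best, final_best


if __name__ == "__main__":
    print(island_model())
```

The migration interval and topology are the main tuning choices here; the ring is only one of the topologies such models can use.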

Preparing HPC Applications for the Exascale Era: A Decoupling Strategy

Author: Gioiosa Roberto
Kestor Gokcen
Laure Erwin
Markidis Stefano
Peng Ivy Bo
Publication date: 03/08/2017

Production-quality parallel applications are often a mixture of diverse operations, such as computation- and communication-intensive, regular and irregular, tightly coupled and loosely linked operations. In conventional construction of parallel applications, each process performs all the operations, which can be inefficient and seriously limit scalability, especially at large scale. We propose a decoupling strategy to improve the scalability of applications running on large-scale systems. Our strategy separates application operations onto groups of processes and enables a dataflow processing paradigm among the groups. This mechanism is effective in reducing the impact of load imbalance and increases the parallel efficiency by pipelining multiple operations. We provide a proof-of-concept implementation using MPI, the de-facto programming system on current supercomputers. We demonstrate the effectiveness of this strategy by decoupling the reduce, particle communication, halo exchange and I/O operations in a set of scientific and data-analytics applications. A performance evaluation on 8,192 processes of a Cray XC40 supercomputer shows that the proposed approach can achieve up to 4x performance improvement. Comment: The 46th International Conference on Parallel Processing (ICPP-2017)

arXiv.org e-Print Archive

Crossref
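
The decoupling idea in the abstract above, where one group of processes computes and a separate group handles an operation such as the reduce, can be sketched in miniature. The paper's proof of concept uses MPI; the sketch below instead uses Python threads and a queue as stand-ins for process groups and dataflow between them. All names (`compute_worker`, `reduce_worker`, `run_decoupled`) and the squaring workload are hypothetical.

```python
import queue
import threading


def compute_worker(chunk, out_q):
    """Compute group: produce partial results and stream them downstream."""
    for value in chunk:
        out_q.put(value * value)  # stand-in for the real computation


def reduce_worker(in_q, n_items, result):
    """Reduce group: consume results as they arrive, overlapping with compute."""
    total = 0
    for _ in range(n_items):
        total += in_q.get()
    result.append(total)


def run_decoupled(data, n_compute=3):
    """Split `data` across compute workers and reduce it in a separate worker."""
    q = queue.Queue()
    result = []
    chunks = [data[i::n_compute] for i in range(n_compute)]

    reducer = threading.Thread(target=reduce_worker,
                               args=(q, len(data), result))
    workers = [threading.Thread(target=compute_worker, args=(c, q))
               for c in chunks]

    reducer.start()
    for w in workers:
        w.start()
    for w in workers:
        w.join()
    reducer.join()
    return result[0]


if __name__ == "__main__":
    print(run_decoupled(list(range(10))))
```

Because the reducer consumes items as soon as they are produced, slow compute workers delay only their own items rather than forcing every worker to wait at a collective step.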

An Application Perspective on High-Performance Computing and Communications

Author: Fox Geoffrey C.
Publication venue: SURFACE at Syracuse University
Publication date: 01/01/1996

We review possible and probable industrial applications of HPCC, focusing on the software and hardware issues. Thirty-three separate categories are illustrated by detailed descriptions of five areas: computational chemistry; Monte Carlo methods from physics to economics; manufacturing and computational fluid dynamics; command and control, or crisis management; and multimedia services to client computers and set-top boxes. The hardware varies from tightly-coupled parallel supercomputers to heterogeneous distributed systems. The software models span HPF and data parallelism, to distributed information systems and object/data flow parallelism on the Web. We find that in each case, it is reasonably clear that HPCC works in principle, and postulate that this knowledge can be used in a new generation of software infrastructure based on the WebWindows approach, which is discussed in an accompanying paper.

Syracuse University Research Facility and Collaborative Environment