8 research outputs found

    www.redbooks.ibm.com

    No full text
    This memory advantage would allow us to solve the same problem as PESSL using a smaller number of MPI tasks, or to solve a bigger problem using the same number of tasks.

    3.3.4 Domain splitting
    Another good example for MPI is the domain splitting method. It assumes you have a domain that you split across different MPI processes. The problem with this approach is that you have to exchange border values between the different processes, as shown in Figure 19. In general, you have to send information between processes that share a common edge; in our example, this involves communication between domains A and B and between B and C. Depending on your problem, you might also need communication between domains that share only a common corner, C and A in our example. If you have to do this, you might need a total of eight communications. There is an algorithm that, in a two-dimensional domain splitting, reduces the number of communications needed from eight unordered ones to four ordered ones. In a three-dimensional problem, you would only need six communication steps instead of 26. Figure 19 illustrates domain splitting with nine domains.

    Figure 19. Domain splitting with nine domains

    Timing table fragment (column headings missing in the extracted text):
    ssend column (gen.)   3.551   1.464   .331
    All2All column        3.654   1.279   .621
    Pack standard         3.599   1.508   .335
    Pack column           3.364   1.272   .335

    Here is a code fragment that transforms MPI_COMM_WORLD into a two-dimensional Cartesian topology, defines two MPI data types on the border, and, finally, uses MPI_Sendrecv() to do the update. The trick is to define the data types to include the corners and also transfer them. In the code fragment, the update of the corners between the communication steps is missing; this can be a problem depending on how the transposition is done.
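    Below is a minimal sketch of the kind of fragment described above, not the Redbook's original listing: it creates the two-dimensional Cartesian topology with MPI_Cart_create(), defines a strided "column" type and a contiguous "row" type that spans the halo columns so the corners are transferred as well, and then performs the four ordered MPI_Sendrecv() exchanges. The local array layout, the halo width of one, and the sizes NX and NY are illustrative assumptions.

```c
/* 2-D halo exchange sketch: Cartesian topology + border data types +
 * ordered MPI_Sendrecv() calls that also carry the corner values.
 * NX, NY, the U() macro and the array layout are assumptions made
 * for illustration; they are not taken from the Redbook listing.    */
#include <mpi.h>

#define NX 4                              /* interior rows per task    */
#define NY 5                              /* interior columns per task */
#define U(i, j) u[(i) * (NY + 2) + (j)]   /* row-major, one-cell halo  */

int main(int argc, char *argv[])
{
    int nprocs, rank;
    int dims[2] = {0, 0}, periods[2] = {0, 0};
    int north, south, west, east;
    MPI_Comm cart;
    MPI_Datatype rowtype, coltype;
    double u[(NX + 2) * (NY + 2)];

    MPI_Init(&argc, &argv);
    MPI_Comm_size(MPI_COMM_WORLD, &nprocs);

    /* map MPI_COMM_WORLD onto a 2-D Cartesian topology */
    MPI_Dims_create(nprocs, 2, dims);
    MPI_Cart_create(MPI_COMM_WORLD, 2, dims, periods, 1, &cart);
    MPI_Comm_rank(cart, &rank);                 /* rank may be reordered */
    MPI_Cart_shift(cart, 0, 1, &north, &south); /* neighbours in dim 0   */
    MPI_Cart_shift(cart, 1, 1, &west, &east);   /* neighbours in dim 1   */

    /* border types: a strided column (interior rows only) and a
       contiguous row spanning the halo columns, corners included */
    MPI_Type_vector(NX, 1, NY + 2, MPI_DOUBLE, &coltype);
    MPI_Type_commit(&coltype);
    MPI_Type_contiguous(NY + 2, MPI_DOUBLE, &rowtype);
    MPI_Type_commit(&rowtype);

    /* initialise everything, then mark the interior with the rank */
    for (int i = 0; i < (NX + 2) * (NY + 2); i++) u[i] = 0.0;
    for (int i = 1; i <= NX; i++)
        for (int j = 1; j <= NY; j++)
            U(i, j) = (double)rank;

    /* step 1: exchange east/west borders (interior rows only) */
    MPI_Sendrecv(&U(1, NY),     1, coltype, east, 0,
                 &U(1, 0),      1, coltype, west, 0, cart, MPI_STATUS_IGNORE);
    MPI_Sendrecv(&U(1, 1),      1, coltype, west, 1,
                 &U(1, NY + 1), 1, coltype, east, 1, cart, MPI_STATUS_IGNORE);

    /* step 2: exchange north/south borders as full rows; the halo
       columns filled in step 1 ride along, so the corner values reach
       the diagonal neighbours with only four messages in total */
    MPI_Sendrecv(&U(NX, 0),     1, rowtype, south, 2,
                 &U(0, 0),      1, rowtype, north, 2, cart, MPI_STATUS_IGNORE);
    MPI_Sendrecv(&U(1, 0),      1, rowtype, north, 3,
                 &U(NX + 1, 0), 1, rowtype, south, 3, cart, MPI_STATUS_IGNORE);

    MPI_Type_free(&coltype);
    MPI_Type_free(&rowtype);
    MPI_Comm_free(&cart);
    MPI_Finalize();
    return 0;
}
```

    Note that the ordering does the corner work for free: because the second pair of exchanges sends rows that already contain the freshly received ghost columns, only four messages are needed in 2-D (and six in 3-D) instead of eight (or 26).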

    Abstract

    No full text
    The BlueGene/L supercomputer is expected to deliver new levels of application performance by providing a combination of good single-node computational performance and high scalability. To achieve good single-node performance, the BlueGene/L design includes a special dual floating-point unit on each processor and the ability to use two processors per node. BlueGene/L also includes both a torus and a tree network to achieve high scalability. We demonstrate how benchmarks and applications can take advantage of these architectural features to get the most out of BlueGene/L.

    Early Experience with Scientific Applications on the Blue Gene/L Supercomputer

    No full text
    Abstract. Blue Gene/L uses a large number of low power processors, together with multiple integrated interconnection networks, to build a supercomputer with low cost, space and power consumption. It uses a novel system software architecture designed with application scalability in mind. However, whether real applications will scale to tens of thousands of processors has been an open question. In this paper, we describe early experience with several applications on a 16,384 node Blue Gene/L system. This study establishes that applications from a broad variety of scientific disciplines can effectively scale to thousands of processors. The results reported in this study represent the highest performance ever demonstrated for most of these applications, and in fact, show effective scaling for the first time ever on thousands of processors.

    Scaling Physics and Material Science Applications on a Massively Parallel Blue Gene/L System

    No full text
    Blue Gene/L represents a new way to build supercomputers, using a large number of low power processors, together with multiple integrated interconnection networks. Whether real applications can scale to tens of thousands of processors (on a machine like Blue Gene/L) has been an open question. In this paper, we describe early experience with several physics and material science applications on a 32,768 node Blue Gene/L system, which was installed recently at the Lawrence Livermore National Laboratory. Our study shows some problems in the applications and in the current software implementation, but, overall, excellent scaling of these applications to 32K nodes on the current Blue Gene/L system. While there is clearly room for improvement, these results represent the first proof point that MPI applications can effectively scale to over ten thousand processors. They also validate the scalability of the hardware and software architecture of Blue Gene/L. Categories and Subject Descriptors: J.2 [Computer Applications]: Physical Sciences and Engineering