93 research outputs found

Improving the scalability of parallel N-body applications with an event driven constraint based execution model

Author: Aarseth SJ
Alfieri RA
Bonachea D
Chandra R
Dekate C
El-Ghazawi T
Hewitt C
Kale L
Message Passing Interface Forum
O’Shea BW
Salmon JK
Singh JP
Publication venue: 'SAGE Publications'
Publication date: 23/09/2011

The scalability and efficiency of graph applications are significantly constrained by conventional systems and their supporting programming models. Technology trends like multicore, manycore, and heterogeneous system architectures are introducing further challenges and possibilities for emerging application domains such as graph applications. This paper explores the space of effective parallel execution of ephemeral graphs that are dynamically generated using the Barnes-Hut algorithm to exemplify dynamic workloads. The workloads are expressed using the semantics of an Exascale computing execution model called ParalleX. For comparison, results using conventional execution model semantics are also presented. We find improved load balancing during runtime and automatic parallelism discovery improving efficiency using the advanced semantics for Exascale computing.
Comment: 11 figures

arXiv.org e-Print Archive

Crossref

UPC++: A high-performance communication framework for asynchronous computation

Author: Bachan J
Baden SB
Hofmeyr S
Jacquelin M
Kamil A
Bonachea D
Hargrove PH
Ahmed H
Publication venue: eScholarship, University of California
Publication date: 01/01/1968

UPC++ is a C++ library that supports high-performance computation via an asynchronous communication framework. This paper describes a new incarnation that differs substantially from its predecessor, and we discuss the reasons for our design decisions. We present new design features, including future-based asynchrony management, distributed objects, and generalized Remote Procedure Call (RPC). We show microbenchmark performance results demonstrating that one-sided Remote Memory Access (RMA) in UPC++ is competitive with MPI-3 RMA; on a Cray XC40 UPC++ delivers up to a 25% improvement in the latency of blocking RMA put, and up to a 33% bandwidth improvement in an RMA throughput test. We showcase the benefits of UPC++ with irregular applications through a pair of application motifs, a distributed hash table and a sparse solver component. Our distributed hash table in UPC++ delivers near-linear weak scaling up to 34816 cores of a Cray XC40. Our UPC++ implementation of the sparse solver component shows robust strong scaling up to 2048 cores, where it outperforms variants communicating using MPI by up to 3.1x. UPC++ encourages the use of aggressive asynchrony in low-overhead RMA and RPC, improving programmer productivity and delivering high performance in irregular applications.

The University of Utah: J. Willard Marriott Digital Library

eScholarship - University of California

Targeted next generation sequencing makes new molecular diagnoses and expands genotype-phenotype relationship in Ehlers-Danlos syndrome

Author: A De Paepe
A McKenna
Abdulshakur Abdullah
AM Vandersteen
Anthony Vandersteen
BL Loeys
Christina Kanonidou
CL Phillips
D van der Linde
David Ferguson
David Ross
EM Bonachea
F Malfait
F. Michael Pope
GR Monroe
Hanadi Kazkaz
Holly A. Black
J Halbritter
J Vandrovcova
Jana Vandrovcova
Jennifer Biggs
JM Bourhis
KT Ong
L Nuytinck
M Pepin
MG Pepin
Michael Mueller
MJ Pickup
ML Metzker
Neeti Ghali
Nicholas J. Cheshire
NS Stembridge
P Beighton
Penny Norsworthy
Piyush Gampawar
R Derynck
R Morissette
Rodney Grahame
Ruwan A. Weerakkody
S Kirmani
S Proske
S Richards
Timothy J. Aitman
Yousef Ibrahim
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 24/03/2016

Crossref

Edinburgh Research Explorer

When and how to develop domain-specific languages

Author: Anlauff M.
Anthony M. Sloane
Antoniotti M.
Attali I.
Aycock J.
Backus J. W.
Badros G.
Bagge O. S.
Batory D.
Baxter I. D.
Biggerstaff T. J.
Bonachea D.
Braband C.
Bravenboer M.
Bruntink M.
Buffenbarger J.
Chiba S.
Clements J.
Consel C.
Cordy J. R.
Courbis C.
Crew R. F.
Czarnecki K.
de Jonge M.
Faith R. E.
Falbo R. A.
Felleisen M.
Fertalj K.
Frakes W.
Gamma E.
Germon R.
Gil J.
Gilmore S.
Gondow K.
Granicz A.
Gray J.
Greenfield J.
Guyer S. Z.
HICSS
Hudak P.
Jan Heering
Jennings J.
Kadhim B. M.
Kamin S.
Kamin S. Ed.
Kieburtz R. B.
Kienle H. M.
Kumar S.
Kutter P. W.
Lengauer C.
Marjan Mernik
Martin J.
Mauw S.
Mernik M.
Moura J. M. F.
Nakatani L.
Neighbors J. M.
Peyton Jones S.
Pfahler P.
Raymond E. S.
Risi W.
Salus P. H. Ed.
Sammet J. E.
Saraiva J.
Schnarr E.
Schneider K. A.
Schupp S.
Simos M.
Sirer E. G.
Sloane A. M.
Smaragdakis Y.
Soroker D.
Sutcliffe A.
Tennent R. D.
Thatte S.
Thibault S. A.
USENIX
van den Brand M. G. J.
van Deursen A.
van Engelen R.
Wang D. C.
Wile D. S.
Xiong J.
Publication venue: 'Association for Computing Machinery (ACM)'

Crossref

High-performance file I/O in Java: Existing approaches and bulk I/O extensions

Author: Bonachea D
Publication date: 20/12/2017

Ezid

GASNet Specification, v1.1

Author: Bonachea D
Publication venue: eScholarship, University of California
Publication date: 01/01/2002

This document has been superseded by: GASNet Specification, v1.8.1 (LBNL-2001064), https://escholarship.org/uc/item/03b5g0q4

This GASNet specification describes a network-independent and language-independent high-performance communication interface intended for use in implementing the runtime system for global address space languages (such as UPC or Titanium).

Ezid

eScholarship - University of California

Proposal for Extending the UPC Memory Copy Library Functions and Supporting Extensions to GASNet, v2.0 ..

Author: Bonachea D
Publication date: 12/12/2022

Ezid

Bulk file I/O extensions to Java

Author: Bonachea D
Publication venue: eScholarship, University of California
Publication date: 01/12/2000

The file I/O classes present in Java have proven too inefficient to meet the demands of high-performance applications that perform large amounts of I/O. The inefficiencies stem primarily from the library interface which requires programs to read arrays a single element at a time. We present two extensions to the Java I/O libraries which alleviate this problem. The first adds bulk (array) I/O operations to the existing libraries, removing much of the overhead currently associated with array I/O. The second is a new library that adds direct support for asynchronous I/O to enable masking I/O latency with overlapped computation. The extensions were implemented in Titanium, a high-performance, parallel dialect of Java. We present experimental results that compare the performance of the extensions with the existing I/O libraries on a simple, external merge sort application. The results demonstrate that our extensions deliver vastly superior I/O performance for this array-based application.

eScholarship - University of California

Hancock: A language for processing very large-scale data

Author: Bonachea D
Publication date: 15/11/2022

Ezid

Optimized collectives for PGAS languages with one-sided communication

Author: Bonachea D
Publication date: 20/12/2017

Ezid