    Effectively Mapping Linguistic Abstractions for Message-passing Concurrency to Threads on the Java Virtual Machine

    Efficient mapping of message-passing concurrency (MPC) abstractions to Java Virtual Machine (JVM) threads is critical for performance, scalability, and CPU utilization, but tedious and time-consuming to perform manually. In general, this mapping cannot be found in polynomial time, but we show that by exploiting the local characteristics of MPC abstractions and their communication patterns it can be determined effectively. We describe our MPC-abstraction-to-thread mapping technique, its realization in two frameworks (Panini and Akka), and its rigorous evaluation using several benchmarks from representative MPC frameworks. We also compare our technique against four default mapping techniques: thread-all, round-robin-task-all, random-task-all, and work-stealing. Our evaluation shows that our mapping technique can improve performance by 30%-60% over the default techniques. These improvements come from a number of challenges addressed by our technique, namely: i) balancing computations across JVM threads, ii) reducing communication overheads, iii) exploiting information about cache locality, and iv) mapping MPC abstractions to threads in a way that reduces contention between JVM threads.
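
    To make the baseline concrete, below is a minimal, hypothetical C++ sketch of the round-robin-task-all mapping the paper compares against, where actor i is serviced by worker thread i mod nThreads. All names here (Actor, mailbox, nThreads) are assumptions for illustration; this is not the Panini or Akka implementation.

        #include <algorithm>
        #include <cstddef>
        #include <cstdio>
        #include <functional>
        #include <mutex>
        #include <queue>
        #include <thread>
        #include <vector>

        // Hypothetical actor: a mailbox of pending messages plus its guard.
        struct Actor {
            std::queue<std::function<void()>> mailbox;
            std::mutex m;
        };

        int main() {
            const unsigned nThreads = std::max(1u, std::thread::hardware_concurrency());
            std::vector<Actor> actors(4 * nThreads); // more abstractions than threads

            // Post one message per actor.
            for (std::size_t i = 0; i < actors.size(); ++i)
                actors[i].mailbox.push([i] { std::printf("actor %zu ran\n", i); });

            // Round-robin mapping: worker w drains actors w, w+nThreads, w+2*nThreads, ...
            // A smarter mapping would also balance load and co-locate actors that
            // communicate heavily, which is what the paper's technique optimizes for.
            std::vector<std::thread> workers;
            for (unsigned w = 0; w < nThreads; ++w)
                workers.emplace_back([&actors, w, nThreads] {
                    for (std::size_t i = w; i < actors.size(); i += nThreads) {
                        std::lock_guard<std::mutex> lk(actors[i].m);
                        while (!actors[i].mailbox.empty()) {
                            actors[i].mailbox.front()(); // process one message
                            actors[i].mailbox.pop();
                        }
                    }
                });
            for (auto& t : workers) t.join();
        }

    Under such a fixed assignment, one heavyweight actor can stall its worker while others sit idle, which is exactly the kind of imbalance and contention the paper's technique aims to avoid.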

    On parallel Branch and Bound frameworks for Global Optimization

    Branch and Bound (B&B) algorithms are known to exhibit irregular search trees, which makes developing a parallel version of such algorithms a challenge. The efficiency of a B&B algorithm depends on the chosen Branching, Bounding, Selection, Rejection, and Termination rules. The question we investigate is how the chosen platform, consisting of the programming language, the libraries used, or skeletons, influences programming effort and algorithm performance. In frameworks with a high level of abstraction, the selection rule and data-management structures are usually hidden from the programmer, as is the load-balancing strategy when the algorithm runs in parallel. We investigate this question by implementing a multidimensional Global Optimization B&B algorithm with the help of three frameworks at different levels of abstraction (from more to less): Bobpp, Threading Building Blocks (TBB), and a customized Pthread implementation. The following has been found: the Bobpp implementation is easy to code but exhibits the poorest scalability. In contrast, the TBB and Pthread implementations scale almost linearly on the platform used, with the TBB approach showing slightly better productivity.
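
    As a hedged illustration of how the five rules map onto a task-parallel implementation, here is a small C++/TBB sketch of an interval B&B for a toy one-dimensional problem; the objective f, the Lipschitz bound, and the tolerance are assumptions made for this example, not the paper's multidimensional benchmark.

        #include <atomic>
        #include <cstdio>
        #include <tbb/task_group.h>

        // Toy objective and bounds -- illustrative assumptions only.
        static double f(double x) { return x * x; }
        constexpr double L   = 4.0;   // Lipschitz constant of f on [-2, 2]
        constexpr double eps = 1e-6;  // termination rule: stop on tiny intervals

        std::atomic<double> best{1e100}; // incumbent (best value found so far)

        void branch(double a, double b, tbb::task_group& tg) {
            const double mid = 0.5 * (a + b);
            const double ub  = f(mid);            // bounding: upper bound at midpoint
            double old = best.load();
            while (ub < old && !best.compare_exchange_weak(old, ub)) {} // update incumbent
            const double lb = ub - L * (b - a) * 0.5; // Lipschitz lower bound
            if (lb >= best.load() || b - a < eps) return; // rejection / termination rules
            // Branching rule: bisect; TBB's work-stealing scheduler supplies the
            // load balancing that higher-level frameworks hide from the programmer.
            tg.run([a, mid, &tg] { branch(a, mid, tg); });
            tg.run([mid, b, &tg] { branch(mid, b, tg); });
        }

        int main() {
            tbb::task_group tg;
            tg.run([&tg] { branch(-2.0, 2.0, tg); });
            tg.wait();
            std::printf("minimum ~= %.8f\n", best.load());
        }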

    Comparison of Three Popular Parallel Programming Models on the Intel Xeon Phi

    Systems with large numbers of cores have become commonplace, and applications are accordingly shifting towards increased parallelism. In a general-purpose system, the applications residing in it compete for shared resources, so thread and task scheduling in such a multithreaded, multiprogrammed environment is a significant challenge. In this study, we chose the Intel Xeon Phi as a modern platform to explore how popular parallel programming models, namely OpenMP, Intel Cilk Plus, and Intel TBB (Threading Building Blocks), scale on manycore architectures. We used three benchmarks with different characteristics that exercise different aspects of system performance. Moreover, a multiprogramming scenario is used to compare the behaviours of these models when all three applications reside in the system. Our initial results show that it is, to some extent, possible to infer multiprogramming performance from the single-program cases.
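
    For a flavour of how the programming models differ in style, the following illustrative C++ fragment expresses one reduction kernel in two of the three models compared, OpenMP and Intel TBB (Cilk Plus is omitted here, as it has since been deprecated in mainstream compilers); the kernel and problem size are made up for the example.

        #include <cstddef>
        #include <cstdio>
        #include <functional>
        #include <vector>
        #include <tbb/blocked_range.h>
        #include <tbb/parallel_reduce.h>

        int main() {
            const std::size_t n = 1 << 20;
            std::vector<double> a(n, 1.0), b(n, 2.0);

            // OpenMP: directive-based; the compiler/runtime schedules loop iterations.
            double omp_sum = 0.0;
            #pragma omp parallel for reduction(+ : omp_sum)
            for (long i = 0; i < (long)n; ++i) omp_sum += a[i] * b[i];

            // TBB: library-based; tasks over ranges with a work-stealing scheduler.
            const double tbb_sum = tbb::parallel_reduce(
                tbb::blocked_range<std::size_t>(0, n), 0.0,
                [&](const tbb::blocked_range<std::size_t>& r, double acc) {
                    for (std::size_t i = r.begin(); i != r.end(); ++i)
                        acc += a[i] * b[i];
                    return acc;
                },
                std::plus<double>());

            std::printf("OpenMP: %.1f  TBB: %.1f\n", omp_sum, tbb_sum);
        }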