Session-Based Programming for Parallel Algorithms: Expressiveness and Performance
This paper investigates session programming and typing of benchmark examples
to compare productivity, safety and performance with other communications
programming languages. Parallel algorithms are used to examine the above
aspects due to their extensive use of message passing for interaction, and
their increasing prominence in algorithmic research with the rising
availability of hardware resources such as multicore machines and clusters. We
contribute new benchmark results for SJ, an extension of Java for type-safe,
binary session programming, against MPJ Express, a Java messaging system based
on the MPI standard. In conclusion, we observe that (1) despite rich libraries
and functionality, MPI remains a low-level API and can suffer from commonly
perceived disadvantages of explicit message passing, such as deadlocks and
unexpected message types, and (2) high-level session abstraction, which
significantly improves program structure, readability, and reliability,
together with session type-safety, can greatly facilitate the task of
communications programming whilst retaining competitive performance.
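The "unexpected message types" hazard can be sketched in plain Python (this is not SJ or MPJ Express code; the channel and helper names are illustrative assumptions). An untyped channel accepts any payload, so a type mismatch only surfaces when the receiver uses the value; a session-style discipline fixes the type of each message in the protocol and rejects violations at the communication boundary, which session typing moves all the way to compile time:

```python
# Illustrative sketch: a queue as a message channel, with helpers that
# enforce a declared message type, mimicking (at runtime) the guarantee
# that session typing provides statically.
from queue import Queue

channel = Queue()

def typed_send(ch, value, expected_type):
    # The protocol fixes the type of this message; sending anything else
    # is a protocol violation and is rejected immediately.
    if not isinstance(value, expected_type):
        raise TypeError(f"protocol expects {expected_type.__name__}, "
                        f"got {type(value).__name__}")
    ch.put(value)

def typed_recv(ch, expected_type):
    value = ch.get()
    # Receiving side checks the same declared type.
    if not isinstance(value, expected_type):
        raise TypeError(f"received {type(value).__name__}, "
                        f"expected {expected_type.__name__}")
    return value

typed_send(channel, 42, int)
assert typed_recv(channel, int) == 42
```

In SJ the analogous protocol would be declared once as a session type and checked by the compiler, so neither endpoint can send or expect the wrong message type.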
Learning Transfers over Several Programming Languages
Large language models (LLMs) have recently become remarkably good at
improving developer productivity for high-resource programming languages. These
models use two kinds of data: large amounts of unlabeled code samples for
pretraining and relatively smaller amounts of labeled code samples for
fine-tuning or in-context learning. Unfortunately, many programming languages
are low-resource, lacking labeled samples for most tasks and often even lacking
unlabeled samples. Therefore, users of low-resource languages (e.g., legacy or
new languages) miss out on the benefits of LLMs. Cross-lingual transfer
learning uses data from a source language to improve model performance on a
target language. It has been well-studied for natural languages, but has
received little attention for programming languages. This paper reports
extensive experiments on four tasks using a transformer-based LLM and 11 to 41
programming languages to explore the following questions: first, how well
cross-lingual transfer works for a given task across different language pairs;
second, given a task and target language, how best to choose a source language;
third, which characteristics of a language pair are predictive of transfer
performance; and fourth, how that depends on the given task.
Comment: 16 pages, 5 figures, 5 tables
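One very simple characteristic a language pair can be scored on is surface overlap between code corpora. The sketch below is a hypothetical illustration, not the paper's actual predictor: it ranks candidate source languages for a target language by Jaccard similarity over token sets, with made-up token samples standing in for real corpora:

```python
# Hypothetical heuristic: prefer the source language whose code tokens
# overlap most with the target language's tokens (Jaccard similarity).
def jaccard(a, b):
    a, b = set(a), set(b)
    return len(a & b) / len(a | b)

target = "fn main () { println }".split()
candidates = {
    "rustlike": "fn main () { print }".split(),
    "lisplike": "( defun main defun )".split(),
}

best = max(candidates, key=lambda name: jaccard(candidates[name], target))
assert best == "rustlike"  # shares far more surface tokens with the target
```

Real predictors would combine many such pair features (syntax, keywords, corpus size) and, per the paper's fourth question, weight them differently per task.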
An Extension of the StarSs Programming Model for Platforms with Multiple GPUs
While general-purpose homogeneous multi-core architectures are becoming ubiquitous, there are clear indications that, for a number of important applications, a better performance/power ratio can be attained using specialized hardware accelerators. These accelerators require specific SDKs or programming languages which are not always easy to program. Thus, the impact of the new programming paradigms on the programmer's productivity will determine their success in the high-performance computing arena. In this paper we present GPU Superscalar (GPUSs), an extension of the Star Superscalar programming model that targets the parallelization of applications on platforms consisting of a general-purpose processor connected with multiple graphics processors. GPUSs deals with architecture heterogeneity and separate memory address spaces, while preserving simplicity and portability. Preliminary experimental results for a well-known operation in numerical linear algebra illustrate the correct adaptation of the runtime to a multi-GPU system, attaining notable performance results.
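The core idea of StarSs-style models is dependence-driven task scheduling: each task declares its inputs and outputs, and the runtime launches a task only once its inputs have been produced. A minimal sketch of that scheduling idea (not the GPUSs runtime, which additionally handles GPU memory transfers and heterogeneity; the task representation here is an assumption for illustration):

```python
# Illustrative sketch of dependence-driven scheduling: a task becomes
# ready when every one of its declared inputs has been produced.
def run_tasks(tasks):
    data, pending = {}, list(tasks)
    while pending:
        ready = [t for t in pending if all(i in data for i in t["inputs"])]
        if not ready:
            raise RuntimeError("cyclic or unsatisfiable dependences")
        for t in ready:
            args = [data[i] for i in t["inputs"]]
            data[t["output"]] = t["fn"](*args)  # a real runtime would offload this to a GPU
            pending.remove(t)
    return data

tasks = [
    {"inputs": ["a", "b"], "output": "c", "fn": lambda a, b: a + b},
    {"inputs": [], "output": "a", "fn": lambda: 2},
    {"inputs": [], "output": "b", "fn": lambda: 3},
]
# "c" runs last even though it is listed first, because it depends on "a" and "b".
assert run_tasks(tasks)["c"] == 5
```

In GPUSs the programmer annotates functions with their data directions and the runtime builds this dependence graph automatically, also deciding on which GPU each ready task executes and moving data between the separate address spaces.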
A comparative study of high-productivity high-performance programming languages for parallel metaheuristics
Parallel metaheuristics require programming languages that provide both high performance and a high level of programmability. This paper aims at providing a useful data point to help practitioners gauge the difficult question of whether to invest time and effort into learning and using a new programming language. To accomplish this objective, three productivity-aware languages (Chapel, Julia, and Python) are compared in terms of performance, scalability and productivity. To the best of our knowledge, this is the first time such a comparison is performed in the context of parallel metaheuristics. As a test-case, we implement two parallel metaheuristics in the three languages for solving the 3D Quadratic Assignment Problem (Q3AP), using thread-based parallelism on a multi-core shared-memory computer. We also evaluate and compare the performance of the three languages for a parallel fitness evaluation loop, using four different test-functions with different computational characteristics. Besides providing a comparative study, we give feedback on the implementation and parallelization process in each language.
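The kind of parallel fitness-evaluation loop benchmarked in such studies can be sketched in Python, one of the three compared languages. The test function below (sum of squares) is an illustrative stand-in, not one of the paper's four test-functions, and note that for CPU-bound pure-Python code the thread-based version is constrained by the GIL, which is exactly the sort of performance characteristic such a comparison exposes:

```python
# Minimal sketch: evaluate the fitness of every candidate in a population
# in parallel using a thread pool (thread-based parallelism, as in the study).
from concurrent.futures import ThreadPoolExecutor

def fitness(candidate):
    # Illustrative stand-in test-function: sum of squares.
    return sum(x * x for x in candidate)

population = [[1, 2], [3, 4], [0, 0]]

with ThreadPoolExecutor(max_workers=4) as pool:
    scores = list(pool.map(fitness, population))

assert scores == [5, 25, 0]
```

Chapel and Julia express the same loop with native parallel-for constructs (e.g. `forall` in Chapel, `Threads.@threads` in Julia), which is where much of the productivity comparison lies.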