Search CORE

2,005 research outputs found

Towards high-level execution primitives for and-parallelism: preliminary results

Author: Carro Liñares Manuel
Casas Amadeo
Hermenegildo Manuel V.
Publication venue: Facultad de Informática (UPM)
Publication date: 01/01/2007
Field of study

Most implementations of parallel logic programming rely on complex low-level machinery which is arguably difflcult to implement and modify. We explore an alternative approach aimed at taming that complexity by raising core parts of the implementation to the source language level for the particular case of and-parallelism. Therefore, we handle a signiflcant portion of the parallel implementation mechanism at the Prolog level with the help of a comparatively small number of concurrency-related primitives which take care of lower-level tasks such as locking, thread management, stack set management, etc. The approach does not eliminate altogether modiflcations to the abstract machine, but it does greatly simplify them and it also facilitates experimenting with different alternatives. We show how this approach allows implementing both restricted and unrestricted (i.e., non fork-join) parallelism. Preliminary experiments show that the amount of performance sacriflced is reasonable, although granularity control is required in some cases. Also, we observe that the availability of unrestricted parallelism contributes to better observed speedups

CiteSeerX

Archivo Digital UPM

Towards a High-Level Implementation of Execution Primitives for Unrestricted, Independent And-Parallelism

Author: Carro Liñares Manuel
Casas Amadeo
Hermenegildo Manuel V.
Publication venue: Facultad de Informática (UPM)
Publication date: 01/01/2007
Field of study

Most efficient implementations of parallel logic programming rely on complex low-level machinery which is arguably difficult to implement and modify. We explore an alternative approach aimed at taming that complexity by raising core parts of the implementation to the source language level for the particular case of and-parallellism. We handle a significant portion of the parallel implementation at the Prolog level with the help of a comparatively small number of concurrency.related primitives which take case of lower-level tasks such as locking, thread management, stack set management, etc. The approach does not eliminate altogether modifications to the abstract machine, but it does greatly simplify them and it also facilitates experimenting with different alternatives. We show how this approach allows implementing both restricted and unrestricted (i.e., non fork-join) parallelism. Preliminary esperiments show thay the performance safcrifieced is reasonable, although granularity of unrestricted parallelism contributes to better observed speedups

CiteSeerX

Archivo Digital UPM

PLACES'10: The 3rd Workshop on Programmng Language Approaches to concurrency and Communication-Centric Software

Author: Honda Kohei
Mycroft Alan
Publication venue
Publication date: 30/12/2013
Field of study

Paphos, Cyprus. March 201

Queen Mary Research Online

Lock-free Concurrent Data Structures

Author: Cederman Daniel
Gidenstam Anders
Ha Phuong
Papatriantafilou Marina
Sundell Håkan
Tsigas Philippas
Publication venue
Publication date: 01/01/2013
Field of study

Concurrent data structures are the data sharing side of parallel programming. Data structures give the means to the program to store data, but also provide operations to the program to access and manipulate these data. These operations are implemented through algorithms that have to be efficient. In the sequential setting, data structures are crucially important for the performance of the respective computation. In the parallel programming setting, their importance becomes more crucial because of the increased use of data and resource sharing for utilizing parallelism. The first and main goal of this chapter is to provide a sufficient background and intuition to help the interested reader to navigate in the complex research area of lock-free data structures. The second goal is to offer the programmer familiarity to the subject that will allow her to use truly concurrent methods.Comment: To appear in "Programming Multi-core and Many-core Computing Systems", eds. S. Pllana and F. Xhafa, Wiley Series on Parallel and Distributed Computin

arXiv.org e-Print Archive

Chalmers Research

MPF: A portable message passing facility for shared memory multiprocessors

Author: Malony Allen D.
Mcguire Patrick J.
Reed Daniel A.
Publication venue
Publication date
Field of study

The design, implementation, and performance evaluation of a message passing facility (MPF) for shared memory multiprocessors are presented. The MPF is based on a message passing model conceptually similar to conversations. Participants (parallel processors) can enter or leave a conversation at any time. The message passing primitives for this model are implemented as a portable library of C function calls. The MPF is currently operational on a Sequent Balance 21000, and several parallel applications were developed and tested. Several simple benchmark programs are presented to establish interprocess communication performance for common patterns of interprocess communication. Finally, performance figures are presented for two parallel applications, linear systems solution, and iterative solution of partial differential equations

NASA Technical Reports Server

Threads and Or-Parallelism Unified

Author: Carro
Correia
Gupta
INÊS DUTRA
Moura
Pontelli
RICARDO ROCHA
Rocha
Santos Costa
Santos Costa
Shen
Stevens
VíTOR SANTOS COSTA
Publication venue: 'Cambridge University Press (CUP)'
Publication date: 26/07/2010
Field of study

One of the main advantages of Logic Programming (LP) is that it provides an excellent framework for the parallel execution of programs. In this work we investigate novel techniques to efficiently exploit parallelism from real-world applications in low cost multi-core architectures. To achieve these goals, we revive and redesign the YapOr system to exploit or-parallelism based on a multi-threaded implementation. Our new approach takes full advantage of the state-of-the-art fast and optimized YAP Prolog engine and shares the underlying execution environment, scheduler and most of the data structures used to support YapOr's model. Initial experiments with our new approach consistently achieve almost linear speedups for most of the applications, proving itself as a good alternative for exploiting implicit parallelism in the currently available low cost multi-core architectures.Comment: 17 pages, 21 figures, International Conference on Logic Programming (ICLP 2010

arXiv.org e-Print Archive

Crossref

A Case Study in Tightly Coupled Multi-paradigm Parallel Programming

Author: A. Ferrari
A. Gursoy
A.L. Lastovetsky
C.-C. Chiang
G. Zheng
J. Leichtl
J. Nieplocha
L.V. Kale
R. Abedi
S.-E. Choi
T. El-Ghazawi
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2008
Field of study

Crossref

{VSync}: {P}ush-Button Verification and Optimization for Synchronization Primitives on Weak Memory Models

Author: Behrens D.
Bhat K.
Chen H.
Fu M.
Kim J.
Lima de Chehab R.
Oberhauser J.
Oberhauser L.
Paolillo A.
Vafeiadis V.
Wen Y.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2021
Field of study

MPG.PuRe

Concurrent data representation synthesis

Author: Alex Aiken
Herlihy Maurice
Hicks Michael
Kathleen Fisher
Martin Rinard
Mooly Sagiv
Peter Hawkins
Smaragdakis Yannis
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2012
Field of study

We describe an approach for synthesizing data representations for concurrent programs. Our compiler takes as input a program written using concurrent relations and synthesizes a representation of the relations as sets of cooperating data structures as well as the placement and acquisition of locks to synchronize concurrent access to those data structures. The resulting code is correct by construction: individual relational operations are implemented correctly and the aggregate set of operations is serializable and deadlock free. The relational specification also permits a high-level optimizer to choose the best performing of many possible legal data representations and locking strategies, which we demonstrate with an experiment autotuning a graph benchmark

CiteSeerX

DSpace@MIT

Crossref