Search CORE

10,683 research outputs found

Transformation of functional programs for identification of parallel skeletons

Author: Kannan Venkatesh
Publication venue: Dublin City University. School of Computing
Publication date: 01/01/2017
Field of study

Hardware is becoming increasingly parallel. Thus, it is essential to identify and exploit inherent parallelism in a given program to effectively utilise the computing power available. However, parallel programming is tedious and error-prone when done by hand, and is very difficult for a compiler to do automatically to the desired level. One possible approach to parallel programming is to use transformation techniques to automatically identify and explicitly specify parallel computations in a given program using parallelisable algorithmic skeletons. Current existing methods for systematic derivation of parallel programs or parallel skeleton identification allow automation. However, they place constraints on the programs to which they are applicable, require manual derivation of operators with specific properties for parallel execution, or allow the use of inefficient intermediate data structures in the parallel programs. In this thesis, we present a program transformation method that addresses these issues and has the following attributes: (1) Reduces the number of inefficient data structures used in the parallel program; (2) Transforms a program into a form that is more suited to identifying parallel skeletons; (3) Automatically identifies skeletons that can be efficiently executed using their parallel implementations. Our transformation method does not place restrictions on the program to be parallelised, and allows automatic verification of skeleton operator properties to allow parallel execution. To evaluate the performance of our transformation method, we use a set of benchmark programs. The parallel version of each program produced by our method is compared with other versions of the program, including parallel versions that are derived by hand. Consequently, we have been able to evaluate the strengths and weaknesses of the proposed transformation method. The results demonstrate improvements in the efficiency of parallel programs produced in some examples, and also highlight the role of some intermediate data structures required for parallelisation in other examples

Irish Universities

DCU Online Research Access Service

MaSiF: Machine learning guided auto-tuning of parallel skeletons

Author: Cole Murray
Collins Alexander
Fensch Christian
Leather Hugh
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/12/2013
Field of study

Heriot Watt Pure

Costing JIT Traces

Author: Maier Patrick
Morton John Magnus
Trinder Philip
Publication venue: School of Computing Science, University of Glasgow
Publication date
Field of study

Tracing JIT compilation generates units of compilation that are easy to analyse and are known to execute frequently. The AJITPar project aims to investigate whether the information in JIT traces can be used to make better scheduling decisions or perform code transformations to adapt the code for a specific parallel architecture. To achieve this goal, a cost model must be developed to estimate the execution time of an individual trace. This paper presents the design and implementation of a system for extracting JIT trace information from the Pycket JIT compiler. We define three increasingly parametric cost models for Pycket traces. We perform a search of the cost model parameter space using genetic algorithms to identify the best weightings for those parameters. We test the accuracy of these cost models for predicting the cost of individual traces on a set of loop-based micro-benchmarks. We also compare the accuracy of the cost models for predicting whole program execution time over the Pycket benchmark suite. Our results show that the weighted cost model using the weightings found from the genetic algorithm search has the best accuracy

Enlighten

Safe Concurrency Introduction through Slicing

Author: Bozó I.
Cesarini F.
Cheng J.
Nyblom P.
Tóth M.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2015
Field of study

Traditional refactoring is about modifying the structure of existing code without changing its behaviour, but with the aim of making code easier to understand, modify, or reuse. In this paper, we introduce three novel refactorings for retrofitting concurrency to Erlang applications, and demonstrate how the use of program slicing makes the automation of these refactorings possible

CiteSeerX

Crossref

Kent Academic Repository

JIT-Based cost analysis for dynamic program transformations

Author: Abella
Albert
Albert
Albert
Aspinall
Autili
Aycock
Barthe
Bauman
Bolz
Bolz
Bozó
Brandolese
Brown
Cole
Dal Lago
Ferdinand
Hao
Hofmann
John Magnus Morton
Jones
Kersten
Lewis Steele
Magnus Morton
Magnus Morton
Maier
Maier
Pape
Patrick Maier
Peyton Jones
Phil Trinder
Rodrigues
Scaife
Spoonhower
Trinder
Wilhelm
Publication venue: 'Elsevier BV'
Publication date: 01/12/2016
Field of study

Tracing JIT compilation generates units of compilation that are easy to analyse and are known to execute frequently. The AJITPar project investigates whether the information in JIT traces can be used to dynamically transform programs for a specific parallel architecture. Hence a lightweight cost model is required for JIT traces. This paper presents the design and implementation of a system for extracting JIT trace information from the Pycket JIT compiler. We define three increasingly parametric cost models for Pycket traces. We determine the best weights for the cost model parameters using linear regression. We evaluate the effectiveness of the cost models for predicting the relative costs of transformed programs

Crossref

Sheffield Hallam University Research Archive

Edinburgh Research Explorer

Enlighten

Source-to-Source Transformations for Parallel Optimizations in STAPL

Author: Kelley Brian M
Publication venue
Publication date: 10/06/2019
Field of study

Programs that use the STAPL C++ parallel programming library express their control and data flow explicitly through the use of skeletons. Skeletons can be simple parallel operations like map and reduce, or the result of composing several skeletons. Composition is implemented by tracking the dependencies among individual data elements in the STAPL runtime system. However, the operations and dependencies within a compose skeleton can be determined at compile time from the C++ abstract syntax tree. This enables the use of source-to-source transformations to fuse the composed skeletons. Transformations can also be used to replace skeletons entirely with equivalent code. Both transformations greatly reduce STAPL runtime overhead, and zip fusion also allows a compiler to optimize the work functions as a single unit. We present a Clang compiler plugin and wrapper that automatically perform these transformations, and demonstrate its ability to improve performance

PARALLEL SKELETONS FOR STRUCTURED COMPOSITION

Author: DARLINGTON J
GUO YK
TO HW
YANG J
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/08/1995
Field of study

Spiral - Imperial College Digital Repository