Search CORE

491 research outputs found

C Language Extensions for Hybrid CPU/GPU Programming with StarPU

Author: Courtès Ludovic
Publication venue
Publication date: 05/04/2013
Field of study

Modern platforms used for high-performance computing (HPC) include machines with both general-purpose CPUs, and "accelerators", often in the form of graphical processing units (GPUs). StarPU is a C library to exploit such platforms. It provides users with ways to define "tasks" to be executed on CPUs or GPUs, along with the dependencies among them, and by automatically scheduling them over all the available processing units. In doing so, it also relieves programmers from the need to know the underlying architecture details: it adapts to the available CPUs and GPUs, and automatically transfers data between main memory and GPUs as needed. While StarPU's approach is successful at addressing run-time scheduling issues, being a C library makes for a poor and error-prone programming interface. This paper presents an effort started in 2011 to promote some of the concepts exported by the library as C language constructs, by means of an extension of the GCC compiler suite. Our main contribution is the design and implementation of language extensions that map to StarPU's task programming paradigm. We argue that the proposed extensions make it easier to get started with StarPU,eliminate errors that can occur when using the C library, and help diagnose possible mistakes. We conclude on future work

arXiv.org e-Print Archive

INRIA a CCSD electronic archive server

Algorithms versus architectures for computational chemistry

Author: Bauschlicher C. W., Jr.
Partridge H.
Publication venue
Publication date
Field of study

The algorithms employed are computationally intensive and, as a result, increased performance (both algorithmic and architectural) is required to improve accuracy and to treat larger molecular systems. Several benchmark quantum chemistry codes are examined on a variety of architectures. While these codes are only a small portion of a typical quantum chemistry library, they illustrate many of the computationally intensive kernels and data manipulation requirements of some applications. Furthermore, understanding the performance of the existing algorithm on present and proposed supercomputers serves as a guide for future programs and algorithm development. The algorithms investigated are: (1) a sparse symmetric matrix vector product; (2) a four index integral transformation; and (3) the calculation of diatomic two electron Slater integrals. The vectorization strategies are examined for these algorithms for both the Cyber 205 and Cray XMP. In addition, multiprocessor implementations of the algorithms are looked at on the Cray XMP and on the MIT static data flow machine proposed by DENNIS

NASA Technical Reports Server

Adapting the interior point method for the solution of LPs on serial, coarse grain parallel and massively parallel computers

Author: Andersen J
Levkovitz R
Mitra G
Tamiz M
Publication venue: Brunel University
Publication date: 01/01/1990
Field of study

In this paper we describe a unified scheme for implementing an interior point algorithm (IPM) over a range of computer architectures. In the inner iteration of the IPM a search direction is computed using Newton's method. Computationally this involves solving a sparse symmetric positive definite (SSPD) system of equations. The choice of direct and indirect methods for the solution of this system, and the design of data structures to take advantage of serial, coarse grain parallel and massively parallel computer architectures, are considered in detail. We put forward arguments as to why integration of the system within a sparse simplex solver is important and outline how the system is designed to achieve this integration

CiteSeerX

Brunel University Research Archive

XcalableMP PGAS Programming Language

Author: Sato Mitsuhisa
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

XcalableMP is a directive-based parallel programming language based on Fortran and C, supporting a Partitioned Global Address Space (PGAS) model for distributed memory parallel systems. This open access book presents XcalableMP language from its programming model and basic concept to the experience and performance of applications described in XcalableMP.　 XcalableMP was taken as a parallel programming language project in the FLAGSHIP 2020 project, which was to develop the Japanese flagship supercomputer, Fugaku, for improving the productivity of parallel programing. XcalableMP is now available on Fugaku and its performance is enhanced by the Fugaku interconnect, Tofu-D. The global-view programming model of XcalableMP, inherited from High-Performance Fortran (HPF), provides an easy and useful solution to parallelize data-parallel programs with directives for distributed global array and work distribution and shadow communication. The local-view programming adopts coarray notation from Coarray Fortran (CAF) to describe explicit communication in a PGAS model. The language specification was designed and proposed by the XcalableMP Specification Working Group organized in the PC Consortium, Japan. The Omni XcalableMP compiler is a production-level reference implementation of XcalableMP compiler for C and Fortran 2008, developed by RIKEN CCS and the University of Tsukuba. The performance of the XcalableMP program was used in the Fugaku as well as the K computer. A performance study showed that XcalableMP enables a scalable performance comparable to the message passing interface (MPI) version with a clean and easy-to-understand programming style requiring little effort

OAPEN Library

Reflectance Transformation Imaging (RTI) System for Ancient Documentary Artefacts

Author: Earl G
Martinez K
Pagi H
Publication venue
Publication date: 01/01/2010
Field of study

This tutorial summarises our uses of reflectance transformation imaging in archaeological contexts. It introduces the UK AHRC funded project reflectance Transformation Imaging for Anciant Documentary Artefacts and demonstrates imaging methodologies

Southampton (e-Prints Soton)

Acta Cybernetica : Volume 18. Number 2.

Author
Publication venue
Publication date: 01/01/2007
Field of study

University of Szeged

Resource allocation in a university environment : a test of the Ruefli, Freeland, and Davis goal programming decomposition algorithms / BEBR No. 735

Author: Davis Wayne J.
Whitford David Thomas
Publication venue: [Urbana] : College of Commerce and Business Administration, Bureau of Economic and Business Research, University of Illinois at Urbana-Champaign,
Publication date: 01/01/1981
Field of study

Bibliography: p. 20-22

Illinois Digital Environment for Access to Learning and Scholarship Repository

An Optimization-Based Decision Support System for Strategic Planning in a Process Industry: The Case of an Aluminum Company in India

Author: Dutta Goutam
Gupta Narain
Robert Fourer
Publication venue
Publication date
Field of study

<div align="justify">We describe how a generic multi-period optimization-based decision support system (DSS) can be used for strategic planning in process industries. The DSS is built on five fundamental elements . materials, facilities, activities, storage areas and time periods. It requires little direct knowledge of optimization techniques to be used effectively. Results based on real data from an aluminum company in India demonstrate significant potential for improvement in profits.</div>

Research Papers in Economics