Search CORE

348 research outputs found

Dataflow Computing with Polymorphic Registers

Author: Ciobanu Catalin
Gaydadjiev Georgi N.
Pilato Christian
Sciuto Donatella
Publication venue
Publication date: 01/01/2013
Field of study

Heterogeneous systems are becoming increasingly popular for data processing. They improve performance of simple kernels applied to large amounts of data. However, sequential data loads may have negative impact. Data parallel solutions such as Polymorphic Register Files (PRFs) can potentially accelerate applications by facilitating high speed, parallel access to performance-critical data. Furthermore, by PRF customization, specific data path features are exposed to the programmer in a very convenient way. PRFs allow additional control over the registers dimensions, and the number of elements which can be simultaneously accessed by computational units. This paper shows how PRFs can be integrated in dataflow computational platforms. In particular, starting from an annotated source code, we present a compiler-based methodology that automatically generates the customized PRFs and the enhanced computational kernels that efficiently exploit them

Archivio istituzionale della ricerca - Politecnico di Milano

Chalmers Research

Chalmers Publication Library

The Case for Polymorphic Registers in Dataflow Computing

Author: Ciobanu Cătălin Bogdan
Gaydadjiev Georgi
Pilato Christian
Sciuto Donatella
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2018
Field of study

Heterogeneous systems are becoming increasingly popular, delivering high performance through hardware specialization. However, sequential data accesses may have a negative impact on performance. Data parallel solutions such as Polymorphic Register Files (PRFs) can potentially accelerate applications by facilitating high-speed, parallel access to performance-critical data. This article shows how PRFs can be integrated into dataflow computational platforms. Our semi-automatic, compiler-based methodology generates customized PRFs and modifies the computational kernels to efficiently exploit them. We use a separable 2D convolution case study to evaluate the impact of memory latency and bandwidth on performance compared to a state-of-the-art NVIDIA Tesla C2050 GPU. We improve the throughput up to 56.17X and show that the PRF-augmented system outperforms the GPU for 9×9 or larger mask sizes, even in bandwidth-constrained systems

Archivio istituzionale della ricerca - Politecnico di Milano

UvA-DARE

The Case for Polymorphic Registers in Dataflow Computing

Author: Ciobanu C.B.
Gaydadjiev G.
Pilato C.
Sciuto D.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/12/2018
Field of study

International Migration, Integration and Social Cohesion online publications

Morpheus unleashed: Fast cross-platform SpMV on emerging architectures

Author: Brown Nick
Jesus Ricardo
Klaisoongnoen Mark
Stylianou Christodoulos
Weiland Michele
Publication venue
Publication date: 11/05/2023
Field of study

Edinburgh Research Explorer

Morpheus unleashed: Fast cross-platform SpMV on emerging architectures

Author: Brown Nick
Jesus Ricardo
Klaisoongnoen Mark
Stylianou Christodoulos
Weiland Michele
Publication venue
Publication date: 19/04/2023
Field of study

Sparse matrices and linear algebra are at the heart of scientific simulations. Over the years, more than 70 sparse matrix storage formats have been developed, targeting a wide range of hardware architectures and matrix types, each of which exploit the particular strengths of an architecture, or the specific sparsity patterns of the matrices. In this work, we explore the suitability of storage formats such as COO, CSR and DIA for emerging architectures such as AArch64 CPUs and FPGAs. In addition, we detail hardware-specific optimisations to these targets and evaluate the potential of each contribution to be integrated into Morpheus, a modern library that provides an abstraction of sparse matrices (currently) across x86 CPUs and NVIDIA/AMD GPUs. Finally, we validate our work by comparing the performance of the Morpheus-enabled HPCG benchmark against vendor-optimised implementations

arXiv.org e-Print Archive

Bytecode verification on Java smart cards

Author: Aho
Briggs
Brisset
Chaitin
Chen
Cohen
Freund
Gong
Gosling
Grimaud
Hagiya
Hartel
Leroy
Lindholm
McGraw
Muchnick
Necula
Nipkow
O'Callahan
Posegga
Pusch
Qian
Qian
Rose
Stata
Stärk
Sun Microsystems
Sun Microsystems
Sun Microsystems
Vigna
Yellin
Publication venue: 'Wiley'
Publication date: 01/01/2002
Field of study

Crossref

Integrated Java Bytecode Verification

Author: Franz Michael
Gal Andreas
Probst Christian
Publication venue
Publication date: 01/01/2005
Field of study

AbstractExisting Java verifiers perform an iterative data-flow analysis to discover the unambiguous type of values stored on the stack or in registers. Our novel verification algorithm uses abstract interpretation to obtain definition/use information for each register and stack location in the program, which in turn is used to transform the program into Static Single Assignment form. In SSA, verification is reduced to simple type compatibility checking between the definition type of each SSA variable and the type of each of its uses. Inter-adjacent transitions of a value through stack and registers are no longer verified explicitly. This integrated approach is more efficient than traditional bytecode verification but still as safe as strict verification, as overall program correctness can be induced once the data flow from each definition to all associated uses is known to be type-safe

Elsevier - Publisher Connector

Online Research Database In Technology

A formally verified compiler back-end

Author: A Dold
A Dold
A Hobor
A Pnueli
ACJ Fox
AJ Chlipala
AW Appel
AW Appel
AW Appel
BK Rosen
C Lindig
CW Barrett
D Cachera
D Lacey
D Leinenbach
D Leinenbach
E Eide
F Henderson
G Barthe
G Barthe
G Barthe
G Barthe
G Clemmensen
G Goos
G Klein
G Li
G Li
G Morrisett
G Morrisett
GA Kildall
GC Necula
GC Necula
GC Necula
GC Necula
GJ Chaitin
GP Huet
H-J Boehm
IBM Corporation
J Chen
J Guttman
J Knoop
J Knoop
J McCarthy
J-B Tristan
J-B Tristan
JO Blech
JR Ellis
JS Moore
JS Moore
L Beringer
L Chirica
L George
L Rideau
LD Zuck
M Huisman
M Müller-Olm
M Strecker
MA Dave
N Benton
P Letouzey
P Letouzey
PH Hartel
PW O’Hearn
Q Huang
R Milner
R Stärk
S Beyer
S Blazy
S Blazy
S Coupet-Grimal
S Gulwani
S Lerner
SL Peyton Jones
SS Muchnick
TC Hales
WM McKeeman
X Feng
X Leroy
X Leroy
X Leroy
X Leroy
X Rival
Xavier Leroy
Y Bertot
Y Bertot
Z Shao
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2008
Field of study

This article describes the development and formal verification (proof of semantic preservation) of a compiler back-end from Cminor (a simple imperative intermediate language) to PowerPC assembly code, using the Coq proof assistant both for programming the compiler and for proving its correctness. Such a verified compiler is useful in the context of formal methods applied to the certification of critical software: the verification of the compiler guarantees that the safety properties proved on the source code hold for the executable compiled code as well

arXiv.org e-Print Archive

CiteSeerX

Crossref

INRIA a CCSD electronic archive server

Hailstorm : A Statically-Typed, Purely Functional Language for IoT Applications

Author: Sarkar Abhiroop
Sheeran Mary
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2020
Field of study

With the growing ubiquity of Internet of Things (IoT), more complex logic is being programmed on resource-constrained IoT devices, almost exclusively using the C programming language. While C provides low-level control over memory, it lacks a number of high-level programming abstractions such as higher-order functions, polymorphism, strong static typing, memory safety, and automatic memory management.We present Hailstorm, a statically-typed, purely functional programming language that attempts to address the above problem. It is a high-level programming language with a strict typing discipline. It supports features like higher-order functions, tail-recursion and automatic memory management, to program IoT devices in a declarative manner. Applications running on these devices tend to be heavily dominated by I/O. Hailstorm tracks side effects like I/O in its type system using resource types. This choice allowed us to explore the design of a purely functional standalone language, in an area where it is more common to embed a functional core in an imperative shell. The language borrows the combinators of arrowized FRP, but has discrete-time semantics. The design of the full set of combinators is work in progress, driven by examples. So far, we have evaluated Hailstorm by writing standard examples from the literature (earthquake detection, a railway crossing system and various other clocked systems), and also running examples on the GRiSP embedded systems board, through generation of Erlang

arXiv.org e-Print Archive

Chalmers Research