54 research outputs found

A new fast algorithm for optimal register allocation in modulo scheduled loops

Author: B.R. Rau
C. Eisenbeis
E. R. Altman
J. Wang
L.J. Hendren
M. S. Lam
M.R. Garey
P.A. Steenkiste
R. A. Huff
R. Bodik
R. Cytron
W. Mangione-Smith
Publication venue: 'Springer Science and Business Media LLC'

Crossref

Register Allocation and Optimal Spill Code Scheduling in Software Pipelined Loops Using 0-1 Integer Linear Programming Formulation

Author: A. Aleta
B.R. Rau
C.M. Chen
D.W. Goodwin
J. Llosa
J. Zalamea
J.C. Dehnert
K. Ebcioglu
K. Wilken
K.D. Cooper
M. Lam
P. Feautrier
Q. Ning
R. Govindarajan
V.H. Allan
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2007

Crossref

A Two-Way Loop Algorithm for Exploiting Instruction-Level Parallelism in Memory System

Author: B.R. Rau
C.D. Cantrell
J. Hennessy
J.E. Smith
S.P. Vijay
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2014

Crossref

Embedded computing: New directions in architecture and automation

Author: Hewlett-Packard Laboratories Bristol (United Kingdom)
Rau B.R.
Schlansker M.S.
Publication date: 01/01/2000

SIGLE. Available from British Library Document Supply Centre-DSC:4335.26205(2000-115) / BLDSC - British Library Document Supply Centre. GB. United Kingdom

OpenGrey Repository

High-level synthesis of nonprogrammable hardware accelerators

Author: Aditya S.
Hewlett-Packard Laboratories Bristol (United Kingdom)
Rau B.R.
Schreiber R.
Publication date: 01/01/2000

SIGLE. Available from British Library Document Supply Centre-DSC:4335.26205(2000-31) / BLDSC - British Library Document Supply Centre. GB. United Kingdom

OpenGrey Repository

HPL-PD architecture specification: Version 1.1

Author: Hewlett-Packard Laboratories Bristol (United Kingdom)
Kathail V.
Rau B.R.
Schlansker M.S.
Publication date: 01/01/2000

This is a revised version of technical report HPL-93-80, February 1994. SIGLE. Available from British Library Document Supply Centre-DSC:4335.26205(93-80(R.1)) / BLDSC - British Library Document Supply Centre. GB. United Kingdom

OpenGrey Repository

Code size minimization and retargetable assembly for custom EPIC and VLIW instruction formats

Author: Aditya S.
Hewlett-Packard Laboratories Bristol (United Kingdom)
Mahlke S.A.
Rau B.R.
Publication date: 01/01/2000

SIGLE. Available from British Library Document Supply Centre-DSC:4335.26205(2000-141) / BLDSC - British Library Document Supply Centre. GB. United Kingdom

OpenGrey Repository

A constructive solution to the juggling problem in systolic array synthesis

Author: Darte A.
Hewlett-Packard Laboratories Bristol (United Kingdom)
Rau B.R.
Schreiber R.
Publication date: 01/01/2000

SIGLE. Available from British Library Document Supply Centre-DSC:4335.26205(2000-30) / BLDSC - British Library Document Supply Centre. GB. United Kingdom

OpenGrey Repository

Fast design space exploration through validity and quality filtering of subsystem designs

Author: Abraham S.G.
Hewlett-Packard Laboratories Bristol (United Kingdom)
Rau B.R.
Schreiber R.
Publication date: 01/01/2000

SIGLE. Available from British Library Document Supply Centre-DSC:4335.26205(2000-98) / BLDSC - British Library Document Supply Centre. GB. United Kingdom

OpenGrey Repository

Profile-Driven Instruction Level Parallel Scheduling with Application to Super Blocks

Author: B. Natarajan
B.R. Rau
M. Schlansker
C. Chekuri
R. Johnson
R. Motwani

Code scheduling to exploit instruction level parallelism (ILP) is a critical problem in compiler optimization research, in light of the increased use of long-instruction-word machines. Unfortunately, optimum scheduling is computationally intractable, and one must resort to carefully crafted heuristics in practice. If the scope of application of a scheduling heuristic is limited to basic blocks, considerable performance loss may be incurred at block boundaries. To overcome this obstacle, basic blocks can be coalesced across branches to form larger regions such as super blocks. In the literature, these regions are typically scheduled using algorithms that are either oblivious to profile information (under the assumption that the process of forming the region has fully utilized the profile information), or use the profile information as an addendum to classical scheduling techniques. We believe that even for the simple case of linear code regions such as super blocks, additional performanc..

CiteSeerX