Search CORE

11 research outputs found

Speeding-Up Expensive Evaluations in High-Level Synthesis Using Solution Modeling and Fitness Inheritance

Author: A. Kuehlmann
C. Brandolese
C. Mandal
C.T. Hwang
E. Zitzler
G. Grewal
G. Micheli De
J. Dennis
J.J. Grefenstette
K. Deb
K. Sastry
K. Sastry
L. Stok
M. Meribout
M. Palesi
P. Kollig
P.G. Paulin
R. Cordone
R.E. Smith
S. Huband
V. Chaiyakul
V. Krishnan
X. Llor‘a
Y. Jin
Z. Gu
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2010
Field of study

High-Level Synthesis (HLS) is the process of developing digital circuits from behavioral specifications. It involves three interdependent and NP-complete optimization problems: (i) the operation scheduling, (ii) the resource allocation, and (iii) the controller synthesis. Evolutionary Algorithms have been already effectively applied to HLS to find good solution in presence of conflicting design objectives. In this paper, we present an evolutionary approach to HLS that extends previous works in three respects: (i) we exploit the NSGA-II, a multi-objective genetic algorithm, to fully automate the design space exploration without the need of any human intervention, (ii) we replace the expensive evaluation process of candidate solutions with a quite accurate regression model, and (iii) we reduce the number of evaluations with a fitness inheritance scheme. We tested our approach on several benchmark problems. Our results suggest that all the enhancements introduced improve the overall performance of the evolutionary search

Archivio istituzionale della ricerca - Politecnico di Milano

Crossref

Coordinated parallelizing compiler optimizations and high-level synthesis

Author: Aho A.
Alexandru Nicolau
Bergamaschi R.
Chaiyakul V.
Ebcioglu K.
Fisher J.
Gupta S.
Gupta S.
Gupta S.
Gupta S.
Gupta S.
Gupta S.
Iqbal Z.
Janssen M.
Kountouris A.
Ku D.
Li J.
Lobo D.
Nicolau A.
Nikil D. Dutt
Novack S.
Orailoglu A.
Peymandoust A.
Potkonjak M.
Rajesh Kumar Gupta
Sreedhar V.
Sumit Gupta
Wakabayashi K.
Wakabayashi K.
Walker R.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/10/2004
Field of study

We present a high-level synthesis methodology that applies a coordinated set of coarse-grain and fine-grain parallelizing transformations. The transformations are applied both during a presynthesis phase and during scheduling, with the objective of optimizing the results of synthesis and reducing the impact of control flow constructs on the quality of results. We first apply a set of source level presynthesis transformations that include common sub-expression elimination (CSE), copy propagation, dead code elimination and loop-invariant code motion, along with more coarse-level code restructuring transformations such as loop unrolling. We then explore scheduling techniques that use a set of aggressive speculative code motions to maximally parallelize the design by re-ordering, speculating and sometimes even duplicating operations in the design. In particular, we present a new technique called "Dynamic CSE" that dynamically coordinates CSE and code motions such as speculation and conditional speculation during scheduling. We implemented our parallelizing high-level synthesis in the SPARK framework. This framework takes a behavioral description in ANSI-C as input and generates synthesizable register-transfer level VHDL. Our results from computationally expensive portions of three moderately complex design targets, namely, MPEG-1, MPEG-2 and the GIMP image processing too], validate the utility of our approach to the behavioral synthesis of designs with complex control flows

Crossref

eScholarship - University of California

Changes in cellular microRNA expression induced by porcine circovirus type 2-encoded proteins

Author: A Kozomara
A Mankertz
A Mankertz
AK Cheung
C Chae
C Missero
Chang-Yong Choi
D Chen
DJ Adams
Dokyun Na
DP Bartel
E Gottwein
F Xiao
GM Allan
GM Allan
GP Wagner
H Guo
IS Cho
J Ellis
J Ellis
J Kach
J Kim
J Krol
J Liu
J Segales
J Winter
J Yu
JA Whelan
Jae-Sang Hong
JL Umbach
JS Tsang
JT Mendell
Jun-Seong Lee
K Hirasawa
KA O’Donnell
L Ma
L Wei
L Wei
M Chaiyakul
M Hackenberg
M Ramirez-Boo
M Rehmsmeier
M Tini
M Zuker
Nam-Hoon Kim
NE Davey
P Meerts
P Nawagitgul
PA Maroney
R Li
RC Friedman
RD Morin
RE Sanchez Jr
RL Skalsky
S Guil
S Pleschka
S Timmusk
S Timmusk
SL Ameres
SM Hammond
T Finsterbusch
T Finsterbusch
Taehoon Chun
TC Chang
V Ambros
V Ambros
W Huang da
W Li
W Sun
X Si
X Zhang
Y Altuvia
Y Lee
Y Lee
Young Sik Lee
YS Lee
Z Paroo
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Recommended from our members

Linking register-transfer and physical levels of design

Author: Chaiyakul V.
Gajski D. D.
Kurdahi F J.
Ramachandran C.
Publication venue: eScholarship, University of California
Publication date: 27/05/1993
Field of study

System and chip synthesis must evaluate candidate Register-Transfer (RT} architectures with respect to finished physical designs. Current RT level cost measures, however, are highly simplified and do not reflect the real physical design. Complete physical design, on the other hand, is quite costly, and infeasible to be iterated many times. In order to establish a more realistic assessment of layout effects, we proposed a new layout model which efficiently accounts for the effects of wiring and floorplanning on the area and performance of RT level designs, before the physical design process. Benchmarking has shown that our model is quite accurate

eScholarship - University of California

Recommended from our members

Linking register-transfer and physical levels of design

Author: Chaiyakul V.
Gajski D. D.
Kurdahi F J.
Ramachandran C.
Publication venue: eScholarship, University of California
Publication date: 27/05/1993
Field of study

eScholarship - University of California

A Compound Information Model for High-Level Synthesis

Author: AC Wu
Cfi
DD Gajski
DD Gajski
DW Knapp
E Rundensteiner
E Rundensteiner
J Peterson
MA Marshall
P Conradi
RY Lau
V Chaiyakul
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/1995
Field of study

Crossref

Rapid VLIW Processor Customization for Signal Processing Applications Using Combinational Hardware Functions

Author: A Capitanio
AK Jones
AK Jones
B Hassibi
B Hassibi
B Khailany
BA Levine
C Ebeling
C Ebeling
C Ebeling
C Lee
CN Hinds
D Black
DC Cronquist
DC Cronquist
DC Suresh
DJ Pursley
E Atzori
E Jung
E Mirsky
G De Micheli
G Golub
H Schmit
H Schmit
I Ghosh
J Hilgenstock
JC Alves
JD Owens
JR Hauser
K Bartleson
L Lavagno
L Zhang
P Banerjee
P Banerjee
R Garg
R Goering
R Hoare
S Cadambi
S Chappell
S Dutta
S Gupta
S Gupta
S Hauck
S Hauck
S McCloud
SC Goldstein
SC Goldstein
T Bridges
T Callahan
TJ Callahan
UJ Kapasi
V Chaiyakul
V Chaiyakul
VA Chouliaras
X Tang
Y Chobe
Publication venue: SpringerOpen
Publication date: 01/01/2006
Field of study

<p/> <p>This paper presents an architecture that combines VLIW (very long instruction word) processing with the capability to introduce application-specific customized instructions and highly parallel combinational hardware functions for the acceleration of signal processing applications. To support this architecture, a compilation and design automation flow is described for algorithms written in C. The key contributions of this paper are as follows: (1) a 4-way VLIW processor implemented in an FPGA, (2) large speedups through hardware functions, (3) a hardware/software interface with zero overhead, (4) a design methodology for implementing signal processing applications on this architecture, (5) tractable design automation techniques for extracting and synthesizing hardware functions. Several design tradeoffs for the architecture were examined including the number of VLIW functional units and register file size. The architecture was implemented on an Altera Stratix II FPGA. The Stratix II device was selected because it offers a large number of high-speed DSP (digital signal processing) blocks that execute multiply-accumulate operations. Using the MediaBench benchmark suite, we tested our methodology and architecture to accelerate software. Our combined VLIW processor with hardware functions was compared to that of software executing on a RISC processor, specifically the soft core embedded NIOS II processor. For software kernels converted into hardware functions, we show a hardware performance multiplier of up to <inline-formula><graphic file="1687-6180-2006-046472-i1.gif"/></inline-formula> times that of software with an average <inline-formula><graphic file="1687-6180-2006-046472-i2.gif"/></inline-formula> times faster. For the entire application in which only a portion of the software is converted to hardware, the performance improvement is as much as 30X times faster than the nonaccelerated application, with a 12X improvement on average.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals