Search CORE

135 research outputs found

ASAM : Automatic Architecture Synthesis and Application Mapping; dl. 3.2: Instruction set synthesis

Author: Corvino R.
Diken E.
Jordans R.
Jozwiak L.
Publication venue: 'Anadolu Universitesi Bilim ve Teknoloji Dergisi C : Yasam Bilimleri ve Biyoteknoloji'
Publication date: 01/01/2011
Field of study

No abstract

Pure OAI Repository

A comparison of heuristic algorithms for custom instruction selection

Author: Casseau Emmanuel
Liu Wanjun
Wang Shanshan
Xiao Chenglong
Publication venue: Elsevier
Publication date: 01/08/2016
Field of study

International audienc

HAL-CentraleSupelec

Crossref

INRIA a CCSD electronic archive server

HAL Descartes

HAL-Rennes 1

Design methodologies for instruction-set extensible processors

Author: PAN YU
Publication venue
Publication date: 08/04/2009
Field of study

Ph.DDOCTOR OF PHILOSOPH

ScholarBank@NUS

Algorithms for Improving the Automatically Synthesized Instruction Set of an Extensible Processor

Author: Sovietov Peter
Publication venue
Publication date: 01/01/2024
Field of study

Processors with extensible instruction sets are often used today as programmable hardware accelerators for various domains. When extending RISC-V and other similar extensible processor architectures, the task of designing specialized instructions arises. This task can be solved automatically by using instruction synthesis algorithms. In this paper, we consider algorithms that can be used in addition to the known approaches and improve the synthesized instruction sets by recomputing common operations (the result of which is consumed by multiple operations) of a program inside clustered synthesized instructions (common operations clustering algorithm), and by identifying redundant (which have equivalents among the other instructions) synthesized instructions (subsuming functions algorithm). Experimental evaluations of the developed algorithms are presented for the tests from the domains of cryptography and three-dimensional graphics. For Magma cipher test, the common operations clustering algorithm allows reducing the size of the compiled code by 9%, and the subsuming functions algorithm allows reducing the synthesized instruction set extension size by 2 times. For AES cipher test, the common operations clustering algorithm allows reducing the size of the compiled code by 10%, and the subsuming functions algorithm allows reducing the synthesized instruction set extension size by 2.5 times. Finally, for the instruction set extension from Volume Ray-Casting test, the additional use of subsuming functions algorithm allows reducing problem-specific instruction extension set size from 5 to only 2 instructions without losing its functionality

arXiv.org e-Print Archive

FPGA-aware techniques for rapid generation of profitable custom instructions

Author: Clarke C.T.
Lam S.-K.
Prakash A.
Srikanthan T.
Publication venue: 'Elsevier BV'
Publication date: 01/05/2013
Field of study

OPUS

Crossref

CHIPS: Custom Hardware Instruction Processor Synthesis

Author: Can Ozturan
GÜnhan Dundar
Kubilay Atasu
Oskar Mencer
Wayne Luk
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date
Field of study

Crossref

Automated application-specific instruction set generation

Author: XU CE
Publication venue
Publication date: 09/02/2006
Field of study

Master'sMASTER OF ENGINEERIN

ScholarBank@NUS

Exact and Approximate Algorithms for the Extension of Embedded Processor Instruction Sets

Author: Atasu Kubilay
Ienne Paolo
Pozzi Laura
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 08/08/2005
Field of study

Infoscience - École polytechnique fédérale de Lausanne

Rapid evaluation of custom instruction selection approaches with FPGA estimation

Author: Clarke Christopher T.
Lam Siew Kei
Srikanthan Thambipillai
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 10/03/2014
Field of study

The main aim of this article is to demonstrate that a fast and accurate FPGA estimation engine is indispensable in design flows for custom instruction (template) selection. The need for a FPGA estimation engine stems from the difficulty in predicting the FPGA performance measures of selected custom instructions. We will present a FPGA estimation technique that partitions the high-level representation of custom instructions into clusters based on the structural organization of the target FPGA, while taking into account general logic synthesis principles adopted by FPGA tools. In this work, we have evaluated a widely used graph covering algorithm with various heuristics for custom instruction selection. In addition, we present an algorithm called Refined Largest Fit First (RLFF) that relies on a graph covering heuristic to select non-overlapping superset templates, which typically incorporate frequently used basic templates. The initial solution is further refined by considering overlapping templates that were ignored previously to see if their introduction could lead to higher performance. While RLFF provides the most efficient cover compared to the ILP method and other graph covering heuristics, FPGA estimation results reveals that RLFF leads to the worst performance in certain applications. It is therefore a worthy proposition to equip design flows with accurate FPGA estimation in order to rapidly determine the most profitable custom instruction approach for a given application.</jats:p

OPUS

Crossref