Search CORE

735 research outputs found

OPTIMAL AREA AND PERFORMANCE MAPPING OF K-LUT BASED FPGAS

Author: Alexandru E. ŞUŞU
Cornel POPESCU
George CULEA
Ion I. BUCUR
Publication venue
Publication date
Field of study

FPGA circuits are increasingly used in many fields: for rapid prototyping of new products (including fast ASIC implementation), for logic emulation, for producing a small number of a device, or if a device should be reconfigurable in use (reconfigurable computing). Determining if an arbitrary, given wide, function can be implemented by a programmable logic block, unfortunately, it is generally, a very difficult problem. This problem is called the Boolean matching problem. This paper introduces a new implemented algorithm able to map, both for area and performance, combinational networks using k-LUT based FPGAs.k-LUT based FPGAs, combinational circuits, performance-driven mapping.

Research Papers in Economics

Technology Mapping for Circuit Optimization Using Content-Addressable Memory

Author: Lucas Joshua Michael
Publication venue
Publication date: 31/01/2006
Field of study

The growing complexity of Field Programmable Gate Arrays (FPGA's) is leading to architectures with high input cardinality look-up tables (LUT's). This thesis describes a methodology for area-minimizing technology mapping for combinational logic, specifically designed for such FPGA architectures. This methodology, called LURU, leverages the parallel search capabilities of Content-Addressable Memories (CAM's) to outperform traditional mapping algorithms in both execution time and quality of results. The LURU algorithm is fundamentally different from other techniques for technology mapping in that LURU uses textual string representations of circuit topology in order to efficiently store and search for circuit patterns in a CAM. A circuit is mapped to the target LUT technology using both exact and inexact string matching techniques. Common subcircuit expressions (CSE's) are also identified and used for architectural optimization---a small set of CSE's is shown to effectively cover an average of 96% of the test circuits. LURU was tested with the ISCAS'85 suite of combinational benchmark circuits and compared with the mapping algorithms FlowMap and CutMap. The area reduction shown by LURU is, on average, 20% better compared to FlowMap and CutMap. The asymptotic runtime complexity of LURU is shown to be better than that of both FlowMap and CutMap

D-Scholarship@Pitt

FPGA technology mapping optimizaion by rewiring algorithms.

Author
Publication venue
Publication date: 01/01/2005
Field of study

Tang Wai Chung.Thesis (M.Phil.)--Chinese University of Hong Kong, 2005.Includes bibliographical references (leaves 40-41).Abstracts in English and Chinese.Abstract --- p.iAcknowledgement --- p.iiiChapter 1 --- Introduction --- p.1Chapter 2 --- Rewiring Algorithms --- p.3Chapter 2.1 --- REWIRE --- p.5Chapter 2.2 --- RAMFIRE --- p.7Chapter 2.3 --- GBAW --- p.8Chapter 3 --- FPGA Technology Mapping --- p.11Chapter 3.1 --- Problem Definition --- p.13Chapter 3.2 --- Network-flow-based Algorithms for FPGA Technology Mapping --- p.16Chapter 3.2.1 --- FlowMap --- p.16Chapter 3.2.2 --- FlowSYN --- p.21Chapter 3.2.3 --- CutMap --- p.22Chapter 4 --- LUT Minimization by Rewiring --- p.24Chapter 4.1 --- Greedy Decision Heuristic for LUT Minimization --- p.27Chapter 4.2 --- Experimental Result --- p.28Chapter 5 --- Conclusion --- p.38Bibliography --- p.4

CUHK Digital Repository

Optimal simultaneous mapping and clustering for FPGA delay optimization

Author: J. Cong
J.Y. Lin
null Deming Chen
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2008
Field of study

Crossref

Support-reducing decomposition for FPGA mapping

Author: Cortadella Jordi
Machado Lucas
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2020
Field of study

Decomposition is a technology-independent process, in which a large complex function is broken into smaller, less complex functions. The costs of two-level or factored-form representations (cubes and literals) are used in most decomposition methods, as they have a high correlation with the area of cell-based designs. However, this correlation is weaker for field-programmable gate arrays (FPGAs) based on look-up tables. Furthermore, local optimizations have limited power due to the structural bias of the circuit descriptions. This paper tries to reduce the structural biasing by remapping the LUT network and decomposing the derived functions using the support as cost function. The proposed method improves the FPGA mapping results of a commercial tool for the 20 largest MCNC benchmarks, with gains of 28% in delay plus 18% in area when targeting delay, and a reduction of 28% in area plus 14% in delay with area as cost function. Results with 23% less area and 6% less delay are obtained after physical synthesis (post place-and-route). Moreover, 12 of the best known results for delay (and 3 for area) of the EPFL benchmarks are improved.Peer ReviewedPostprint (author's final draft

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

UPCommons. Portal del coneixement obert de la UPC

Algorithms and Models for Optimal Power Management on Smartphones

Author: Dobson Richard Mark
Publication venue
Publication date: 01/01/2014
Field of study

King's Research Portal

Real-Time Dense Stereo Matching With ELAS on FPGA Accelerated Embedded Devices

Author: Frost Duncan
Miksik Ondrej
Rahnama Oscar
Torr Philip H. S.
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2018
Field of study

For many applications in low-power real-time robotics, stereo cameras are the sensors of choice for depth perception as they are typically cheaper and more versatile than their active counterparts. Their biggest drawback, however, is that they do not directly sense depth maps; instead, these must be estimated through data-intensive processes. Therefore, appropriate algorithm selection plays an important role in achieving the desired performance characteristics. Motivated by applications in space and mobile robotics, we implement and evaluate a FPGA-accelerated adaptation of the ELAS algorithm. Despite offering one of the best trade-offs between efficiency and accuracy, ELAS has only been shown to run at 1.5-3 fps on a high-end CPU. Our system preserves all intriguing properties of the original algorithm, such as the slanted plane priors, but can achieve a frame rate of 47fps whilst consuming under 4W of power. Unlike previous FPGA based designs, we take advantage of both components on the CPU/FPGA System-on-Chip to showcase the strategy necessary to accelerate more complex and computationally diverse algorithms for such low power, real-time systems.Comment: 8 pages, 7 figures, 2 table

arXiv.org e-Print Archive

Crossref

Oxford University Research Archive

Optimizing Scrubbing by Netlist Analysis for FPGA Configuration Bit Classification and Floorplanning

Author: Schmidt Bernhard
Teich Jürgen
Ziener Daniel
Zöllner Christian
Publication venue: 'Elsevier BV'
Publication date: 25/07/2017
Field of study

Existing scrubbing techniques for SEU mitigation on FPGAs do not guarantee an error-free operation after SEU recovering if the affected configuration bits do belong to feedback loops of the implemented circuits. In this paper, we a) provide a netlist-based circuit analysis technique to distinguish so-called critical configuration bits from essential bits in order to identify configuration bits which will need also state-restoring actions after a recovered SEU and which not. Furthermore, b) an alternative classification approach using fault injection is developed in order to compare both classification techniques. Moreover, c) we will propose a floorplanning approach for reducing the effective number of scrubbed frames and d), experimental results will give evidence that our optimization methodology not only allows to detect errors earlier but also to minimize the Mean-Time-To-Repair (MTTR) of a circuit considerably. In particular, we show that by using our approach, the MTTR for datapath-intensive circuits can be reduced by up to 48.5% in comparison to standard approaches

arXiv.org e-Print Archive