4,128 research outputs found
Technology Mapping for Circuit Optimization Using Content-Addressable Memory
The growing complexity of Field Programmable Gate Arrays (FPGA's) is leading to architectures with high input cardinality look-up tables (LUT's). This thesis describes a methodology for area-minimizing technology mapping for combinational logic, specifically designed for such FPGA architectures. This methodology, called LURU, leverages the parallel search capabilities of Content-Addressable Memories (CAM's) to outperform traditional mapping algorithms in both execution time and quality of results. The LURU algorithm is fundamentally different from other techniques for technology mapping in that LURU uses textual string representations of circuit topology in order to efficiently store and search for circuit patterns in a CAM. A circuit is mapped to the target LUT technology using both exact and inexact string matching techniques. Common subcircuit expressions (CSE's) are also identified and used for architectural optimization---a small set of CSE's is shown to effectively cover an average of 96% of the test circuits. LURU was tested with the ISCAS'85 suite of combinational benchmark circuits and compared with the mapping algorithms FlowMap and CutMap. The area reduction shown by LURU is, on average, 20% better compared to FlowMap and CutMap. The asymptotic runtime complexity of LURU is shown to be better than that of both FlowMap and CutMap
An Energy-efficient Capacitive-Memristive Content Addressable Memory
Content addressable memory is popular in the field of intelligent computing
systems with its searching nature. Emerging CAMs show a promising increase in
pixel density and a decrease in power consumption than pure CMOS solutions.
This article introduced an energy-efficient 3T1R1C TCAM cooperating with
capacitor dividers and RRAM devices. The RRAM as a storage element also acts as
a switch to the capacitor divider while searching for content. CAM cells
benefit from working parallel in an array structure. We implemented a 64 x 64
array and digital controllers to perform with an internal built-in clock
frequency of 875MHz. Both data searches and reads take 3x clock cycles. Its
worst average energy for data match is reported to be 1.71 fJ/bit-search and
the worst average energy for data miss is found with 4.69 fJ/bit-search. The
prototype is simulated and fabricated in 0.18 um technology with in-lab RRAM
post-processing. Such memory explores the charge domain searching mechanism and
can be applied to data centers that are power-hungry.Comment: This work has been submitted to the IEEE TCAS-I for possible
publication. Copyright may be transferred without notice, after which this
version may no longer be accessibl
- …