Search CORE

68,770 research outputs found

Content addressable memory project

Author: Hall J. Storrs
Levy Saul
Miyake Keith M.
Smith Donald E.
Publication venue
Publication date
Field of study

A parameterized version of the tree processor was designed and tested (by simulation). The leaf processor design is 90 percent complete. We expect to complete and test a combination of tree and leaf cell designs in the next period. Work is proceeding on algorithms for the computer aided manufacturing (CAM), and once the design is complete we will begin simulating algorithms for large problems. The following topics are covered: (1) the practical implementation of content addressable memory; (2) design of a LEAF cell for the Rutgers CAM architecture; (3) a circuit design tool user's manual; and (4) design and analysis of efficient hierarchical interconnection networks

NASA Technical Reports Server

A processing element architecture for high-density focal plane analog programmable array processors

Author: Domínguez Castro Rafael
Espejo Meana Servando Carlos
Liñán Cembrano Gustavo
Rodríguez Vázquez Ángel Benito
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2002
Field of study

The architecture of the elementary Processing Element - PE- used in a recently designed 128×128 Focal Plane Analog Programmable Array Processor is presented. The PE architecture contains the required building blocks to implement bifurcated data flow vision algorithms based on the execution of 3 × 3 convolution masks. The vision chip has been implemented in a standard 0.35μm CMOS technology. The main PE related figures are: 180 cells/mm2, 18 MOPS/cell, and 180 μW/cell.Office of Naval Research (USA) N68171-98-C-9004Euopean Union IST-1999-19007Comisión Interministerial de Ciencia y Tecnología TIC1 999-082

idUS. Depósito de Investigación Universidad de Sevilla

Performance comparison of single-precision SPICE Model-Evaluation on FPGA, GPU, Cell, and multi-core processors

Author: DeHon André
Kapre Nachiket
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2009
Field of study

Automated code generation and performance tuning techniques for concurrent architectures such as GPUs, Cell and FPGAs can provide integer factor speedups over multi-core processor organizations for data-parallel, floating-point computation in SPICE model-evaluation. Our Verilog AMS compiler produces code for parallel evaluation of non-linear circuit models suitable for use in SPICE simulations where the same model is evaluated several times for all the devices in the circuit. Our compiler uses architecture specific parallelization strategies (OpenMP for multi-core, PThreads for Cell, CUDA for GPU, statically scheduled VLIW for FPGA) when producing code for these different architectures. We automatically explore different implementation configurations (e.g. unroll factor, vector length) using our performance-tuner to identify the best possible configuration for each architecture. We demonstrate speedups of 3- 182times for a Xilinx Virtex5 LX 330T, 1.3-33times for an IBM Cell, and 3-131times for an NVIDIA 9600 GT GPU over a 3 GHz Intel Xeon 5160 implementation for a variety of single-precision device models

Crossref

Caltech Authors

DR-NTU (Digital Repository of NTU)

Channelized Time-and Space-integrating Acousto-optical Processor

Author
Publication venue
Publication date
Field of study

A novel channelized, hybrid time- and space-integrating acousto-optic (AO) spectrum analyzer is described. The architecture consists of two acousto-optic cells in a crossed cell configuration. The first acousto-optic cell is a wide bandwidth device that performs space-integrating spectral analysis and channelizes signals according to carrier frequency. The second acousto-optic cell, in conjunction with a modulated source, performs time-integrating spectral analysis of the signal envelope using the chirp algorithm. One possible application of the processor is to determine the carrier frequency and pulse repetition frequency (PRF) of received radar signals.Georgia Tech Research Corporatio

Scholarly Materials And Research @ Georgia Tech

A 16-bit CORDIC rotator for high-performance wireless LAN

Author: Banerjee Swapna
Grass Eckhard
Krstic Milos
Maharatna Koushik
Troya Alfonso
Publication venue
Publication date: 01/01/2004
Field of study

In this paper we propose a novel 16-bit low power CORDIC rotator that is used for high-speed wireless LAN. The algorithm converges to the final target angle by adaptively selecting appropriate iteration steps while keeping the scale factor virtually constant. The VLSI architecture of the proposed design eliminates the entire arithmetic hardware in the angle approximation datapath and reduces the number of iterations by 50% on an average. The cell area of the processor is 0.7 mm2 and it dissipates 7 mW power at 20 MHz frequency

Southampton (e-Prints Soton)

dRail: a novel physical layout methodology for power gated circuits

Author: Al-Hashimi Bashir M.
Biggs John
Flynn David
Mistry Jatin N.
Myers James
Publication venue
Publication date: 08/08/2012
Field of study

In this paper we present a physical layout methodology, called dRail, to allow power gated and non-power gated cells to be placed next to each other. This is unlike traditional voltage area layout which separates cells to prevent shorting of power supplies leading to impact on area, routing and power. To implement dRail, a modified standard cell architecture and physical layout is proposed. The methodology is validated by implementing power gating on the data engine in an ARM Cortex-A5 processor using a 65nm library, and shows up to 38% reduction in area cost when compared to traditional voltage area layou

Southampton (e-Prints Soton)