1,147 research outputs found
On the Implementation of Efficient Channel Filters for Wideband Receivers by Optimizing Common Subexpression Elimination Methods
No abstract availabl
Energy Aware Design and Analysis for Synchronous and Asynchronous Circuits
Power dissipation has become a major concern for IC designers. Various low power design techniques have been developed for synchronous circuits. Asynchronous circuits, however. have gained more interests recently due to their benefits in lower noise, easy timing control, etc. But few publications on energy reduction techniques for asynchronous logic are available.
Power awareness indicates the ability of the system power to scale with changing conditions and quality requirements. Scalability is an important figure-of-merit since it allows the end user to implement operational policy. just like the user of mobile multimedia equipment needs to select between better quality and longer battery operation time.
This dissertation discusses power/energy optimization and performs analysis on both synchronous and asynchronous logic. The major contributions of this dissertation include:
1 ) A 2-Dimensional Pipeline Gating technique for synchronous pipelined circuits to improve their power awareness has been proposed. This technique gates the corresponding clock lines connected to registers in both vertical direction (the data flow direction) and horizontal direction (registers within each pipeline stage) based on current input precision.
2) Two energy reduction techniques, Signal Bypassing & Insertion and Zero Insertion. have been developed for NCL circuits. Both techniques use Nulls to replace redundant Data 0\u27s based on current input precision in order to reduce the switching activity while Signal Bypassing & Insertion is for non-pipelined NCI, circuits and Zero Insertion is for pipelined counterparts. A dynamic active-bit detection scheme is also developed as an expansion.
3) Two energy estimation techniques, Equivalent Inverter Modeling based on Input Mapping in transistor-level and Switching Activity Modeling in gate-level, have been proposed. The former one is for CMOS gates with feedbacks and the latter one is for NCL circuits
Recommended from our members
Efficient FPGA implementation and power modelling of image and signal processing IP cores
This thesis was submitted for the degree of Doctor of Philosophy and awarded by Brunel University.Field Programmable Gate Arrays (FPGAs) are the technology of choice in a number ofimage
and signal processing application areas such as consumer electronics, instrumentation,
medical data processing and avionics due to their reasonable energy consumption, high performance, security, low design-turnaround time and reconfigurability. Low power FPGA
devices are also emerging as competitive solutions for mobile and thermally constrained platforms. Most computationally intensive image and signal processing algorithms also consume a lot of power leading to a number of issues including reduced mobility, reliability concerns and increased design cost among others. Power dissipation has become one of the most important challenges, particularly for FPGAs. Addressing this problem requires optimisation and awareness at all levels in the design flow. The key achievements of the
work presented in this thesis are summarised here. Behavioural level optimisation strategies have been used for implementing matrix product and inner product through the use of mathematical techniques such as Distributed Arithmetic (DA) and its variations including offset binary coding, sparse factorisation and novel vector level transformations. Applications to test the impact of these algorithmic and arithmetic transformations include the fast Hadamard/Walsh transforms and Gaussian mixture models. Complete design space exploration has been performed on these cores, and where appropriate, they have been shown to clearly outperform comparable existing implementations. At the architectural level, strategies such as parallelism, pipelining and systolisation have been successfully applied for the design and optimisation of a number of
cores including colour space conversion, finite Radon transform, finite ridgelet transform and circular convolution. A pioneering study into the influence of supply voltage scaling for FPGA based designs, used in conjunction with performance enhancing strategies such as parallelism and pipelining has been performed. Initial results are very promising and indicated significant potential for future research in this area.
A key contribution of this work includes the development of a novel high level power macromodelling technique for design space exploration and characterisation of custom IP cores for FPGAs, called Functional Level Power Analysis and Modelling (FLPAM). FLPAM
is scalable, platform independent and compares favourably with existing approaches. A hybrid, top-down design flow paradigm integrating FLPAM with commercially available design tools for systematic optimisation of IP cores has also been developed
A low-power quadrature digital modulator in 0.18um CMOS
Quadrature digital modulation techniques are widely used in modern communication systems because of their high performance and flexibility. However, these advantages come at the cost of high power consumption. As a result, power consumption has to be taken into account as a main design factor of the modulator.In this thesis, a low-power quadrature digital modulator in 0.18um CMOS is presented with the target system clock speed of 150 MHz. The quadrature digital modulator consists of several key blocks: quadrature direct digital synthesizer (QDDS), pulse shaping filter, interpolation filter and inverse sinc filter. The design strategy is to investigate different implementations for each block and compare the
power consumption of these implementations. Based on the comparison results, the implementation that consumes the lowest power will be chosen for each block. First of all, a novel low-power QDDS is proposed in the thesis. Power consumption
estimation shows that it can save up to 60% of the power consumption at 150 MHz system clock frequency compared with one conventional design. Power consumption estimation results also show that using two pulse shaping blocks to process
I/Q data, cascaded integrator comb (CIC) interpolation structure, and inverse sinc
filter with modified canonic signed digit (MCSD) multiplication consume less power than alternative design choices. These low-power blocks are integrated together to achieve a low-power modulator. The power consumption estimation after layout shows that it only consumes about 95 mW at 150 MHz system clock rate, which is much lower than similar commercial products. The designed modulator can provide a low-power solution for various quadrature modulators. It also has an output bandwidth from 0 to 75 MHz, configurable pulse shaping filters and interpolation filters, and an internal sin(x)/x correction filter
FPGA-Based Degradation and Reliability Monitor for Underground Cables
The online Remaining Useful Life (RUL) estimation of underground cables and their reliability analysis requires obtaining the cable failure time probability distribution. Monte Carlo (MC) simulations of complex thermal heating and electro-thermal degradation models can be employed for this analysis, but uncertainties need to be considered in the simulations, to produce accurate RUL expectation values and confidence margins for the results. The process requires performing large simulation sets, based on past temperature or load measurements and future load predictions. Field Programmable Gate Arrays (FPGAs) permit accelerating simulations for live analysis, but the thermal models involved are complex to be directly implemented in hardware logic. A new standalone FPGA architecture has been proposed for the fast and on-site degradation and reliability analysis of underground cables, based on MC simulation, and the effect of load uncertainties on the predicted cable End Of Life (EOL) has been analyzed from the results
Low Power Adaptive Equaliser Architectures for Wireless LMMSE Receivers
Power consumption requires critical consideration during system design for portable wireless
communication devices as it has a direct influence on the battery weight and volume required
for operation. Wideband Code Division Multiple Access (W-CDMA) techniques are favoured
for use in future generation mobile communication systems. This thesis investigates novel low
power techniques for use in system blocks within a W-CDMA adaptive linear minimum mean
squared error (LMMSE) receiver architecture. Two low power techniques are presented for
reducing power dissipation in the LMS adaptive filter, this being the main power consuming
block within this receiver. These low power techniques are namely the decorrelating transform,
this is a differential coefficient technique, and the variable length update algorithm which is a
dynamic tap-length optimisation technique.
The decorrelating transform is based on the principle of reducing the wordlength of filter
coefficients by using the computed difference between adjacent coefficients in calculation of
the filter output. The effect of reducing the wordlength of filter coefficients being presented to
multipliers in the filter is a reduction in switching activity within the multiplier thus reducing
power consumed. In the case of the LMS adaptive filter, with coefficients being continuously
updated, the decorrelating transform is applied to these calculated coefficients with minimal
hardware or computational overhead. The correlation between filter coefficients is exploited to
achieve a wordlength reduction from 16 bits down to 10 bits in the FIR filter block.
The variable length update algorithm is based on the principle of optimising the number of
operational filter taps in the LMS adaptive filter according to operating conditions. The number
of taps in operation can be increased or decreased dynamically according to the mean squared
error at the output of the filter. This algorithm is used to exploit the fact that when the SNR in
the channel is low the minimum mean squared error of the short equaliser is almost the same
as that of the longer equaliser. Therefore, minimising the length of the equaliser will not result
in poorer MSE performance and there is no disadvantage in having fewer taps in operation. If
fewer taps are in operation then switching will not only be reduced in the arithmetic blocks but
also in the memory blocks required by the LMS algorithm and FIR filter process. This reduces
the power consumed by both these computation intensive functional blocks. Power results are
obtained for equaliser lengths from 73 to 16 taps and for operation with varying input SNR.
This thesis then proposes that the variable length LMS adaptive filter is applied in the adaptive
LMMSE receiver to create a low power implementation. Power consumption in the receiver
is reduced by the dynamic optimisation of the LMS receiver coefficient calculation. A
considerable power saving is seen to be achieved when moving from a fixed length LMS
implementation to the variable length design. All design architectures are coded in Verilog
hardware description language at register transfer level (RTL). Once functional specification
of the design is verified, synthesis is carried out using either Synopsys DesignCompiler or
Cadence BuildGates to create a gate level netlist. Power consumption results are determined at
the gate level and estimated using the Synopsys DesignPower tool
The use of a reconfigurable functional cache in a digital signal processor: power and performance
Due to the computationally intensive nature of the tasks that digital signal processors (DSP) are required to perform it is desirable to decrease the time required to execute these tasks. Minimizing the execution time required for the various algorithms that are commonly and frequently executed (ex: FIR filters) will improve the overall performance. It is known that hardware is able to execute algorithms faster than software, however, due to the size limitations of embedded DSP, not all of the necessary algorithms can be implemented in hardware. A reconfigurable cache architecture in combination with a DSP is proposed as an alternative to increase algorithm performance by using reconfigurable hardware rather than dedicated hardware. Another important issue to consider for embedded processors is the power consumption of the DSP. Due to the fact that most embedded processors operate by battery power, energy efficiency is a necessity. This study looks at the power requirements of a DSP with reconfigurable cache to determine the viability of such an architecture in an embedded system. Others have shown that reconfigurable cache in conjunction with a general purpose processor improves performance for some DSP benchmarks. This study shows that a DSP/reconfigurable cache combination can achieve kernel performance gains ranging from 10-350 times that of a DSP architecture operating alone and can achieve overall benchmark speedups ranging from 1.02 to 1.91 times that of the existing DSP architecture. Further, relative power consumption results show that the power consumption of the reconfigurable architecture is approximately 85 to 95% of the current architecture (5-15% power savings) and attains energy savings ranging from approximately 14 to 50%
- …