2,591 research outputs found

    Pipeline-Based Power Reduction in FPGA Applications

    Get PDF
    This paper shows how temporal parallelism has an important role in the power dissipation reduction in the FPGA field. Glitches propagation is blocked by the flip-flops or registers in the pipeline. Several multiplication structures are implemented over modern FPGAs, StratixII and Virtex4, comparing their results with and without pipeline and hardware duplication

    Low Power Processor Architectures and Contemporary Techniques for Power Optimization – A Review

    Get PDF
    The technological evolution has increased the number of transistors for a given die area significantly and increased the switching speed from few MHz to GHz range. Such inversely proportional decline in size and boost in performance consequently demands shrinking of supply voltage and effective power dissipation in chips with millions of transistors. This has triggered substantial amount of research in power reduction techniques into almost every aspect of the chip and particularly the processor cores contained in the chip. This paper presents an overview of techniques for achieving the power efficiency mainly at the processor core level but also visits related domains such as buses and memories. There are various processor parameters and features such as supply voltage, clock frequency, cache and pipelining which can be optimized to reduce the power consumption of the processor. This paper discusses various ways in which these parameters can be optimized. Also, emerging power efficient processor architectures are overviewed and research activities are discussed which should help reader identify how these factors in a processor contribute to power consumption. Some of these concepts have been already established whereas others are still active research areas. © 2009 ACADEMY PUBLISHER

    A Technology Aware Magnetic QCA NCL-HDL Architecture

    Get PDF
    Magnetic Quantum Dot Cellular Automata (MQCA) have been recently proposed as an attractive implementation of QCA as a possible CMOS technology substitute. Marking a difference with respect to previous contributions, in this work we show that it is possible to develop and describe complex MQCA computational blocks strongly linking technology and having in mind a feasible realization. Thus, we propose a practicable clock structure for MQCA baptised "snake-clock", we stick to this while developing a system level Hardware Description Language (HDL) based description of an architectural block, and we suggest a delay insensitive Null Convention Logic (NCL) implementation for the magnetic case so that the "layout=timing" problem can be solved. Furthermore we include in our model aspects critically related to technology and real production, that is timing, power and layout, and we present the preliminary steps of our experiments, the results of which will be included in the architecture descriptio

    PieceTimer: A Holistic Timing Analysis Framework Considering Setup/Hold Time Interdependency Using A Piecewise Model

    Full text link
    In static timing analysis, clock-to-q delays of flip-flops are considered as constants. Setup times and hold times are characterized separately and also used as constants. The characterized delays, setup times and hold times, are ap- plied in timing analysis independently to verify the perfor- mance of circuits. In reality, however, clock-to-q delays of flip-flops depend on both setup and hold times. Instead of being constants, these delays change with respect to different setup/hold time combinations. Consequently, the simple ab- straction of setup/hold times and constant clock-to-q delays introduces inaccuracy in timing analysis. In this paper, we propose a holistic method to consider the relation between clock-to-q delays and setup/hold time combinations with a piecewise linear model. The result is more accurate than that of traditional timing analysis, and the incorporation of the interdependency between clock-to-q delays, setup times and hold times may also improve circuit performance.Comment: IEEE/ACM International Conference on Computer-Aided Design (ICCAD), November 201

    Static noise margin analysis for CMOS logic cells in near-threshold

    Get PDF
    The advancement of semiconductor technology enabled the fabrication of devices with faster switching activity and chips with higher integration density. However, these advances are facing new impediments related to energy and power dissipation. Besides, the increasing demand for portable devices leads the circuit design paradigm to prioritize energy efficiency instead of performance. Altogether, this scenario motivates engineers towards reducing the supply voltage to the near and subthreshold regime to increase the lifespan of battery-powered devices. Even though operating in these regime offer interesting energy-frequency trade-offs, it brings challenges concerning noise tolerance. As the supply voltage reduces, the available noise margins decrease, and circuits become more prone to functional failures. In addition, near and subthreshold circuits are more susceptible to manufacturing variability, hence further aggravating noise issues. Other issues, such as wire minimization and gate fan-out, also contribute to the relevance of evaluating the noise margin of circuits early in the design Accordingly, this work investigates how to improve the static noise margin of digital synchronous circuits that will operate at the near/subthreshold regime. This investigation produces a set of three original contributions. The first is an automated tool to estimate the static noise margin of CMOS combinational cells. The second contribution is a realistic static noise margin estimation methodology that considers process-voltage-temperature variations. Results show that the proposed methodology allows to reduce up to 70% of the static noise margin pessimism. Finally, the third contribution is the noise-aware cell design methodology and the inclusion of a noise evaluation of complex circuits during the logic synthesis. The resulting library achieved higher static noise margin (up to 24%) and less spread among different cells (up to 62%).Os avanços na tecnologia de semicondutores possibilitou que se fabricasse dispositivos com atividade de chaveamento mais rĂĄpida e com maior capacidade de integração de transistores. Estes avanços, todavia, impuseram novos empecilhos relacionados com a dissipação de potĂȘncia e energia. AlĂ©m disso, a crescente demanda por dispositivos portĂĄteis levaram Ă  uma mudança no paradigma de projeto de circuitos para que se priorize energia ao invĂ©s de desempenho. Este cenĂĄrio motivou Ă  reduzir a tensĂŁo de alimentação com qual os dispositivos operam para um regime prĂłximo ou abaixo da tensĂŁo de limiar, com o objetivo de aumentar sua duração de bateria. Apesar desta abordagem balancear caracterĂ­sticas de performance e energia, ela traz novos desafios com relação a tolerĂąncia Ă  ruĂ­do. Ao reduzirmos a tensĂŁo de alimentação, tambĂ©m reduz-se a margem de ruĂ­do disponĂ­vel e, assim, os circuitos tornam-se mais suscetĂ­veis Ă  falhas funcionais. Somado Ă  este efeito, circuitos com tensĂ”es de alimentação nestes regimes sĂŁo mais sensĂ­veis Ă  variaçÔes do processo de fabricação, logo agravando problemas com ruĂ­do. Existem tambĂ©m outros aspectos, tais como a miniaturização das interconexĂ”es e a relação de fan-out de uma cĂ©lula digital, que incentivam a avaliação de ruĂ­do nas fases iniciais do projeto de circuitos integrados Por estes motivos, este trabalho investiga como aprimorar a margem de ruĂ­do estĂĄtica de circuitos sĂ­ncronos digitais que irĂŁo operar em tensĂ”es no regime de tensĂŁo prĂłximo ou abaixo do limiar. Esta investigação produz um conjunto de trĂȘs contribuiçÔes originais. A primeira Ă© uma ferramenta capaz de avaliar automaticamente a margem de ruĂ­do estĂĄtica de cĂ©lulas CMOS combinacionais. A segunda contribuição Ă© uma metodologia realista para estimar a margem de ruĂ­do estĂĄtica considerando variaçÔes de processo, tensĂŁo e temperatura. Os resultados obtidos mostram que a metodologia proposta permitiu reduzir atĂ© 70% do pessimismo das margens de ruĂ­do estĂĄtica, Por Ășltimo, a terceira contribuição Ă© um fluxo de projeto de cĂ©lulas combinacionais digitais considerando ruĂ­do, e uma abordagem para avaliar a margem de ruĂ­do estĂĄtica de circuitos complexos durante a etapa de sĂ­ntese lĂłgica. A biblioteca de cĂ©lulas resultante deste fluxo obteve maior margem de ruĂ­do (atĂ© 24%) e menor variação entre diferentes cĂ©lulas (atĂ© 62%)

    Quantum Dot Cellular Automata Check Node Implementation for LDPC Decoders

    Get PDF
    The quantum dot Cellular Automata (QCA) is an emerging nanotechnology that has gained significant research interest in recent years. Extremely small feature sizes, ultralow power consumption, and high clock frequency make QCA a potentially attractive solution for implementing computing architectures at the nanoscale. To be considered as a suitable CMOS substitute, the QCA technology must be able to implement complex real-time applications with affordable complexity. Low density parity check (LDPC) decoding is one of such applications. The core of LDPC decoding lies in the check node (CN) processing element which executes actual decoding algorithm and contributes toward overall performance and complexity of the LDPC decoder. This study presents a novel QCA architecture for partial parallel, layered LDPC check node. The CN executes Normalized Min Sum decoding algorithm and is flexible to support CN degree dc up to 20. The CN is constructed using a VHDL behavioral model of QCA elementary circuits which provides a hierarchical bottom up approach to evaluate the logical behavior, area, and power dissipation of the whole design. Performance evaluations are reported for the two main implementations of QCA i.e. molecular and magneti
    • 

    corecore