8,339 research outputs found
Advanced Timing and Synchronization Methodologies for Digital VLSI Integrated Circuits
This dissertation addresses timing and synchronization methodologies that are critical to the design, analysis and optimization of high-performance, integrated digital VLSI systems. As process sizes shrink and design complexities increase, achieving timing closure for digital VLSI circuits becomes a significant bottleneck in the integrated circuit design flow. Circuit designers are motivated to investigate and employ alternative methods to satisfy the timing and physical design performance targets. Such novel methods for the timing and synchronization of complex circuitry are developed in this dissertation and analyzed for performance and applicability.Mainstream integrated circuit design flow is normally tuned for zero clock skew, edge-triggered circuit design. Non-zero clock skew or multi-phase clock synchronization is seldom used because the lack of design automation tools increases the length and cost of the design cycle. For similar reasons, level-sensitive registers have not become an industry standard despite their superior size, speed and power consumption characteristics compared to conventional edge-triggered flip-flops.In this dissertation, novel design and analysis techniques that fully automate the design and analysis of non-zero clock skew circuits are presented. Clock skew scheduling of both edge-triggered and level-sensitive circuits are investigated in order to exploit maximum circuit performances. The effects of multi-phase clocking on non-zero clock skew, level-sensitive circuits are investigated leading to advanced synchronization methodologies. Improvements in the scalability of the computational timing analysis process with clock skew scheduling are explored through partitioning and parallelization.The integration of the proposed design and analysis methods to the physical design flow of integrated circuits synchronized with a next-generation clocking technology-resonant rotary clocking technology-is also presented. Based on the design and analysis methods presented in this dissertation, a computer-aided design tool for the design of rotary clock synchronized integrated circuits is developed
메쉬 기반의 클락 네트워크 설계 방법론
학위논문 (박사)-- 서울대학교 대학원 : 전기·컴퓨터공학부, 2015. 2. 김태환.The clock distribution network in a synchronous digital circuit delivers a clock signal to every storage element i.e., clock sink in the circuit. However, since the continued technology scaling increases PVT (process-voltage-temperature) variation, the increase of clock skew variation is highly likely to cause performance degradation or system failure at run time. Recently, to mitigate the clock skew variation, many researchers have taken a profound interest in the clock mesh network. However, though the structure of clock mesh network is excellent in tolerating timing variation, it demands significantly high power consumption due to the use of excessive mesh wire and buffer resources. Thus, optimizing the resources required in the mesh clock synthesis while maintaining the variation tolerance is crucially important. The three major tasks that greatly affect the cost of resulting clock mesh are (1) mesh segment allocation, (2) mesh buffer allocation and sizing, and (3) clock sink binding to mesh segments. Previous clock mesh optimization approaches solve the three tasks sequentially, one by one at a time, to manage the run time complexity of the tasks at the expense of losing the quality of results. However, since the three tasks are tightly inter-related, simultaneously optimizing all three tasks is essential, if the run time is ever permitted, to synthesize an economical clock mesh network. In this dissertation, we propose an approach which is able to tackle the problem in an integrated fashion by combining the three tasks into an iterative framework of incremental updates and solving them simultaneously to find a globally optimal allocation of mesh resources while taking into account the clock skew tolerance constraints. The core parts of this dissertation are a precise analysis on the relation among the resource optimization tasks and an establishment of mechanism for effective and efficient integration of the tasks. In particular, to handle the run time problem, we propose a set of speed-up techniques i.e., modeling RC circuit for eliminating redundant matrix multiplications, exploiting sliding window scheme, and fast buffer sizing effect estimation, which are fitted into our context of fast clock skew estimation in mesh resource optimization as well as an invention of early decision policies. In summary, this dissertation presents the efficient design methodology for clock mesh synthesis with consideration on integration of three tasks and reduction of runtime complexity.Abstract i
Contents iii
List of Figures vi
List of Tables x
1 Introduction 1
1.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
1.2 Contributions of This Dissertation . . . . . . . . . . . . . . . . . . . 3
2 Background 5
2.1 Clock Distribution Network . . . . . . . . . . . . . . . . . . . . . . . 5
2.2 Clock Network Topologies . . . . . . . . . . . . . . . . . . . . . . . 6
2.3 Design Metrics of Clock Network . . . . . . . . . . . . . . . . . . . 7
2.4 The Effects of Variations on Clock Skew . . . . . . . . . . . . . . . . 9
3 Clock Mesh Synthesis Flow 12
3.1 Elements of Clock Mesh . . . . . . . . . . . . . . . . . . . . . . . . 12
3.2 Conventional Clock Mesh Synthesis Overview . . . . . . . . . . . . . 13
3.3 Initial Grid Generation . . . . . . . . . . . . . . . . . . . . . . . . . 13
3.4 Mesh Buffer Placement and Sizing . . . . . . . . . . . . . . . . . . . 14
3.5 Clock Mesh Optimization . . . . . . . . . . . . . . . . . . . . . . . . 17
4 Integrated Resource Allocation and Binding in Clock Mesh Synthesis 19
4.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 19
4.2 Observations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23
4.3 Framework of Clock Mesh Optimization . . . . . . . . . . . . . . . . 26
4.3.1 Incremental Resource Updates . . . . . . . . . . . . . . . . . 29
4.3.2 Constraints for Variation Tolerance . . . . . . . . . . . . . . 34
4.3.3 Early Decision Policies . . . . . . . . . . . . . . . . . . . . . 38
4.3.4 Time Complexity Analysis . . . . . . . . . . . . . . . . . . . 39
4.4 Fast Clock Skew Estimation Techniques . . . . . . . . . . . . . . . . 40
4.4.1 Partially Reusing Matrix Multiplication for Incremental Updates 41
4.4.2 Adopting Sliding Window Scheme . . . . . . . . . . . . . . . 43
4.4.3 Adjusting Delay Caused by Buffer Resizing . . . . . . . . . . 44
4.5 Experimental Results . . . . . . . . . . . . . . . . . . . . . . . . . . 46
4.5.1 Experimental Environments . . . . . . . . . . . . . . . . . . 46
4.5.2 Resource Requirement and Variation Tolerance Comparison . 48
4.5.3 Comparison with Clock Mesh Optimization using Worst Case Timing Analysis of Commercial Tool . . . . . . . . . . . . . 56
4.5.4 Analysis of the Effect of Proposed Techniques . . . . . . . . 58
4.5.5 Run Time Analysis . . . . . . . . . . . . . . . . . . . . . . . 61
4.5.6 Accuracy and Run Time of Fast Clock Skew Estimation . . . 63
4.5.7 Electromigration Analysis . . . . . . . . . . . . . . . . . . . 68
4.5.8 Run-time Analysis in Multi-thread Computing Environment . 70
4.6 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 72
5 Conclusion 74
Abstract in Korean 84Docto
Clock Distribution Network Optimization by Sequential Quadratic Programing
Clock mesh is widely used in microprocessor designs for achieving low clock
skew and high process variation tolerance. Clock mesh optimization is a very diffcult
problem to solve because it has a highly connected structure and requires accurate
delay models which are computationally expensive.
Existing methods on clock network optimization are either restricted to clock
trees, which are easy to be separated into smaller problems, or naive heuristics based
on crude delay models.
A clock mesh sizing algorithm, which is aimed to minimize total mesh wire area
with consideration of clock skew constraints, has been proposed in this research work.
This algorithm is a systematic solution search through rigorous Sequential Quadratic
Programming (SQP). The SQP is guided by an efficient adjoint sensitivity analysis
which has near-SPICE(Simulation Program for Integrated Circuits Emphasis)-level
accuracy and faster-than-SPICE speed.
Experimental results on various benchmark circuits indicate that this algorithm
leads to substantial wire area reduction while maintaining low clock skew in the clock
mesh. The reduction in mesh area achieved is about 33%
Variation and power issues in VLSI clock networks
Clock Distribution Network (CDN) is an important component of any synchronous logic circuit. The function of CDN is to deliver the clock signal to the clock
sinks. Clock skew is defined as the difference in the arrival time of the clock signal at
the clock sinks. Higher uncertainty in skew (due to PVT variations) degrades circuit
performance by decreasing the maximum possible delay between any two sequential
elements. Aggressive frequency scaling has also led to high power consumption especially in CDN. This dissertation addresses variation and power issues in the design of
current and potential future CDN. The research detailed in this work presents algorithmic techniques for the following problems: (1) Variation tolerance in useful skew
design, (2) Link insertion for buffered clock nets, (3) Methodology and algorithms for
rotary clocking and (4) Clock mesh optimization for skew-power trade off.
For clock trees this dissertation presents techniques to integrate the different
aspects of clock tree synthesis (skew scheduling, abstract topology and layout embedding) into one framework- tolerance to variations. This research addresses the issues
involved in inserting cross-links in a buffered clock tree and proposes design criteria
to avoid the risk of short-circuit current. Rotary clocking is a promising new clocking
scheme that consists of unterminated rings formed by differential transmission lines.
Rotary clocking achieves reduction in power dissipation clock skew. This dissertation
addresses the issues in adopting current CAD methodology to rotary clocks. Alternative methodology and corresponding algorithmic techniques are detailed. Clock
mesh is a popular form of CDN used in high performance systems. The problem
of simultaneous sizing and placement of mesh buffers in a clock mesh is addressed.
The algorithms presented remove the edges from the clock mesh to trade off skew
tolerance for low power.
For clock trees as well as link insertion, our experiments indicate significant reduction in clock skew due to variations. For clock mesh, experimental results indicate
18.5% reduction in power with 1.3% delay penalty on a average. In summary, this dissertation details methodologies/algorithms that address two critical issues- variation
and power dissipation in current and potential future CDN
Radiation safety based on the sky shine effect in reactor
In the reactor operation, neutrons and gamma rays are the most dominant radiation.
As protection, lead and concrete shields are built around the reactor. However, the radiation
can penetrate the water shielding inside the reactor pool. This incident leads to the occurrence
of sky shine where a physical phenomenon of nuclear radiation sources was transmitted
panoramic that extends to the environment. The effect of this phenomenon is caused by the
fallout radiation into the surrounding area which causes the radiation dose to increase. High
doses of exposure cause a person to have stochastic effects or deterministic effects. Therefore,
this study was conducted to measure the radiation dose from sky shine effect that scattered
around the reactor at different distances and different height above the reactor platform. In this
paper, the analysis of the radiation dose of sky shine effect was measured using the
experimental metho
High-performance and Low-power Clock Network Synthesis in the Presence of Variation.
Semiconductor technology scaling requires continuous evolution of all aspects of physical
design of integrated circuits. Among the major design steps, clock-network synthesis
has been greatly affected by technology scaling, rendering existing methodologies inadequate.
Clock routing was previously sufficient for smaller ICs, but design difficulty and
structural complexity have greatly increased as interconnect delay and clock frequency increased
in the 1990s. Since a clock network directly influences IC performance and often
consumes a substantial portion of total power, both academia and industry developed synthesis
methodologies to achieve low skew, low power and robustness from PVT variations.
Nevertheless, clock network synthesis under tight constraints is currently the least automated
step in physical design and requires significant manual intervention, undermining
turn-around-time. The need for multi-objective optimization over a large parameter space
and the increasing impact of process variation make clock network synthesis particularly
challenging.
Our work identifies new objectives, constraints and concerns in the clock-network synthesis
for systems-on-chips and microprocessors. To address them, we generate novel
clock-network structures and propose changes in traditional physical-design flows. We
develop new modeling techniques and algorithms for clock power optimization subject
to tight skew constraints in the presence of process variations. In particular, we offer
SPICE-accurate optimizations of clock networks, coordinated to reduce nominal skew below
5 ps, satisfy slew constraints and trade-off skew, insertion delay and power, while
tolerating variations. To broaden the scope of clock-network-synthesis optimizations, we
propose new techniques and a methodology to reduce dynamic power consumption by
6.8%-11.6% for large IC designs with macro blocks by integrating clock network synthesis
within global placement. We also present a novel non-tree topology that is 2.3x more
power-efficient than mesh structures. We fuse several clock trees to create large-scale redundancy
in a clock network to bridge the gap between tree-like and mesh-like topologies.
Integrated optimization techniques for high-quality clock networks described in this dissertation
strong empirical results in experiments with recent industry-released benchmarks
in the presence of process variation. Our software implementations were recognized with
the first-place awards at the ISPD 2009 and ISPD 2010 Clock-Network Synthesis Contests
organized by IBM Research and Intel Research.Ph.D.Electrical EngineeringUniversity of Michigan, Horace H. Rackham School of Graduate Studieshttp://deepblue.lib.umich.edu/bitstream/2027.42/89711/1/ejdjsy_1.pd
- …