Dependence of within-die delay variations on power supply voltage (V DD ) is measured down to 0.4 V. The V DD dependence of the within-die delay variation of manual layout and irregular auto place and route (P&R) layout are compared for the first time. The measured relative delay (=sigma/average) variation difference between the manual layout and the P&R layout decreases from 1.56% to 0.07% with reducing V DD from 1.2 V to 0.4 V, because the random delay variations due to the random transistor variations dominate total delay variations instead of the delay variations due to interconnect length variations at low V DD .
Introduction
Reduction of power supply voltage (V DD ) is an effective method for achieving ultra low power logic circuits, and maximum energy efficiency is achieved at low V DD (e.g., 340 mV [1] ). Thus, many works have been carried out on the low V DD operation of logic circuits [1] - [4] . Transistor variation is, however, more pronounced at low V DD [2] . This means that the design of logic circuit becomes more difficult at low V DD because the transistor variation makes it difficult for logic gate paths to meet timing constraints. Therefore, in this paper, the within-die delay variation dependence on V DD in several types of design under tests (DUT's) is discussed.
Many works on within-die delay variation have been carried such as [5] . The impact of methodology of physical layout on within-die delay variation, however, is not clear. Layout of logic circuits is usually designed by place and route (P&R) tools. In this case, the interconnect capacitance is larger than the manual layout. The effect of the auto P&R layout on delay variation is not clear at low voltages. Thus, DUT delay dependence on methodology of physical layout is mainly discussed in this paper.
Section 2 presents an all digital delay measurement circuit, and explains its operation. Section 3 shows the experimental results. The delay variation dependence on several types of DUT's are discussed. The manual layout and the auto P&R are also compared. Section 4 concludes this paper. When the DUT delay is shorter than the clock period (T), an error does not occur. On the other hand, when the DUT delay is longer than T, an error is detected and the Error Out signal changes from low to high. Error Out signal is converted to V DDH level by the level shifter and is stored in an error memory. Output of this error memory can be read using the scan chain. In this way, DUT delay is measured digitally. Figure 2 shows the chip micrograph. The die size is 1500 μm × 490 μm fabricated in 65 nm standard CMOS process. The test chip has two types of DUT's and each DUT type consists of 128 delay measurement units shown in Fig. 1(a) . This enables the measurement of within-die delay variation of 128 DUT's.
Various types of DUT's are implemented to survey dependence of DUT delay on gate type, gate width, and layout methodology. Regarding the gate type, ×1 size inverter and 1 size NAND are implemented. Regarding the gate width, ×0.5 size inverter and ×1 size inverter are implemented. Regarding the layout methodology, the layout of ×0.5 size inverter is designed using auto P&R tool, and compared with the manual layout. Each DUT includes 100 stages. This means that the measured within-die variation is 1/ √ 100 compared to the delay variation at each stage.
In the case of the manual layout, 128 paths of 100-stage inverters are in a line and the layout of each path is completely regular. In contrast, in the auto P&R layout 100×128 inverters are entangled and the layout of each path is irregular. Table 1 compares key features of the manual layout and the auto P&R layout. In case of manual layout, interconnect delay is small and there is no interconnect length difference. On the other hand, auto P&R has larger interconnect delay and interconnect length of each path is different.
Experimental Results
Figure 3(a) shows the measured histogram of within-die DUT delay at V DD =1.2 V. The auto P&R ×0.5 inverter has the largest variation of DUT delay because of large interconnect variation. Figure 3(b) shows the measured histogram of DUT delay at V DD =0.4 V. By comparing Figs. 3(a) and (b) , DUT delay and its delay variation at 0.4 V are larger than those at 1.2 V. At 0.4 V, the difference of delay variation between the manual layout and the auto P&R layout is reduced compared with that at 1.2 V. Figure 4 shows the dependence of average delay of 128 DUT's on V DD . This shows average DUT delay (μ) increases with lowering V DD . The delay of the manual layout inverter is always smaller than that of the auto P&R layout. The ×1 inverter is the fastest due to its high current drivability. Figure 5 shows the measured dependence of standard deviation (σ) of 128 DUT's on V DD . Although the difference of σ between auto P&R layout and other gates is large at high V DD , the difference reduces as V DD is reduced. In order to fairly compare delay variation, relative delay variation (σ/μ) is introduced. Figure 6 shows the measured dependence of relative delay variation (σ/μ) on V DD . This shows that the difference of relative delay variation between manual layout and auto P&R decreases from 1.56% to 0.07% when reducing V DD from 1.2 V to 0.4 V. This effect is explained by the following equations. Regarding manual layout, average delay (μ M ), standard deviation (σ M ) and relative delay variation (σ M /μ M ) is expressed as Eqs. (1)- (3),
where R T is resistance of transistor, ΔR T is standard deviation of R T , C T is transistor capacitance, and ΔC T is standard deviation of C T . The resistance and capacitance of the interconnect are ignored in Eq. (1). Regarding auto P&R layout, Average (μ A ), Standard deviation (σ A ) and relative delay variation (σ A /μ A ) of manual layout is expressed as (4)- (6),
where C I is capacitance of interconnection and ΔC I is stan- dard deviation of C I . The resistance of the interconnect is ignored in Eq. (4). In (3) and (6), ΔR T /R T term dominants σ M /μ M and σ A /μ A at low V DD , the transistor variation is larger than the interconnect length variation at low V DD . In this way, the difference of relative delay variation becomes small in low V DD region. In Fig. 6 , NAND has the smallest σ/μ, because the total gate size is the largest in four types of DUT's. More specifically, NAND consists of 4 transistors and other gates have 2 transistors. It also proved that σ/μ of ×1 inverter is smaller than ×0.5 manual layout inverter, because transistor variation is proportional to 1/ √ LW [6] . The spatial distribution of delay variation for the manual layout and the auto P&R layout is investigated. Figure 7 shows the measured spatial distribution of within-die DUT delay normalized by the average delay. This figure also shows the measured histogram of normalized DUT delay. Figure 7(a) shows the distribution of the manual layout inverter and Fig. 7(b) shows the distribution of the auto P&R layout inverter. In Fig. 7(a) , DUT delay has only random variations caused by the random transistor variations. The magnitude of the DUT delay variation at 0.4 V is larger than that at 1.2 V. Figure 7(b) shows that center location has larger delay than the both ends. This is caused by interconnect length difference in the auto P&R layout. In order to check the randomness of the spatial delay distribution at 0.4 V in Fig. 7(a) , the correlation coefficient of the measured delay of 128 DUT's of two dies at 0.4 V is calculated. The correlation coefficient is 0.023. Therefore, the measured variation of manual layout inverter is mainly caused by the random transistor variations. Figure 8 shows the measured and simulated spatial within-die DUT delay distributions at 1.2 V. DUT is the auto P&R layout inverter. The simulated results are obtained by Monte Carlo SPICE simulation after LPE. In this simulation the threshold voltage of each transistor is changed according to standard deviation calculated from A VT of this technology. The spatial delay distribution is similar between the measured and simulated results. This affirms that the longer delay seen at the center location is caused by the interconnect length difference.
Conclusion
V DD dependence of the within-die delay variation on layout methodology (manual layout and auto P&R layout) is investigated and measured for the first time. The relative delay (=σ/μ) variation difference between the manual layout and the P&R layout at 1.2 V is large (1.56%), because the delay variations due to interconnect length difference is larger than the random delay variations due to the random transistor variations. In contrast, the difference at 0.4 V is very small (0.07%), because the random delay variations due to random transistor variation increases with the reduced V DD and dominates the total delay variation.
