Power supply noise is becoming more and more influential on timing, though noise aware timing analysis has not been well established yet, because of several difficulties such as its dependency on input vectors and dynamic behavior. This paper proposes a static timing analysis considering power supply noise in which the dependency of noise on input vectors and spatial and temporal correlations are handled in a statistical manner. We construct a statistical model of power supply voltage that dynamically varies with spatial and temporal correlation, and represent it as a set of uncorrelated variables. We demonstrate that power voltage variation is highly correlated and adopting principal component analysis as an orthogonalization technique is effective in variable reduction. Experiments confirm the validity of our model and the accuracy of timing analysis. We also discuss the accuracy and CPU time in association with variable reduction.
INTRODUCTION
In nano-meter technology era, manufacturing variability fluctuates circuit performance significantly, and variation-aware timing analysis has been intensively studied [1, 2, 3] . In addition, timing verification considering power/ground noise has been eagerly demanded. Power supply noise is expected to become a more and more serious problem on timing in the future because of increasing current consumption and decreased power supply voltage. A severe obstacle for noise aware timing analysis is the difficulty to identify Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. To copy otherwise, to republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. ISPD'08, April 13-16, 2008 , Portland, Oregon, USA. Copyright 2008 ACM 978-1-60558-048-7/08/04 ...$5.00. the worst-case noise for timing. Power supply noise depends on given input signals and internal register states, and changes within a clock cycle as well as cycle by cycle. As circuit scale becomes larger, the combinations of input signals and register states increase exponentially, which makes it prohibitively expensive to find the true worst-case noise.
Dynamic timing simulation with power/ground network and input patterns can provide timing information with noise. However, dynamic timing analysis can not cover all paths, and verifies part of path delays, which is a well known drawback. Even if a test pattern that maximizes voltage drop is found, the vector does not necessarily correspond to the worst-case for timing, because the circuit structure and the layout are also associated with the timing. Preparing effective test vectors for noise aware dynamic timing verification is a computationally expensive problem, and it is impossible to solve in a practical time.
To consider the impact of power/ground noise on timing, static timing analysis (STA) is commonly performed supposing that a constant (DC) voltage drop, for example the maximum voltage variation, is given to all gates. This approach is computationally efficient, but there is no systematic way to determine the voltage drop without optimism and excessive pessimism. When the maximum voltage drop is given to all gates, the estimated timing is too pessimistic, which causes timing convergence problem and overdesign. To solve this problem, timing analysis considering dynamic voltage variation has been proposed [4, 5] , and some commercial tools are available. However, it is necessary to obtain or assume the worst-case noise, which means the pattern-dependency problem of power supply noise remains unsolved.
Although finding the exact worst-case noise for timing is extremely difficult, designers have to assure that the designed circuit operates at the target frequency in a quantitative manner before fabrication. Therefore, a systematic technique that can estimate not exact but reasonable worst-case timing is necessary. Path-based methods to estimate the maximum delay have been proposed [6, 7, 8] . These methods, however, have to be applied to many potential critical paths, and hence the computational cost could be very high. Recently, Ref. [9] has proposed an approach to estimate the effect of power supply noise on timing by solving an optimization problem. The problem is formulated as a non-linear delay maximization problem under the given constraints of current consumption. However, the circuit size reported in [9] is limited, and the applicability to larger circuits is not clear. As another approach, a statistical treatment has been introduced into power supply noise aware timing analysis [10, 11, 12] . Reference [10] estimates voltage variation by convolution of statistically modeled current consumption and impulse response of power/ground network. In [11] , first, authors derive the average and the standard deviation of every block and the correlation coefficients between blocks, and then estimate the delay. Reference [12] focuses on spatial correlation of power supply noise and proposes to use principal component analysis (PCA) for modeling of power supply noise. Path delay distribution is then computed with uncorrelated variables. We know an argument that timing failure due to power supply noise must be verified in a deterministic manner, since a certain input pattern necessarily causes a problem. However, an exact verification in a vast input pattern and register state space is impossible, and thus we believe that a statistical approach helps designers estimate timing performance quantitatively and systematically.
In this paper, we propose a statistical timing analysis method considering dynamic power supply noise. The overall flow of the proposed method is shown in Fig. 1 . The proposed method models power/ground noise statistically. Spatially and temporally correlated power supply noise is transformed to uncorrelated variables by using orthogonalization techniques, such as PCA and independent component analysis (ICA). We then perform statistical static timing analysis using the derived statistical power/ground noise model. Statistical timing analysis with a statistical model of power supply noise with PCA has been proposed by Kim[12] , however, it is a preliminary work and several important issues, such as nonGaussian distribution shape of variables and dynamic voltage fluctuation within a clock cycle, remain unsolved or not addressed. Further, variable reduction by PCA due to tight correlation among variables is not aggressively exploited for reducing CPU cost of statistical timing analysis.
In this work, we experimentally demonstrate that PCA-based statistical modeling, with some distribution transformation techniques (e.g. Box-Cox transformation) if necessary, works well, though the distribution of power supply noise is not exactly Gaussian. To take dynamic noise behavior within a clock cycle into consideration, we propose to discretize a clock cycle into several time slots, and assign a random variable to each time slot to construct a statistical model of dynamic power supply noise. We focus on an observation that power supply noise is highly correlated not only spatially but also temporally, and model power supply noise with a small set of random variables, which helps to reduce CPU time for timing verification. We also demonstrate that adaptive spatial discretization for variable assignment reduces PCA cost significantly. You might think that SSTA with PCA is a well-known approach, but this similarity is a big advantage to keep the compatibility to conventional SSTA, and the proposed method can be easily integrated into SSTA tools for manufacturing variability, i.e. statistical timing analysis considering both manufacturing variability and power supply noise is easily realized in a unified approach.
This paper assumes that information on power supply noise needed for the statistical modeling is given. Generally, the estimation of power supply noise is not easy. We, however, think that sophisticated methods, such as impulse response and convolution with logic simulation results for power estimation and functional verification [9] give us the information. An efficient information preparation is another research topic to study, and hence we do not discuss it further in this paper.
This paper is organized as follows. Section 2 discusses difficulties of timing analysis considering power supply noise. We show how to statistically model power/ground voltage variation in Section 3 . Section 4 explains SSTA procedure with the proposed noise model. We demonstrate experimental results in Section 5, and Section 6 concludes the discussion.
DIFFICULTIES OF NOISE AWARE TIMING ANALYSIS AND PROPOSED APPROACH
When performing timing analysis considering power supply noise, a problem is that the maximum voltage drop does not necessarily cause the worst-case delay. The supply voltage changes spatially and temporally within a clock cycle as well as cycle by cycle. The observation of power supply noise only can not necessarily detect a timing failure due to power supply noise, because the timing depends on the position of critical paths as mentioned in [9] . Figure 2 shows an example that the maximum voltage drop does not always cause the worst delay. The solid lines and broken lines represent power supply noise of cycle #(c) and cycle #(d) respectively. Suppose a critical path exists in area A. In this case, the delay of cycle #(c) would be worse than that of cycle #(d). However, if a critical path is located in area B, it is unclear which cycle is the worse-case for timing in this chip. In area B, the noise of cycle #(c) delays the gate switching at the beginning of clock cycle, whereas it less affects the switching at the latter half of clock cycle. On the other hand, in cycle #(d), the switching at the latter half is much slowed down. Thus, the voltage fluctuation within a clock cycle can influence the gate delay much or less depending on the switching timing, where the switching timing is basically determined by the circuit structure.
The noise waveform shape varies according to given input vectors. As mentioned earlier, the space in input vectors and internal logic states is tremendously huge and can not be explored thoroughly. We thus model power supply noise statistically preserving the spatial and temporal correlation, and apply it to a statistical static timing analysis. The proposed approach can solve the problem described above, i.e. the position of critical paths and spatial and temporal difference of power supply noise are considered simultaneously. We report an experimental result that the maximum noise does not necessarily involve the worst-case delay in Section 5.
Principal component analysis (PCA), which is one of orthogonalization methods, has a preferable advantage. Highly correlated Gaussian variables are transformed into a small set of Gaussian variables with a small sacrifice of accuracy. We here show an example that power supply noise is highly correlated in space. We evaluated power supply noise of an FPU circuit in 1×1mm 2 area [13] , and set 10×10 variables associated with spatially divided 10×10 grids. Each variable represents cycle-average supply voltage of VDD side at each grid. The evaluation condition is the same with experiments in Section 4. Figure 3 shows the histogram of correlation coefficients between variables. We can see that variables are highly correlated, and 36.2% of coefficients are above 0.9 indeed. We thus expect that a compact statistical model with a small number of variables is derived. A small number of variables enable us to perform SSTA and Monte Carlo simulation efficiently. On the other hand, when we choose current consumption as a variable instead of supply voltage, the correlation between the variables is weaker than power supply voltage as shown in Fig. 4 , and hence variable reduction can not be efficient. Although current consumptions at the adjacent nodes are not much correlated, the impedance of the power network strengthens the spatial correlation of power supply voltage. When using PCA, we have to pay attention to the distribution shape of variables, because PCA assumes Gaussian distribution. A problem to apply PCA to power supply noise modeling is nonGaussian noise distribution, which may cause undesirable modeling error. Solutions to this problem include Gaussianizing the variables, e.g. Box-Cox transformation [14] . This transformation improves Gaussianity of the variable. In this paper, we experimentally demonstrate that PCA-based modeling is reasonable from the standpoint of practical use, though, rigidly speaking, the distribution is not Gaussian. When the variables are quite far from Gaussian distribution, another orthogonalization technique, such as independent component analysis, should be applied, which is similar to [3] .
An advantage of the proposed method using variable orthogonalization is a compatibility with SSTA developed for manufacturing variability[1, 2, 3] . The derived statistical model of power supply noise is expressed in a similar manner with manufacturing variability, and thus importing noise effect to SSTA is straightforward, though handling within-cycle voltage variation requires a modification. We therefore can perform SSTA covering both the process and voltage variation in a unified manner. The proposed method has a possibility to give a new sign-off criteria that considers both manufacturing and voltage variation, though there remains several points to study. Temporal discretization. Dividing a clock cycle into time spans.
PROPOSED STATISTICAL MODELING OF POWER SUPPLY NOISE
This section explains the proposed modeling of power supply noise. From now, we assume distributions of power supply voltage are Gaussian or can be transformed to Gaussian by variable transformation techniques. We thus use PCA as an orthogonalization method in this paper. We experimentally demonstrate nonGaussianity of the distribution is not significant in Section 5. Note that even when distribution of power supply noise is far from Gaussian, the basic concept of the proposed method works by using ICA instead of PCA similar to [3] .
Spatial and temporal discretization
Power supply noise varies continuously in space and time, and rigidly speaking, every cell has different noise waveform. However, observation points of power supply noise are limited because of cost, and the number of points is much reduced by clustering cells. We first set up observation points by discretizing a chip spatially. We also discretize power supply variation within a clock cycle temporally. We then assign a random variable to each time span at each spatial grid.
Spatial discretization is performed by partitioning a chip/block area into a 2D grid and choosing a representative value for each divided partition. As a representative value, for example, the voltage at the center point (Fig. 5) or the average voltage in each partition is a candidate. The voltages of all nodes in the same partition are assumed to be identical. Figure 5 is an example of uniform discretization, which is widely used for manufacturing variability modeling. In the case of power supply noise, more sophisticated discretization is desirable, since power/ground voltage sometimes fluctuates locally. Fine discretization should be applied to heavily fluctuating area, whereas coarse discretization is good enough for calm area. We here explain an adaptive discretization method as an example. First, we divide a chip/block area into partitions which include only a single observation node. We then assess whether two partitions can be regarded to have the same voltage fluctuation, i.e. the differences of average and standard deviation are small enough and the correlation coefficient is large enough. When these partitions can be regarded as equivalence, we merge these partitions into a single partition. This operation continues until all primal partitions are evaluated.
Another important difference of power supply noise from manufacturing variability is its dynamic behavior. Temporal continuousness also needs to be removed. We partition a clock cycle into several time spans, and compute a representative voltage (e.g. average as shown in Fig. 6 ).
We then treat the value at every clock cycle as a different sample. Figure 6 shows an example when the voltage at position (x, y) is divided into three time spans and its random variables are denoted as Vx,y,1, Vx,y,2 and Vx,y,3. The number of time spans is determined according to the modeling requirement, i.e. when we need to accurately model dynamic variation within a clock cycle, the number of spans should be increased, otherwise a few spans are sufficient.
Variable transformation with orthogonalization
Given a set of variables, we translate the variables and derive a compact statistical model with Gaussianization and orthogonalization.
Gaussianization
The first step of the variable transformation is to improve Gaussianity of the variables. This step can be skipped when the supply voltage distribution can be reasonably treated as Gaussian. A famous transformation to improve Gaussianity is Box-Cox transformation [14] . There are several transformation equations of BoxCox transformation, and the equation we use in this paper is expressed as follows.
where z is the original variable,ẑ is the transformed variable and Λ is a parameter. In our modeling, z corresponds to a variable of power supply noise Vx,y,t. The optimum Λ that maximizes Gaussianity is computed for every variable Vx,y,t individually by maximum likelihood procedure, and is given to SSTA.
Orthogonalization by PCA
PCA maps a given set of correlated random variables to a new set of uncorrelated random variables, which are called principal components (PCs). Given a variance-covariance matrix, PCA transforms the variable zi into Eq. (2), where λj is the jth largest eigenvalue, eij is the element of the jth eigenvector which corresponds to zi, μi is the average of zi, and σi is the standard deviation of zi. k is the number of PCs and pcj is the jth principal component. Principal component pcj is expressed as Eq. (4), which is a linear summation of n original variables of zi. The principal components are random variables mutually uncorrelated with each other, which eases computation of correlation significantly in SSTA[1]. Moreover, zi is often approximated as Eq. (3) with the reduced number of PCs k (k < k), when the original variables of zi are correlated. When Box-Cox transformation is applied to zi beforehand, we just replace zi withẑi in Eqs. (2), (3) and (4).
Computational complexity
Let m and n denote the number of samples and variables respectively. The optimal Λ for Box-Cox transformation in Eq. . Consequently, the total cost of the variable transformation is O`n 3´. The complexity is not low, but the variable transformation is performed only once before SSTA, and hence this computational cost is expected to be acceptable, which is similar to other SSTA methods [1, 2] . Table 1 shows the execution time of PCA implemented in R[15] on a computer with Opteron processor 2.4GHz and 16GB memory. Even in the case that PCA cost is not acceptable, for example, the number of variables is larger than 10k, the modeling chip region can be reasonably reduced, because power voltage variation has a property of locality [16] .
(1) is derived by likelihood function, and its complexity is O(m). The transformation of all n variables requires the effort of O(mn). On

SSTA WITH STATISTICAL MODEL OF POWER SUPPLY NOISE
This section discusses the application of the statistical model of power supply noise to SSTA. The proposed model is applicable to both path-based and block-based SSTA.
Equation (5) is a common gate delay model in a canonical form that is widely used in SSTA implementation. We adopt this form, because this form realizes fundamental sum and max operations in SSTA efficiently as long as the variables are Gaussian[1].
Here, ai,j is a sensitivity coefficient associated with pcj. The power and ground level difference between drivers and a receiver affects the switching delay of the receiver, as reported in references (e.g. [9, 17] ). Figure 7 explains the level difference between a driver and a receiver. Suppose the receiver is placed at (x, y) grid and switching in (t) time span. Similarly, the driver is placed at (x l , y l ) grid and switching in (t l ) time span. VDDr/VSSr is the supply/ground voltage of the receiver side at (x, y) grid in (t) time span. Similarly, V DDd l /V SSd l is the supply/ground voltage of the lth driver side at (x l , y l ) grid in (t l ) time span. To consider the level difference, we use the following canonical delay form.
The first and second terms in RHS of Eq. (7) correspond to the delay variation due to the voltage variation at the receiver. The other terms mean the delay variation caused by the voltage variation at the driver. In multiple-input cells, there are several inputs. Even the voltages of stable (not switching) inputs affect the propagation delay [17] , and hence we sum up the terms with respect to every voltage variable at drivers. When the Box-Cox transformation is applied, VDD and VSS are translated intoVDD andVSS. ∂d/∂V is the sensitivity of the delay toV , and is easily computed from ∂d/∂V and the derivative of Eq. (1). The form of Eq. (6) is compatible with Eq. (5), and hence we can easily take manufacturing variability and power supply noise into consideration in the same manner.
Unlike the process variation, the proposed method needs a special consideration. In the case of spatial discretization, a grid, i.e. a variable parameter, is assigned to a gate definitely. However, in the case of temporal discretization, the correspondence to a variable is sometimes obscure, because a switching transition may occur at the boundary of temporal division. Furthermore, when the temporal division is rough, i.e. the number of time span is small, the voltage difference between successive two time spans is large, which may cause a large timing estimation error near the boundary. In order to mitigate this error, we revise a weighted-average calculation to cope with a case that input and output transition timings of a gate are included in different time spans. Let tI and tO represent input and output transition timings, where each time belongs to Span #(m) and Span #(m + 1) respectively. First, we estimate tO with the use of μr m which is the average delay in Span #(m), that is tO = tI + μr m . Using these values, average μ r and the coefficient of Eq. (6), a r,j , are recalculated by
where ΔtI is the time from tI to the boundary time, ΔtO is the time from the boundary time to tO, μr m+1 is the average delay of Span #(m + 1), ar m,j and ar m+1 ,j are the coefficients of Eq. (6) in Span #(m) and Span #(m + 1), respectively.
EXPERIMENTAL RESULTS
This section demonstrates experimental results. We first validate the statistical modeling of power supply noise, and then verify the accuracy of the proposed timing analysis.
Experimental conditions
For constructing the proposed model of power supply noise, we use an FPU circuit and a tiny64 processor [13] , as noise generators. These circuits were synthesized by a commercial logic synthesizer and placed and routed by a commercial tool with a 90nm standard cell library. The circuit sizes are 39k and 20k gates respectively. We attached a power/ground network shown in Fig. 8 to each noise generator circuit and simulated the power supply noise. A flip chip package with bump connections is assumed. Input vectors of 2000 clock cycles are given to each circuit. The simulation results are used for PCA including correlation matrix calculation. Please note that other methods for power noise estimation can be used, though a fast circuit simulator is used in this paper. We implement block-based SSTA and Monte Carlo simulation in C++ and perform them for ISCAS85 benchmark circuits, a 64-bit multiplier, an ALU circuit for vector operation and an H-tree for clock distribution on a computer with Opteron processor 2.4GHz and 16GB memory. These circuits except H-tree were synthesized, placed and routed by commercial tools. In the H-tree, a single path is selected and its jitter is evaluated. The power supply noise of the FPU circuit or the tiny64 processor described above is given to the benchmark circuits.
Validation of statistical modeling of power supply noise
Distribution of power supply noise
We here show a distribution of power supply voltage as an example. We choose a distribution of power supply noise that is relatively far from Gaussian (Fig. 9) , whereas a large portion of variables are close to Gaussian. Figure 10 is the normal probability plot of Fig. 9 . In the case of a normal distribution, all dots are placed on the diagonal line. When the dots are far from the diagonal line, the distribution is much different from Gaussian. In Fig. 10 , many dots are not on the diagonal line, which means the distribution is different from Gaussian, as shown in Fig. 9 .
On the other hand, the variable transformed by Box-Cox transformation approaches Gaussian (Fig. 11) . In the normal probability plot of Fig. 12 , the dots are closely placed to the diagonal line, which means the Gaussianity is much improved. We experimentally reveal that SSTA results are accurate compared with Monte Carlo analysis, which will be discussed later in Section 5.3. We thus conclude that orthogonalization with PCA for power supply noise is reasonable.
Variable reduction rate
When the correlation between random variables is high, the original distribution can be reproduced with a small number of PCs. This section discusses how many PCs can be reduced. When we reduce the number of PCs, a metric called cumulative proportion is used [18] . The cumulative proportion is expressed as where n is the number of the variables. As the cumulative proportion approaches 1, the original distribution is well reproduced. Figure 13 shows the proportion of the first PC (i.e. cumulative proportion1) when the division number is changed. The solid line shows the relationship between the spatial division number and the proportion of variance in the case that the temporal division within a cycle is not executed. The strongly-correlated variables allow the first PC to keep high proportion. On the other hand, the broken line gives the result when the temporal division number varies while keeping the spatial division unchanged. The increase of temporal division number does not affect the proportion very much, because the parasitic capacitor in the chip smooths power supply noise and strengthens temporal correlation. Furthermore, if intentional decoupling capacitance is inserted, the spatial and temporal correlation of power supply noise becomes strong, and the modeling efficiency improves further. Power noise also has correlation with ground noise. Therefore, even when the number of variables is very large, a small number of PCs can achieve high cumulative proportion. Let us show an example. Suppose the spatial and temporal division number of the difference between power and ground are 10×10 and 10 respectively. We examine the number of PCs whose cumulative proportion exceeds 90%. Only six PCs are capable of attaining the target value, though the total number of variables is 1000. In this instance, more than 99% of the variables are reducible, which helps to reduce computational cost of SSTA, because the complexity is proportional to the number of principal components[1].
Adaptive spatial discretization
We show an example that the adaptive spatial discretization explained in Section 3.1 is applied to FPU. In this experiment, threshold values of average and standard deviation used for the equivalent partition checking are set to a tithe of differences between the maximum and the minimum values in the whole area, and the threshold of correlation coefficient is set to 0.9. Figure 14 shows the result of the adaptive discretization. In this case the number of divided area is 142. The region where voltage is fluctuating locally is discretized finely. If all the area is divided with the finest resolution, the division number becomes 840. As mentioned in Section 3.2.3, the complexity of PCA is O(n 3 ), and hence the variable reduction from 840 to 142 corresponds to up to over 200x cost reduction of PCA.
SSTA results for power supply noise
We first verify the accuracy of the proposed timing analysis method. In this experiment, the numbers of spatial and temporal division are set to 10×10 and 10, respectively. Here, we perform Monte Carlo by using the noise information of 2000 cycles that is the same with the information given to PCA. Noisy power voltage waveforms of each cycle is are given for all cells considering the placement. The delay of each cell is calculated with the voltage value corresponding to the cell position and switching timing. With these gate delays, conventional STA is performed and the circuit delay of each cycle is obtained. Therefore, the number of Monte Carlo evaluation is 2000. The Monte Carlo result does not include the errors that originate from discretization, PCA for incomplete Gaussian distribution and SSTA operation. The results of Monte Carlo are compared to those of SSTA as ideal solutions. Table 2 lists the average and standard deviation of the delay acquired by SSTA with and without Box-Cox transformation and Monte Carlo simulation. We can see that the proposed SSTA with and without Box-Cox transformation estimates the timing accurately. The estimation error of the average delay is 0.465% and that of the standard deviation is 14.4%. The accuracy improvement due to Box-Cox transformation is not significant, but it reduces the estimation error of standard deviation from 14.4% to 12.7%. The effect of Box-Cox transformation is limited, because most variables are originally close to Gaussian. When the noise information of tiny64 processor is given, the errors of average and standard deviation are 0.566% and 19.7% respectively. The proposed method helps designers to quantitatively know how the circuit delay fluctuates depending on input vectors in a systematic way. The Monte Carlo results show the worst-case delay does not always occur when power/ground noise is maximum. In circuit c1355, even when the supply voltage, which is averaged temporally within a clock cycle and spatially within a block area, is the minimum, the circuit delay is not the largest. This situation indeed corresponds to the case of the 970th longest circuit delay among 2000 evaluated cycles. Thus, finding the maximum power/ground noise is not sufficient for timing verification. Table 3 shows the relation between the number of PCs (cumulative proportion) and delay estimation accuracy for 64-bit multiplier. The spatial and temporal division number are 10×10 and 10 respectively and the noise generator is tiny64 processor. In this case, the result with only eight PCs is very close to that with all PCs, which enables considerable variable reduction. The CPU time is reduced from 11800ms to 205ms by 98.3%.
SSTA result both for power supply noise and manufacturing variability
We finally demonstrate that the proposed method estimates delay distribution considering both dynamic power supply noise and static manufacturing variability in a unified manner. In this experiment, threshold voltage (Vth) is fluctuated. Its variation consists of a spatially correlated constituent and a random fluctuation constituent. As for the spatial correlation, we assume that the correlation coefficient of Vth is given by a function f (x) = e −2x , where xmm is the distance between two gates [19] . We suppose that the magnitudes of both variational components are the same and the total standard deviation is 25mV, which is a typical value in a 90nm process [19] . For the sake of simplicity, intra-gate fluctuation is not considered in this experiment. We also assume that manufacturing variability and power supply noise are uncorrelated in this experiment, though this mutual dependence is analyzed in [20] . The mutual correlation can be modeled by PCA in nature, as long as we can obtain the statistical data. Here, this experiment aims to demonstrate the feasibility that the proposed method can cope with manufacturing variability and power supply noise in a unified approach. Figure 15 shows the delay distribution of a 64-bit multiplier in the case that the spatial and temporal division numbers are set 10×10 and 10 respectively. The power supply noise of FPU is given. The difference of two distributions at 50% cumulative density is 4ps, and it is a quite small error, which means the proposed method well copes with both variabilities. If the timing margin 3σ is set for each variation individually, total margin becomes 142.7ps. However, simultaneous consideration of the variations by the proposed method reduces timing margin to 104.8ps. This result indicates a possibility that the proposed method gives a new sign-off criteria both considering manufacturing and supply voltage fluctuation, though several studies are needed before applying it to a practical design.
CONCLUSION
In this paper, we proposed SSTA considering dynamic power supply noise with orthogonalization technique. We confirmed that dynamic power/ground noise could be modeled statistically with PCA though the distribution of power supply voltage was not rigidly Gaussian. The experiments showed that the proposed method estimated delay variation due to power supply noise accurately. We experimentally demonstrated that a small number of principal components obtained by PCA were capable of accurate delay estimation thanks to spatial and temporal correlation of power supply noise.
ACKNOWLEDGEMENT
This work is supported in part by Semiconductor Technology Academic Research Center (STARC), New Energy and Industrial
