I. INTRODUCTION
As very large scale integration (VLSI) technology advances with decreasing feature size as well as increasing operating frequency, inductive effects of on-chip interconnects become increasingly significant in terms of delay variations, degradation of signal integrity, and aggravation of signal crosstalk [1] , [2] . Since inductance is defined with respect to the closed current loop, the loop-inductance extraction needs to simultaneously specify both the signal net current and its returned current. To avoid the difficulty of determining the path of the returned current, the partial element equivalent circuit (PEEC) model [3] can be Manuscript received March 7, 2003 ; revised June 3, 2004 . This paper is partially supported by the National Science Foundation (NSF) CAREER award CCR-0401682, SRC grant 1100, a UC MICRO grant sponsored by Analog Devices, Fujitsu Laboratories of America, Intel, and LSI Logic, and a Faculty Partner Award by IBM. The authors used computers donated by Intel. This paper was recommended by Associate Editor C.-J. R. Shi.
The authors are with the Electrical Engineering Department, University of California at Los Angeles, Los Angeles, CA 90095 USA (e-mail: hy255@ee.ucla.edu; lhe@ee.ucla.edu).
Digital Object Identifier 10.1109/TCAD. 2005.850820 used, where each conductor forms a virtual loop with the infinity and the partial inductance is extracted.
To accurately model inductive interconnects in the high frequency region, resistor-inductor-capacitor-mutual inductance (RLCM) networks under the PEEC formulation are generated from discretized conductors by volume decomposition according to the skin depth and longitudinal segmentation according to the wavelength at the maximum operating frequency. The extraction based on this approach [4] - [6] has high accuracy but typically results in a huge RLCM network with densely coupled partial inductance matrix L. A dense inductively coupled network sacrifices the sparsity of the circuit matrix and slows down the circuit simulation or makes the simulation infeasible. Because the primary complexity is due to the dense inductive coupling, efficient yet accurate inductance sparsification becomes a need for extraction and simulation of inductive interconnects in the high-speed circuit design.
Because the partial inductance matrix in the PEEC model is not diagonal dominant, simply truncating off-diagonal elements leads to negative eigenvalues and the truncated matrix loses passivity [7] . There are several inductance sparsification methods proposed with guaranteed passivity. The return-limited inductance model [8] assumes that the current for a signal wire returns from its nearest power/ground (P/G) wires. This model loses accuracy when the P/G grid is sparsely distributed. The shift truncation model [9] calculates a sparse inductance matrix by assuming that the current returns from a shell with shell radius r 0 . But it is difficult to determine the shell radius to obtain the desired accuracy. Because the inverse of the inductance matrix called K element (susceptance) matrix is strictly diagonal dominant, off-diagonal elements can be truncated without affecting the passivity [10] , [11] . Because K is a new circuit element not included in a conventional circuit simulator such as SPICE, new circuit analysis tools considering K have been developed [12] , [13] . Alternatively, double-inversion-based approaches have been proposed in [11] and [14] . Using the control volume to extract adjacently coupled effective resistances to model inductive effects, the vector potential equivalent circuit (VPEC) model is recently introduced [15] . Its sparsified and SPICE-compatible circuit model is obtained based on a locality assumption that the coupling under the VPEC model exists only between adjacent wire filaments.
This paper presents an in-depth study on the VPEC model. The authors find that the locality assumption in [15] does not hold in general and its integration-based extraction becomes impractical for largesized interconnects as it requires to optimize the size of the control volume for each filament. The authors rigorously derive an accurate full VPEC model considering the coupling between any pair of filaments by inverting the partial inductance matrix. The authors further prove that the resulting circuit matrix for the full VPEC model is passive and strictly diagonal dominant. The diagonal dominance enables truncating small-valued off-diagonal elements to obtain a sparsified VPEC model named truncated VPEC (tVPEC) model with guaranteed passivity. To avoid inverting the entire inductance matrix, the authors also present another sparsified VPEC model with preserved passivity, the windowed VPEC (wVPEC) model, by inverting a number of inductance submatrices. Both full and sparsified VPEC models are SPICE compatible.
The rest of this paper is organized as follows. Section II introduces an accurate inversion-based VPEC model with detailed derivation in the Appendix. The resulting full VPEC model considers coupling between any pair of filaments. In contrast, the VPEC model in [15] is integration based, localized but not accurate in general. In Section III, the authors prove that the effective resistance matrixĜ in the full VPEC model is passive and strictly diagonal dominant. Section IV presents a truncation-based sparsification that leverages the passivity of theĜ matrix. It 0278-0070/$20.00 © 2005 IEEE TABLE I  TABLE OF NOTATIONS truncates small-valued off-diagonal elements of theĜ matrix obtained from the full inversion of the inductance matrix. In Section V, the authors further present a more efficient sparsification approach based on windowing. It avoids inverting the full inductance matrix and is more efficient and more accurate compared to the truncation-based sparsification. Section VI further presents the scalability of the runtime and model size for the sparsified VPEC, full VPEC, and PEEC model. Finally, Section VII concludes the paper.
II. INVERSION-BASED FULL VPEC MODEL
The VPEC model from [15] considering coupling only between adjacent filaments can be called the localized VPEC model. In this section, the authors first derive the system equation of the full VPEC to model the inductive effect between any pair of filaments and then show that the localization assumption in [15] does not hold in general. They then introduce the inversion-based method to calculate the full VPEC model. Finally, experiments are presented to show that the full VPEC is as accurate as the PEEC model.
A. Full VPEC
Same as in FastHenry [5] with the magneto-quasi-static assumption, the conductor can be divided into a number of rectilinear filaments. The current density is constant over the cross section of the filament. In this paper, the authors use superscripts x; y; z to denote spatial components of a vector variable. Let A be the vector potential determined by the distribution of the current density J. Then J k and A k are the components in the k direction (k = x; y; z). The authors further use the subscript i for variables associated with filament ai (i 2 N), and every filament a i has a length l by adequately discretizing in the k direction. Table I summarizes the notations used in this paper with detailed definitions in the Appendix. To extract the VPEC, the integration-based approach in [15] needs to determine the localized flux B k ij (j 2 n i ), where n i is the set of filaments adjacent to ai. The explicit calculation of B k ij is hard, and only considering the localized flux as in [15] loses the accuracy. The integration-based VPEC model in [15] needs to use a control volume for each filament, but no method was presented in [15] to find an accurate control volume. To avoid explicitly calculating B k ij and using the locality assumption, the authors derive a full VPEC model and then present an inversion-based extraction in Section II-B. The detailed derivation of the full VPEC model is presented in the Appendix, where the authors obtain two Kirchhoff's current law (KCL) and Kirchhoff's current law (KVL) equations
where the vector potential current and voltage are related to the electrical branch current and voltage bŷ
Clearly, we can see the physical meaning of the effective resistance by (1) : given a unit current change at the ith filament, the vector potential observed at the jth filament is exactlyR k ij when all other filaments are connected to vector potential ground. Furthermore, (2) describes the relation between the vector potential and its corresponding electrical voltage drop caused by the inductive effect.
The authors present a SPICE-compatible VPEC model for three filaments in Fig. 1 . The model consists of two blocks: the electrical circuit (PEEC resistance and capacitance) and the magnetic circuit (VPEC effective resistance and unit inductance). They are connected by controlled sources. It includes the following components. Although the number of magnetic circuit blocks increases with more filaments, sparsified VPEC models will be introduced in Sections III-V to greatly reduce coupling resistances in magnetic circuit blocks with preserved passivity. Moreover, because the VPEC model largely reduces reactive elements (i.e., inductance) and its effective resistance is less densely stamped in the modified nodal analysis (MNA) matrix compared to the partial inductance under the PEEC model, the full VPEC model reduces the simulation time compared to the PEEC model (see experiments in Section VI).
Note that the summation in KCL (1) for the full VPEC model is carried out over each pair of filaments. In contrast, this summation in [15] is carried out only for adjacent filaments. Pacelli [15] (4) Equation (4) means that the conducting current I(x;y; z) is locally related to the flux of the electrical field E(x;y; z) (0r) on the surface S. Equation (4) is correct because electrons only locally transport in the conductor. However, for the magnetic coupling problem, the flux B k ij is caused by the magnetic field that is not localized. Therefore, the authors still need nonlocal resistances to accurately model the long-range effect of inductance. Hence, the KCL (1) in this paper is related to not only the localizedR k ij ( j 2 n i ) but also all otherR k ij ( j 6 = i, j 2 N ).
The experimental results below will show that compared to the PEEC model, the full VPEC model considering all filaments is accurate, but not the localized VPEC model from [15] . 
B. VPEC via PEEC Inversion
Due to the difficulty of explicitly determining B k ij , there is no efficient calculation method for effective resistances in [15] . In this part, the authors will derive the circuit-level system equation based on the VPEC effective resistance matrixĜ and then present an efficient method to calculate effective resistances from the inversion of the partial inductance matrix.
The authors take the time derivative at both sides of (1) and then use (2) to replace the time derivative of the vector potential. Consequently, the authors obtain
The authors define the circuit matrixĜ of the VPEC model aŝ
Then, the system equations can be rewritten aŝ
Compared to the system equation based on the inductance matrix L k
The authors find thatĜ k and S k only differ by a geometrical factor l
Therefore, starting with the L matrix under the PEEC model, the authors first obtain the inverse of L and then haveR under the VPEC model the extraction formulâ
Because the major computation effort is the inversion of the L matrix, the authors call this method as an inversion-based VPEC model, where effective resistances are obtained from a unique relation (10) . This inversion-based VPEC model leverages the existing PEEC extractor. In contrast, the localized VPEC model is integration based and needs to explicitly calculate the local flux B k ij ( j 2 ni) from scratch, where its accuracy is sensitive to the size of the control volume during the integration [15] . Therefore, it has a high accuracy only for a few number of filaments and needs to optimize the size of the control volume for each filament when system has a large number of filaments. Clearly, it becomes impractical for the full-chip extraction. Note that the above (7)- (10) can be used to derive the K element (susceptance) based model in [10] and [11] from first principles. Although the K element method and VPEC are both derived from the inverse of L, the VPEC model is realized quite differently from the K element: 1) the VPEC model is SPICE compatible but the K element needs to introduce the new circuit element to the simulator; and 2) the current K element simulator is based on nodal analysis [13] , where the admittance form of the K element is 0 = A l L 01 A T l =s (A l is the incident matrix for the inductance) in the frequency domain. Clearly, the 0 matrix becomes indefinite when s ! 0. Therefore, it will lose correct dc information. On the other hand, the VPEC model can be stamped in the MNA matrix with the correct dc information in both frequency and time-domain simulations [16] , which enables the correct circuit reduction [17] to further reduce the model order.
C. Accuracy Comparisons
In this part, the authors use the aligned parallel bus to compare the PEEC model with the full VPEC model and localized VPEC model.
1) PEEC Model Extraction:
The experiment setting is illustrated as follows. The authors assume the copper conductor ( = 1:7 2 10 08 m) and low-k ( = 2) dielectric. The conductor is volume discretized according to skin depth and longitudinal segmented by onetenth of the wavelength. Capacitance is extracted by a lookup table [18] interpolated from FastCap [4] . Because capacitive coupling is a short-range effect, only adjacent couplings are considered. Partial inductance is extracted by FastHenry [5] at 10 GHz (as the maximum operating frequency), where each wire segment is modeled by one filament, and coupling between any pair of segments (including segments in a same line) is considered. The authors then generate the distributed type RLCM circuit under the PEEC model. Furthermore, interconnect drivers and receivers are modeled by the resistance R d = 120 and the loading capacitance C L = 10 fF. All circuit models are simulated by HSPICE on a SUN Ultra-5 workstation. Note that the same experiment setting is applied for all other experiments in this paper unless specified otherwise.
2) Simulation of Aligned Parallel Bus:
A 5-bit bus with one segment per line is considered. Each bus line is 1000 m long, 1 m wide, and 1 m thick. The space between lines is 2 m. With the extracted RLCM parameters under the PEEC model, the authors further construct the full VPEC model (with couplingRij between all bits) from this paper and the localized VPEC model (with couplingR ij between adjacent bits) from [15] . They measured responses at the far-end of all five bits and compared waveforms of the second bit in Fig. 2 Note that when the number of conductors is small, there is no simulation speedup observed. However, for larger sized interconnect examples in Sections IV-VI, the simulation time of the full VPEC model (without sparsification) is less than that for the PEEC model. The detailed analysis of complexity scaling can be found in Section VI. Furthermore, in the comparison, the authors did not use the implementation of the localized model from [15] , which depends on the height of the integration box (i.e., the size of the controlled volume) and is an approximated solution without a method to find the accurate size of the controlled volume. Instead, the authors find an accurate full VPEC model and then only keep the adjacently coupled resistances to obtain an accurate localized VPEC model. 1 
III. INDUCTANCE SPARSIFICATION UNDER THE VPEC MODEL
This section first derives the magnetic energy under the VPEC model and then proves that the circuit matrixĜ under the VPEC model is positive definite. Moreover, the authors prove thatĜ is also strictly diagonal dominant. This property enables the passivity preserved matrix sparsification methods. Finally, the authors present the sparsification flow for the VPEC model.
A. Magnetic Energy in VPEC Model
Generally, the magnetic energy is given by the space integral [19] 
1 Based on the communication with [15] , the localized VPEC model used in this paper has a similar accuracy compared to the one used in [15] .
For the full VPEC model, (11) can be rewritten to
Furthermore, when the KCL (31) is rewritten in terms of theĜ matrix
the authors have for the magnetic energy under the full VPEC model the relation
It is proven below that theĜ matrix is positive definite.
B. Property of theĜ Matrix
Theorem 1: The circuit matrixĜ k (k = x; y; z) in the VPEC model is positive definite. BecauseĜ only differs from the K matrix by a positive geometric constant, the proof of the matrix property (passivity and strict diagonal dominance) for K is equivalent forĜ. The existing proofs in [10] and [13] are based on the analogy [L] = [C 01 ], which holds when [C] [L] = constant. However, this relation does not hold in general as shown in [20] . Below, the authors present a direct proof for the VPEC model.
Proof: According to (14) , because the energy u k m (k = x; y; z)
is positive, it automatically results in a positive definite matrixĜ k [21] .
Therefore, the corresponding VPEC model is passive. However, to further guarantee a passive model after truncating small-valued offdiagonal elements from the original positive definite matrix, the authors will prove that the matrixĜ is strictly diagonal dominant [21] , i.e., where the induced current I k j is determined by the coupling flux between ai and aj . Equation (1) can be further rewritten tô
The positiveness of the ground resistanceR k i0 can be easily proved in a similar fashion. With this lemma, the authors can further prove the following theorem. 
That is, the circuit matrixĜ is strictly diagonal dominant. Note that truncating small off-diagonal entries from a strictly diagonal dominant matrix still leads to a positive definite matrix, i.e., a passive circuit model [21] . Based on Theorem 2, such a truncation-based sparsification still leads to passive circuit models. Intuitively, truncating small off-diagonal entries in theĜ matrix (equivalent to truncating larger off-diagonal entries inR matrix) results in ignoring larger resistors in the equivalent resistance network. Because larger resistors are less sensitive to and also contribute less to current change, the resulting sparsified model can still have a good waveform accuracy as presented in Section IV. Moreover, the proof assumes that wires can be decomposed into short wires with similar length. Therefore, in the experiments, the authors always segment wires to one-tenth of the maximum operating frequency when wire lengths are different (see spiral inductor in Section V).
C. VPEC-Based Inductance Sparsification
With Theorem 2, the authors present the flow below for the inductance sparsification based on the VPEC model.
• Generate the partial inductance matrix L by FastHenry or the formula from [22] and [23] .
• (Option 1: tVPEC model) Invert the full L matrix to obtain matrix G,R, and full VPEC model, and then generate the sparsified VPEC model by truncating the full VPEC model. • (Option 2: wVPEC model) Find a sparse approximated inverse matrix S 0 of L to obtainĜ 0 ,R 0 , and the sparsified VPEC model simultaneously. Note that during the inductance extraction at low frequency, the authors assume that each wire segment is modeled by one filament. When the frequency is beyond 10 GHz, the volume filament [5] or conduction mode [24] based decomposition can be applied to consider the skin and proximity effects. In this paper, the authors use the three-dimensional (3-D) frequency-dependent solver FastHenry [5] to accurately extract the partial inductance matrix. Because inductance has weak dependence on geometry, the formula-based [23] or lookup table-based [25] approaches can also be applied to efficiently obtain the full-chip inductance.
As further discussed in Sections VI and V, the authors will apply two sparsifications that depend on the scale of the interconnect: 1) when the scale of interconnect is small (less than 1000 wires), the direct LU or Cholesky factorization-based inversion is sufficiently efficient 
], and the authors can apply simple truncation-based sparsifications; 2) when the scale of interconnect is large, the authors extend a window-based extraction [11] to obtain a sparse approximated inverse of L and simultaneously extract a sparsified VPEC model. It reduces the computation expense to O(N b
3 ), where b is the size of the window. As shown by experiments, the wVPEC model also reduces the error introduced by the sparsification when compared to the tVPEC model.
IV. tVPEC MODEL
This section presents the tVPEC model. After the full inversion of L, the authors obtain a strictly diagonal dominant matrixĜ. As explained in Section III, its small-valued off-diagonal elements can be truncated without loss of passivity. The authors present two truncating approaches below: the geometrical VPEC (gtVPEC) and numerical VPEC (ntVPEC) truncations. The first one is applicable to the aligned parallel bus, and the second is applicable to conductors of any shapes.
A. Geometrical Truncation
For the aligned parallel bus, the authors can define a truncating window (NW , NL) for each wire segment, where NW and NL are the numbers of coupled segments in the directions of wire width and length, respectively. The coupling along the wire length is the forward coupling, and the one along the wire width is the aligned coupling. Because of the symmetry introduced by aligning and paralleling, each wire segment will have the same sized truncating window. As a result, the tVPEC model only containsRij within the truncating window for each wire segment and is called gtVPEC in Table II. The authors consider a 32-bit bus with eight segments per line and four differently sized truncating windows, (32, 8), (32, 2), (16, 2) , and (8, 2), and summarize the experiment setting and result in Table II . Clearly, there is a smooth tradeoff between runtime and accuracy for different truncating window sizes, where the average voltage differences and associated standard deviations are calculated for all time steps in SPICE simulation. The authors first compare results of different truncating windows. The truncating window (8, 2) achieves the highest speedup of 302 and the largest difference of about 0.2 mV on average, less than 2% of the noise peak, and the truncating window (32, 2) has the highest accuracy with 0.06 mV on average but a reduced speedup of 102. Furthermore, the small difference between windows (32, 8) and (32, 2) implies that the forward couplings between nonadjacent segments are negligible. However, an N W larger than N L (as shown in Table II ) is needed to achieve a high accuracy. This implies that the aligned coupling is stronger than the forward coupling and considering only the adjacent aligned coupling may lead to a large error.
B. Numerical Truncation
For numerical truncation, the authors define the coupling strength as the ratio of an off-diagonal element to its correspondent diagonal element at each row ofĜ. They then truncate those off-diagonal elements with coupling strength smaller than a specified threshold. Fig. 3 plots simulation results under the numerical sparsification for the nonaligned parallel bus with 128 bits and one segment per line. The sparse factor is the ratio between the numbers of circuit elements in the truncated and full VPEC models. The waveform difference is small in terms of the noise peak for sparse factors up to 30.5%. Table III summarizes the truncation setting and simulation result, where values in parentheses of column 1 are truncating thresholds, and the runtime includes both SPICE simulation and matrix inversion in case of VPEC models. One can see from the table that up to 302 speedup is achieved when the average waveform differences is up to 0.377 mV, less than 1% of the noise peak. A larger speedup factor can be expected as a higher waveform difference can be tolerated in practice. The authors also compare the full VPEC model and the PEEC model. The full VPEC simulation is 7 times faster and has a negligible waveform difference.
V. wVPEC MODEL
The sparsification in Section IV needs full matrix inversion and becomes computationally expensive for the large-sized interconnect. Furthermore, the directly truncated matrixĜ may be not accurate enough to represent the original fullĜ matrix. As shown in [11] , all entries of the L inverse matrix can be approximately reconstructed from entries of the submatrices in L corresponding to the coupling window of the active aggressor. Using this windowing technique, the authors further present two wVPEC models based on the geometry (gwVPEC) and numerical value (nwVPEC), respectively.
A. Geometrical Windowing
Owing to the regularity of the aligned parallel bus, a coupling window with uniform size b for each wire can also be defined. For each wire in turn as the aggressor, the authors first construct N small submatrices (coupling windows) all with size b, then invert each submatrix and build a sparse approximated inverse for L. It is described in details in the following two steps. n is always negative with sufficient discretizations [13] . The circuit matrix of a stable system needs (19) As a result, the authors can construct the sparsified yet passive VPEC model based on (10) with the sparse approximated inverse S 0 . The authors first compare the extraction efficiency of geometrical tVPEC and wVPEC models in Fig. 4 by extracting the VPEC model for buses up to 2000 bits. The size for geometrical truncation is (NW ; NL ) = (8; 1) and for the geometrical wVPEC is b = 8. The extraction time for a truncation-based method includes the full inversion and truncation. When the scale of interconnects is small (below 128 bits), the truncation-based method is actually as efficient as the window-based method. But when the scale of interconnects becomes larger, the window-based extraction is faster than the truncation-based approach with a 902 speedup for the 2048-bit bus (8.6 s versus 543.1 Fig. 4 . Extraction time comparisons of bus lines with one segment each line using geometrical tVPEC and wVPEC models. s). Therefore, it is more efficient to apply the winnowed VPEC model for large-scale interconnects.
The authors further compare the waveforms between gtVPEC and ntVPEC models for the 128-bit bus with one segment per line. The authors apply a pulse voltage at the first bit and observe the transient responses at the far ends of second and 64th bit bus, respectively. The window size is b = 32 for the gwVPEC model and (N W ; N L ) = (32; 1) is set for the gtVPEC model to achieve the same sparsification ratio. Fig. 5 shows waveforms of the PEEC model, full VPEC model, gtVPEC model, and gwVPEC model. Compared to accurate models such as the PEEC model and full VPEC model, the gwVPEC and gtVPEC models have virtually no error at the second bit bus. But for the 64th bit, the gtVPEC model has nonnegligible error but the gwVPEC model still has a high accuracy. More accuracy comparisons are presented in Table IV by changing the window size b to 64, 32, 16, and 8, where the waveform difference is calculated from responses at far ends of the 64th bit between the sparsified VPEC and PEEC models. The authors find that the window-based extraction has 22 higher accuracy than the truncation-based approach on average. It is due to the fact that the matrix element generated from windowing is interpolated with other elements in the inductance matrix.
B. Numerical Windowing
The coupling window usually has different sizes for different wires because the general layout does not have the regularity as the aligned parallel bus. Therefore, the authors first need to determine the size of coupling window for each wire. Similar to the numerical truncation approach, the authors define the coupling strength as the ratio of an offdiagonal element to its correspondent diagonal element at each row of the L matrix. The authors then construct the coupling window by only considering those off-diagonal elements that have coupling strength larger than a specified threshold value. With the heuristic selection (18), the authors can build the sparse approximated inverse and consequently a sparsified VPEC model.
In an example shown in Fig. 6 , the authors extract the partial inductance by FastHenry at 10 GHz for a three-turn spiral inductor on the lossy substrate. The heavily doped substrate modeled as lossy ground plane with = 1:0 2 10 05 m. The metal is volume discretized according to skin depth and longitudinal segmented by one-tenth of the wavelength, resulting in 92 segments. The substrate is also discretized according to [26] , and its contribution (Eddy current loss) is lumped to the segmented conductor on top of the substrate. 
VI. COMPLEXITY SCALING OF VPEC MODELS
The authors further compare runtimes and model size by extracting and simulating a number of aligned parallel buses using the PEEC model, full VPEC model, and gwVPEC model (b = 8), respectively. The runtime for the full or wVPEC model includes both VPEC extraction and SPICE simulation times. The model size refers to the file size of the resulting SPICE netlists. The authors plot the runtime versus the bus size in Fig. 8(a) and the model size versus the bus size in Fig. 8(b) .
Due to the additional introduced circuit elements, the size of SPICE netlist for the full VPEC model is around 10% larger than the full PEEC model on average. Further, when the scale of wire number is small, there is no runtime speedup observed for the full VPEC model. However, when the scale of wire becomes larger (greater than 64 bits), the runtime of the full VPEC model is found to be 10 times faster than the PEEC model on average. For the 256-bit bus, the full VPEC model is 472 (185.39 s versus 8726.85 s) faster than the PEEC model. This is due to the fact that: 1) its resultant network has fewer reactive elements (i.e. inductances); and 2) its resultant MNA matrix in SPICE is sparser than the direct inductance formulation. SPICE converges faster with fewer time derivatives and integrals, and its internal sparse solver is more efficient for a less dense matrix.
Moreover, both PEEC and full VPEC models can only handle a bus circuit with up to 256 bits due to memory limitations. On the other hand, the gwVPEC model (b = 8) can handle a larger size of up to thousand bits. Moreover, it is easy to see that the scalability of the gwVPEC models shows a slow increase with respect to the increase of the bus line numbers. For example, it achieves over 10002 (9.71 s versus 8726.85 s) speedup for a 256-bit bus in runtime compared to the PEEC model. In all the simulation, the wVPEC model has a very small waveform difference (less than 3%) in terms of delay when compared to the PEEC model.
VII. CONCLUSION
Using the equivalent resistance network and controlled voltage and current sources to replace the inductance network, the authors develop the full VPEC model that is as accurate as the PEEC model but takes less simulation time. Although the full VPEC model has a slightly higher circuit complexity compared to the PEEC model, SPICE can handle the VPEC model more efficiently because the VPEC model has fewer reactive elements (i.e., inductances) and the modified nodal admittance matrix becomes sparser under VPEC model than that under the PEEC model.
Moreover, the resulting circuit matrixĜ for the equivalent resistance network in the full VPEC model is passive and strictly diagonal dominant. This enables truncation-based sparsification methods with guaranteed passivity. The authors have presented the truncation-based method and have achieved orders of magnitude speedup in circuit simulation with small errors compared to the PEEC model. Furthermore, the window-based extraction method has been developed to avoid the full inductance matrix inversion and can obtain a higher accuracy compared to the truncation-based approach. The authors have also shown that the matrixĜ can be used to justify the K element or susceptance-based simulation [10] - [14] from first principles. Note that SPICE is able to directly simulate the VPEC model but not the K element-based model.
The primary contribution of this paper is to derive the inversionbased full VPEC model for multiple inductive interconnects and illustrate how to build sparsified VPEC for SPICE simulation with guaranteed passivity. To further reduce the complexity of the resulting sparsified VPEC models, the authors intend to develop model order reduction for the VPEC model.
APPENDIX
To model the inductive effect, the authors start with differential Maxwell equations in terms of A [19] 
To construct the system equation in the form of the integral equation, the authors apply the volume and line integration to (20) and (21), respectively. For filament a i , when (20) of fault coverage. Among the factors contributing to the complexity of this problem [3] are: 1) the difficulty in controlling and observing deeply buried flip-flops (FFs); 2) the need to work with a model consisting of an iterative array of time-frames whose number is, in the worst case, an exponential function (4 n ) of the number of FFs in the circuit (n); 3) the existence of illegal states, which may cause the ATPG algorithm to waste a lot of time trying to justify them; and 4) the existence of untestable faults, which require the ATPG algorithm to complete an exhaustive search before they are identified as untestable.
Several different approaches have been proposed for sequential circuit test generation, including deterministic techniques, genetic techniques, symbolic techniques, and various combinations of the three [4] - [15] . Deterministic algorithms tend to be highly complex and time consuming [4] - [6] . As an example, the HITEC test generator [6] uses both forward and reverse time processing to generate a test for a target fault. During test generation, each target fault must be excited and the fault effects propagated to a primary output (PO), either in the same time-frame in which the fault is excited or in a subsequent time-frame. The required state must then be justified. If conflicts are found, backtracking is required, and alternative decisions must be made. Genetic algorithm (GA) based approaches simplify test generation by processing in the forward direction only [8] - [12] . Populations of candidate test sequences are evolved over several generations, with more highly fit sequences propagating from one generation to the next. Fitness is determined based on fault detection capabilities, amount of circuit activity induced, and other factors related to detecting the target faults. GA-based techniques have been especially effective for data-dominant circuits. Symbolic techniques [7] , [14] based on binary decision diagrams (BDDs) have also been used very effectively on control-dominant circuits. Very high fault coverages have been achieved, although the size of the circuit that can be handled is limited.
Various design-for-testability (DFT) techniques have also been proposed to ease the test generation process and improve the fault coverage for sequential circuits, including full scan design [16] , [17] , partial scan design [18] - [21] , clock control [22] , [23] , and probe point insertion [24] , [25] . With full scan design, all flip-flops are arranged in a chain when a circuit is being tested, and node values at the flip-flops are scanned in before each test and scanned out after each test. Complete controllability and observability are thus provided at the flip-flops, reducing the complexity of the test generation process to that for a combinational circuit. In partial scan design, only a subset of flip-flops are placed in the scan chain. Partial scan design is a cost effective alternative; delay and area penalties are reduced, and test application time may be reduced as well. However, the remaining circuit is still sequential, so a sequential circuit test generator is needed. With clock control, flip-flops in a circuit are divided into groups, and each group is controlled by a single clock signal, thus reducing the correlation between flip-flops and increasing the number of reachable states. By using probe points, the observability of some internal lines, including the outputs of flip-flops and any other lines, can be enhanced.
In this paper, we propose two DFT techniques based on clock partitioning and clock freezing, selective clock freezing and CLOCK-WAVE, to ease the test generation process for sequential circuits and two opportunistic ATPG algorithms that take advantage of the DFT techniques and detect most faults using combinational techniques [26] 
