With increasing chip size and the number of data bits, many long global lines run in parallel in the same layer.
quickly with increasing spacing, the nearest neighboring lines (first neighbors) will see the most of the charge excited by the aggressor. Other farther neighboring lines (second and the higher order neighbors) contribute minimally to capacitance coupling. In this sense, capacitance coupling is a 'short range' effect, and we only need to include the first neighbors into delay and noise calculation. 
Introduction
As VLSI design enters Deep-Sub-Micron (DSM) era, scaled feature sizes, an increased number of metal layers, and larger chip dimensions have helped to achieve faster, more complex, and more powerful design 113. Chip operation frequency nowadays is pushed into Gigahertz region with much shorter signal rise time. To accommodate this technology trend, designers must be more careful in handling interconnect delay and cross-talk noise, especially in critical path timing analysis, to reach higher clock frequency. In design automation, interconnect is usually modeled as a resistance-capacitance netlist for timing and noise prediction [2-41, because line inductance is negligible under low clock frequency. When the clock frequency approaches one Gigahertz, the impedance contributed by line inductance (wL) will be comparable to the line resistance [5-61. The inclusion of inductance not only slows down the signal delay, but also causes voltage overshoot and reduces rise time, which can introduce large crosstalk noise on neighboring lines [7j.
b. Three major paths for inductive current return:
(1) +cc; (2) In contrast to the RC case, inductance coupling excites an induced current that needs to find a return path to form the current loop. Unfortunately, with the receiver's gate capacitance (C,.,,,,,,,) loaded at the end of wire, there is no DC path for the return current directly back to the ground. The orthogonal layer cannot be the ground plane anymore because L, is zero for two perpendicular lines. The retum current must go through C,, C,, or Crecrever of the co-planar signal lines until it goes back to AC ground or hits the power and ground lines, which can shield further coupling, as shown in Fig. 1 b. Because the loop impedances of each of these paths are comparable, there is no dominant route for inductive current return. Furthermore, all higher neighbors should be taken into consideration because the mutual inductance Lm decays slowly with increasing spacing . Therefore, inductance coupling is a 'long range' issue and the full inductance matrix is necessary for a correct RLC model representation [lo] .
Because of the long range coupling characteristics of inductance coupling, accurate estimation of delay and noise for RLC lines is usually a time consuming task. For example, if there are N signal lines, we must include NC,, (N+l)C,, and a (N+2)x(N+2) partial inductance (L,T and Lm) matrix into simulation for correct results. For this reason, a full partial inductance representation is physically accurate but computationally costly. Particularly in determining the operating frequency of a chip, current EDA tools usually check a specific single line chain on critical path and examine the worst-case delay for repeater insertion and timing optimization. Since the operating frequency affects the entire chip and must be checked globally, use of a full netlist would be very computationally intensive. For this purpose, it would be preferable to develop an algorithm to reduce the multiple line condition to a single line model. This equivalent single line model should be able to take the neighboring line effects into account for delay estimation while greatly improving efficiency.
In the conventional RC line model, due to short-range capacitance coupling, this simplification is realized by introducing a switch factor to effectively model first order coupling capacitances. The specific line can then be decoupled from the others by using an effective capacitance Ce~(C,+2qC,), as shown in Fig. 2a . Depending on the switching pattern, q is between ( 0 , 2) for a step input or {-1, 3 ) for a ramp input [ll] . However, for the RLC line model, all neighboring lines between the same powerlground lines should be included, thus increasing the decoupling complexity. In the following sections, we will first introduce the effective loop inductance (Le8) concept to simplify the multiple RLC line problem into a single RL,f&,ff model as shown in Fig. 2b , and then explain a practical approach to model 15, which can handle the extra delay and noise caused by inductance. Applications to repeater insertion and critical path delay optimization will also be discussed. 
Effective Loop Inductance Modeling

Effective Loop Inductance Lefl
Partial inductances L, and L, can accurately model the RLC line behavior, but they involve too many elements for simulation. With the knowledge that the induced current forms a loop and that all the inductance effects can be reflected by the loop RLC elements response, a loop inductance (Ll0,,J can be used to simplify the problem [7] . Using the correct L/<,,,,, value to replace L,5, we can then isolate the target line out for further analysis. Therefore, an accurate 4, model is essential for efficient timing analysis. For the multiple line-only case, RaphaelTM RI3 can directly simulate the value of L,,,, [12, 131. However in realistic design with a receiver gate loading the line end, there is no DC path for short connection. Simple L,,,op estimation from RI3 can cause a large error [14] . Fig. 3 shows an example of two identical 3mm coupling RLC lines. We use 0.18pm A1 interconnect technology in this example and in all following simulations. One line is excited by a loops rising time ramp input while the other is quiet. To match the output waveform, the actual Llc,(,p value is more than 500% of the simple RI3 result. line width w, line length 1, line spacing s, the number of signal lines between power/ground N , driver strength wdrrver, receiver strength wrrcelver, and signal rise time before the driver t,,,. For such a large number of variables, we need to make some fairly realistic assumptions to simply the L, modeling work:
. All the signal lines are parallel to each other and have the same dimensions, spacing and drivedreceiver sizes. This assumption is usually met for global lines.
. Wire thickness is a technology dependent parameter and it is fixed for the corresponding metal layer. 8 The ratio of the power/ground line width of the signal line width is fixed. Their spacing is the same.
8 Signal delay is defined as the delay between the input point of source drivers and the end of the segment it drives (i.e. the input point of the sink receivers).
. The central signal line is selected as the worst delay and noise line on critical path due to symmetry.
. A non-inverting drivedreceiver is used and the total width of NMOS and PMOS in the final stage inverter is used to represent the drivedreceiver strength. 9 Cen and 7 are maintained the same as RC decoupling case for two reasons: 1) Such a delay estimation is neither too optimistic nor too pessimistic. 2) It is fair for RC/RLC comparison and addresses the extra delay and noise purely due to the inclusion of Lefl
Worst Case Input Vector
In addition to geometrical variables, the input vector for multiple signal lines also significantly affects delay and noise amplitude on central signal lines. If the simple RC model is applied, it is known that the opposite switching pattem of the 1'' neighbors to the victim line generates the worst delay and the largest noise. The reason is that when two lines switch in opposite directions, the effective coupling capacitance between them is largest [l 11. Since the higher order neighbor coupling is negligible is the RC case, only considering the 1" neighbors' switching pattem is enough. With the inclusion of the long-range inductance coupling, the complete switching pattem is necessary for worst case estimation. Based on symmetry, there are two candidates for the worst case input vector, as shown in Fig. 4 The first is when all the neighboring lines switch oppositely (Fig. 4a) . It sometimes generates the largest noise to slow down the central victim delay, especially when the lines are RC dominant [7] . But considering that the induced current actually runs in the same direction as the main switching current on the victim line, it can actually helps victim operation (Fig. 4a) . We may prefer that all the higher order neighbors, which are purely inductance coupling to the victim, switch in the same direction as the victim to suppress victim switching and that only 1" neighbors, in which capacitance coupling mostly dominates, switch oppositely. This is the second candidate, as shown in Fig. 4b . The first candidate is capacitancecoupling-prone and the second is inductance-couplingprone. The worst case pattern depends on the applied technology and RLC parameters. In a more thorough investigation, we sweep all seven variables (w, I, s, N, w,Ir,,,, wrrcrIvrr, frl,c) over a wide range for metal layer 6 (m6), metal layer 5 (m5) and metal layer 4 (m4) and reach such conclusions:
For top level interconnects (m6, m5, m4), about 80% worst cases are generated by the second switching pattern, i.e. 1" neighbors opposite-switching and all the other higher order neighbors switching the same as victim, as shown in Fig. 5 . This implies a large inductive effect on current 0.18pm technology global lines. We would choose the second input vector as our worst case switching pattern.
. Receiver capacitance is negligible compared to line coupling capacitance. Thus, we can fix wrecrIver at a reasonable value (about 40pm) and focus on the other six variables. 
Numerical Approach for Le,-, Modeling
The decoupling of multiple lines with Leg speeds up simulation despite the large number of variables and nonmonochromic output waveform. To guarantee the accuracy of delay analysis for further applications, we take a numerical approach to generate a look-up table for L, first to perform critical path timing analysis. The methodology and simulation flow are shown in Fig. 6 . Long range inductance coupling: A:; Fig. 7 shows, for various sizing global lines, effective loop inductance Leff goes up almost linearly with the number of signal lines. As equal spacing is assumed, coupling line spacing is proportional to the number of lines between them. So the linearly increasing of inductance demonstrates that mutual inductance drops slowly for higher order neighboring lines. Thus, their effects cannot be ignored, especially for larger lines, as shown in Fig. 7b. . Inductance coupling saturation: For smaller global lines, such as the lpm line in Fig. 7a , extra number of neighbors can saturate mutual inductance. Because the induced current prefers to return through the lowest impedance path, the farthest lines usually directly couple to the powerlground boundary instead of the central victim. The saturation part actually indicates the inductance coupling region in design. If designers layout powerlground shielding lines along this region (i.e. every five or six in Fig. 7a case) , crosstalkimmunity and chip area cost are optimized. N  a. w=lpm, s=0.8pm, w~,,...,=lOOp~n, 
Number of Signal Lines
Repeater Insertion with Inductance Effect
On global lines, repeaters are usually inserted between the source and sink to achieve optimal delay. Suitable repeater insertion can reduce the interconnect delay by as much as more than 50%. Thus, as modem design is moving to the Gigascale era with more than 30% extra delay introduced by inductance, it is crucial to clarify the repeater insertion map for RLC line. Although some analytical models have been reported to cope with this problem [15] . Thus, RLC line delay drops faster than RC when the wire length is shortened by repeater insertion. Fig. 10b shows the repeater insertion result for multiple lines: five signals between power/ground lines, each with 2pm width and spacing. Similar to the first example, they are all 1 Imm long. Both the RC and RL& models optimize delay with one repeater. The RC result, 0.2475ns, is smaller than that for RL,& 0.254211s. This is quite different from the single line case, but it can be attributed to the delay dependence on line length. There are two major factors affecting total effective loop inductance. The larger the total loop inductance, the longer delay we usually will have. One factor is the number of coupling lines. As Fig. 7 proves, with increasing number of neighboring lines, effective loop inductance goes up proportionally to that. That is why in Fig. 8 , the N=10 case generates a longer delay than N=7. The other factor is the coupling length between aggressors and the victim.
The longer the coupling length, the more L, contributes to Le#. As Fig. 8 indicates, delay decreases dramatically with line length. With N=7, when wire length is smaller than 5mm, even though L, over unit length is large enough, the total Le8 is small. The delay performance then merges into the RC case. Similarly, in the N=5, l l m m RLC line case, after one repeater inserted, the coupling length is reduced to about 5mm and Len is not large enough to differentiate its delay from pure RC one, as shown in Fig. 8 .
Therefore, we have same number for optimized repeater insertion, as demonstrated in Fig. 10b 
Summary and Conclusion
In this work, we develop an efficient way to handle delay and noise estimation for multiple on-chip global wires. Effective loop inductance, instead of partial inductance, directly determines the electrical characteristics of the coupling interconnect and greatly reduces the number of elements for further modeling and estimation. Thus, accurate Lcfl modeling becomes the main target. T o highlight the inductance effect for timing analysis, our model is focused on the extra delay introduced purely by inclusion of Le? By using the Le# look-up table, we studied the inductance effect on repeater insertion. For a single line, Lefl is the self inductance L,. Extra inductance for a single line causes a more linear dependence of a line delay on line length, which has a larger slope than the quadratic dependence in the RC case when line lenglh is small. As a result, fewer number of repeaters are needed for a single RLC line. For multiple lines, L, is smaller than L, and mostly affected by the mutual inductance, which depends on the number of coupling lines N and their coupling length 1. In the 0.18pm A1 technology we are studying, the same optimal number of repeaters is found for both RC and RLC models, unless Nor 1 is extremcly large (N>7 or segment length> 6").
In the future, we will address an analytical approach for L, to further improve simulation efficiency.
