Abstract-Antenna effect may damage gate oxides during a plasma-based fabrication process. The antenna ratio of total exposed antenna area to total gate oxide area is directly related to the amount of damage. Jumper insertion is a common technique applied at routing and post-layout stages to avoid and to fix the problems caused by the antenna effect. This paper presents an optimal algorithm for jumper insertion under the ratio upper bound. It handles Steiner trees with obstacles. The algorithm is based on dynamic programming while working on free trees. The time complexity is O(α|V | 2 ) and the space complexity is O(|V | 2 ), where |V | is the number of nodes in the routing tree and α is a factor depending on how to find a nonblocked position on a wire for a jumper.
I. INTRODUCTION
A NTENNA EFFECT is a phenomenon in very large scale integration (VLSI) fabrication where a current caused by a plasma process flows through the gate oxides and damages them. It reduces both manufacturing yield and product reliability, which are among the most important issues with the rapid scaling down of the VLSI feature sizes. The relationship between the amount of damage and the antenna ratio has been studied for a long time [3] - [5] . Generally speaking, a wire segment acting as an antenna may collect a charging current when exposed to plasma. If the segment only connects to the gate oxides but not diffusions, a voltage potential may build up and a discharging current tunneling through the gate oxides will form under the potential and damage the gate oxides. The damage can be observed via the drift of threshold voltage; the spatial variations of plasma stress across the wafer may cause variations in the threshold voltages with spatial correlations. Besides the processing parameters, which are constant, the antenna ratio of the total exposed antenna area to the total gate oxide area determines the amount of voltage potential and, thus, the damage: A smaller antenna ratio results in less damage. Antenna rules are commonly enforced as upper bounds on the antenna ratio in the design rules [6] .
Correcting the antenna problem after the placement and routing stage is feasible and effective [7] - [9] . If a wire segment violates the antenna rule, either jumpers are inserted to break the wire segment or a protection diode is added to the unused area of the chip and connected to the wire segment to form a low impedance discharging path. However, as indicated by the works in [1] and [2] , a correction after the placement and routing may not be effective or even possible beyond the 0.13-µm technology. The main reason is that with the scaling down of the VLSI feature sizes, the antenna rules become more stringent. The number of wire segments violating the rules and, thus, the numbers of the jumpers and the protection diodes increase dramatically, which is not practical, considering the limitation of the unused chip area and the routing resource. Therefore, considering the antenna effect in an earlier stage and planning ahead are a must to avoid the problems. Two recent works [10] , [11] considered the antenna effect during the routing stage by combining the jumper insertion to a layer assignment. In these approaches, the jumper insertion guides the layer assignment algorithms by predicting the jumper positions. Assigning a wire segment to a layer has the same effect of inserting a jumper if a discharging path to the source is formed. If for some reason, e.g., limited routing resource, such layer assignment is not feasible, a jumper must be inserted. Two more works [12] , [13] focused on the jumper insertion problem itself. The latter one provided an exact algorithm to solve it in general routing trees with obstacles. In the works in [10] , [12] , and [13] , a random discharging model was used and the upper bound on the total exposed antenna area connected to the gate oxides was controlled. In [11] , although the same bound was controlled, Wu et al. pointed out that when multiple sinks are presented, the algorithm will be conservative and not optimal. More specifically, since there may be multiple gate oxides connected to one wire segment, the discharging current is "shared" among those gates. Using a small upper bound on the total exposed antenna area would overconstrain the routing, while using a large upper bound on the total exposed antenna area would result in a huge number of wire segments violating the antenna ratio rule. In addition, when gates are commonly sized for performance and power consumption during design, determining a proper upper bound on the total exposed antenna area could be more difficult. Therefore, the upper bound on antenna ratio, which is the most important rule for the antenna effect, should be directly addressed to improve the accuracy in both antenna planning in routing and antenna fixing in post-layout stages.
In this paper, we present an optimal algorithm to solve the jumper insertion problem under the upper bound of the antenna ratio. Our algorithm handles general routing trees, i.e., Steiner trees, as well as obstacles. Since we directly consider the ratio upper bound, we get a better result for antenna avoidance. We first formulate the RatioJI problem that models the jumper insertion under ratio upper bound. Then, we solve it via dynamic programming by the RatioPart algorithm. The time complexity is O(α|V | 2 ) and the space complexity is O(|V | 2 ), where |V | is the number of the nodes in the routing tree and α is a factor depending on how to find a nonblocked position on a wire for a jumper. In our experiments with and without obstacles, we have an α equal to 1. Moreover, our dynamic programming algorithm works on free trees. It is different from the classical dynamic programming approaches in the VLSI CAD area [17] , [18] , which work on rooted trees. We believe that the general framework is valuable for other problems on free trees and that it is possible to exploit this characteristic to reduce the practical running time via heuristics that determine the order for performing operations dynamically.
The rest of this paper is organized as follows. In Section II, backgrounds on antenna effect are introduced. The RatioJI problem is formulated in Section III. An algorithm to solve the problem is presented in Section IV. After the experimental results given in Section V, Section VI concludes this paper.
II. ANTENNA EFFECT
A detailed overview of the antenna effect can be found in the work of Shin and Hu [3] . We briefly introduce the backgrounds here.
VLSI chips are commonly fabricated layer by layer. Since interconnect networks consist of wires and vias on and between different layers, wire segments are formed during the fabrication. Some of them connect to gate inputs, which are gate oxides, and others connect to gate outputs, which are diffusions. During radio frequency (RF) plasma processes, the exposed wire segments act as antennas. They collect ion and electron currents from the plasma. Since a wire segment also acts as a capacitor, if the two currents do not cancel each other through every RF cycle and the wire segment does not connect to a diffusion, charging on the capacitor happens and an antenna voltage V g will build up. When the wire segment only connects to the gate oxides, V g reaches a steady state when the FowlerNordheim (FN) tunneling current through the gate oxides balances the plasma caused charging current. In such a steady state, V g is also the gate voltage on all the gate oxides connected to the wire segment. The charging current density J p is a function of the antenna voltage, and the tunneling current density J FN is a function of the gate voltage when the technology parameters are given as constants. Since both the antenna voltage and the gate voltage are V g , the following equation shows the relationship between the antenna ratio and the current densities:
antenna ratio = total exposed antenna area total gate oxide area
As plotted in [3, Fig. 12 ], the charging current decreases with the increase of V g , but the tunneling current, which damages the gate oxide, increases with the increasing of V g . A higher antenna ratio means a larger V g and, thus, a larger J FN and more damage.
The methods to compute the exposed antenna area are different for different plasma-based manufacturing processes. According to the work of Shirota et al. [7] , there are three types. The first is the conductor layer pattern etching process where the perimeters of the wires are exposed. Therefore, the exposed area is computed as the perimeter length of conductor layer patterns. The second is the ashing process where the area of the wires is exposed. Thus, the exposed area is the area of the conductor layer patterns. The third is the contact etching process where the area of the contacts on the lower conductor layer is exposed. Therefore, the exposed area is the total area of the contacts. Our problem formulation is general enough to handle all these types.
III. PROBLEM FORMULATION
Considering the jumper insertion on a general routing tree for an upper bound R on the antenna ratio, let T = (V, E) be the routing tree, where V is the set of nodes representing the gates as well as the Steiner points and E is the set of tree edges representing the wires connecting those nodes. A function g is defined on every node v ∈ V . If v represents a gate, g(v) > 0 gives the gate oxide area; if v does not represent a gate but a Steiner point, let g(v) = 0. A function l is defined on an edge segment s: l(s) ≥ 0 is the exposed antenna area of s.
Jumpers will be inserted on edges to form cuts. For an edge e = (u, v) ∈ E, a cut c e divides e into edge segments. The size of the cut c e , written as |c e |, is defined as the number of the jumpers inserted. Since there is no gate within the edge, it is obvious that at most two jumpers are enough to satisfy the antenna rule. Thus, c e is written as
where
. . , k + 1 are the edge segments and |c e | = k. It is not always possible to insert a jumper at any position. Various reasons introduce obstacles along the wire. For example, the top layers may be occupied by other nets. To model the obstacles on a particular wire, which are the positions that a jumper cannot be inserted, we use C e to denote the set of allowed cuts on an edge e. By assigning one cut to each edge in the tree, T is partitioned into connected components. The partitioning is given by the set of the cuts: C = {c e : e ∈ E}. A partitioning C is called feasible if c e ∈ C e for every e ∈ E. Obviously, the number of all the jumpers inserted is e∈E |c e |. For a connected component S, let g(S) be the total gate oxide area and l(S) be the total exposed antenna area. A feasible partitioning is valid if for every connected component S,
The reason is that, according to Section II, for all the gate oxides in a specific connected component, the tunneling current densities are the same since the gate voltages are all equal to the antenna voltage. Intuitively, the plasma charging current is "shared" among those gates. A connected component can survive larger total exposed antenna area if the total gate oxide area is larger. Note that it is possible to have no valid partitioning because of the obstacles.
Since inserting the jumpers will degrade the performance and manufacturing yield, it is preferred to find the minimal number of jumpers to meet a specific ratio upper bound. The corresponding partitionings are called the optimal partitionings. The optimal jumper insertion under the ratio upper bound is formulated as the following RatioJI problem.
Problem 1 (RatioJI): Suppose that T = (V, E) is a tree, functions g and l model the gate oxide area and the exposed antenna area, respectively, and set C e models the obstacles on a wire e where jumpers cannot be inserted. For a ratio upper bound R, find an optimal partitioning, which is a partitioning C with c e ∈ C e , ∀e ∈ E and a minimal number of jumpers such that for a connected component S, either the total gate oxide area g(S) is 0 or the ratio of total exposed antenna area to total gate oxide area l(S)/g(S) ≤ R.
The RatioJI problem models both antenna fixing and antenna planning under various antenna ratio models. For antenna fixing after the layer assignment, the problem can be solved at each layer to meet the antenna ratio rule on that layer with proper parameters. For antenna planning in routing, the layer assignment can be obtained in an algorithm similar to those in [10] and [11] by predicting the jumper positions via solving the RatioJI problem. Although the predictions may be inaccurate, the previous works [10] , [11] showed that the number of jumpers required for antenna fixing later could be reduced.
Two commonly used models of antenna ratio are the partial antenna ratio (PAR) and cumulative antenna ratio (CAR) [11] . In PAR, the antenna ratio bound is applied to the antenna ratio computed at a specific layer for a specific wire segment at a time. Different layers may have different bounds. The RatioJI problem models the PAR by constructing T to represent the part of a routing tree, including that wire segment and all the interconnects and gates connected to it in lower layers which are already fabricated. Other parameters are chosen accordingly. In CAR, the antenna ratio is defined as the ratio of the cumulative antenna area to its connected gate area. The RatioJI problem models the CAR by constructing T to represent the cumulative antenna area and its connected gates and l to represent the antenna area contributing to the cumulative antenna area.
IV. OPTIMAL JUMPER INSERTION

A. Algorithm Overview
We solve the RatioJI problem by dynamic programming. Classical dynamic programming algorithms usually work on a rooted tree [17] , [18] . However, a routing tree T is a free tree without a root. Our algorithm will process the edges one by one in some order. We briefly describe our algorithm here, while the definitions of terminologies and the details of algorithm will be presented in later sections.
Our algorithm consists of two stages. In the first stage, dominant solutions are generated bottom-up. At any step, the edges are divided into two groups: processed or unprocessed. Dominant partial solutions are maintained for every tree in the forest formed by connecting all nodes via the processed edges. At the beginning, no edge is processed. Thus, every node is a tree by itself and has only one partial solution. The algorithm processes one edge at a time by combining two trees with an unprocessed edge and computing the dominant partial solutions of the new tree based on those of the two old trees. An edge can be processed only if it is the only unprocessed edge incident to one tree. An example is shown in Fig. 1 . Starting from the process in Fig. 1(a) , there are three possible edges to be processed next, as shown in Fig. 1(b)-(d) . Note that the edge (e, h) cannot be processed at the current step. When every edge is processed at the end, the forest contains one tree, which is T itself, and all the dominant solutions are generated.
In the second stage, since there is one optimal partitioning among the dominant solutions if there exists a valid partitioning, we can either find an optimal partitioning or report that there is no valid partitioning. A top-down backtracking will construct the optimal partitioning if there is one.
B. Dominant Solutions
Denote the forest formed by connecting all nodes via the processed edges by F . Suppose there are m trees in F , written as T 1 , T 2 , . . . , T m . There are m − 1 unprocessed edges currently. When m > 1, every T i has at least one unprocessed edge that connects to it because T is connected. Since an edge can be processed only if it is the only unprocessed edge to one tree, each tree T i can only have one node u i incident on the unprocessed edges.
For a tree T i , partial solutions are the partitionings of T i with the following properties. For every component S not containing u i , the ratio upper bound must be satisfied, i.e., either g(S) = 0 or l(S)/g(S) ≤ R. Denote the component containing u i by S i . If there is at least one unprocessed edge incident on u i , the upper bound can be violated in S i , which may be corrected when future combinations are executed. For a partition of T i , let n be the number of the jumpers inserted. Define
The partial solution is represented by the triple (n, x, I ). To ease the representation, we refer to the partial solution by the triple when there is no ambiguity.
A dominant relationship is defined among the partial solutions such that only necessary partial solutions are maintained for each tree T i . For two different partial solutions P 1 = (n 1 , x 1 , I 1 ) and P 2 = (n 2 , x 2 , I 2 ) of one tree, we say that
The reason is that a partial solution with smaller n and x can always substitute another one of the same I in any optimal partitioning of T .
If there are no other partial solutions dominating (n, x, I), we call (n,
For each n ∈ {0, 1, . . . , 2N i − 2} and I ∈ {0, 1}, only one dominant partial solution (n, x, I) is maintained. Thus, there are at most
When there is only one tree in the forest, i.e., m = 1, we call the dominant partial solutions of the tree dominant solutions. Note that the set of dominant solutions could be different with different orders to process the edges.
C. Generation of Dominant Solutions
In this section, we consider how to generate solutions for a new tree by assimilating one tree into another. Assume that there are more than one tree in the forest, i.e., m > 1. Consider an unprocessed edge e = (u, v) connecting two trees T u and T v in the forest F . We process the edge e and combine T u and T v only when e is the sole unprocessed edge incident on u. Denote the new tree made up of T u , T v , and e by T v . The dominant partial solutions of tree T v are generated from those of the trees T u and T v , as stated in the following lemma.
Lemma 2: Proof: We proof the lemma by examining each case. 
1) When c e = (u, v), it is straightforward that
We prove I v = 0 by contradiction. If I v = 0, we have I v = 1 and
The (n, x, I) is dominated by the partial solution in case 1), which violates our assumption. Therefore, I v should be 0 and c e should be the element in C e satisfying x u + l(u, p 1 ) ≤ 0 with the minimal l(p 1 , v) . If I u = 0, we have g(S) = 0. Therefore, c e should be the element in C e with the minimal l(p 1 , v). 
Therefore, c e should be the element in C e satisfying x u + l(u, p 1 ) ≤ 0 with the minimal l(p 2 , v). If I u = 0, we have g(S) = 0. Therefore, c e should be the element in C e with the minimal l(p 2 , v).
Suppose D v and D u are the sets of the dominant partial solutions of T v and T u , respectively. We design the Combine subroutine, as shown in Fig. 2 , based on Lemma 2. The Combine subroutine updates D v to be the set of the dominant partial solutions of T v . A set B is stored for every dominant partial solution to enable the backtracking which is used to construct the optimal partitioning later. Assume that the invariant holds on line 7. Denote the tree containing u by T u and v by T v , respectively. Denote the tree after processing e and combining T u and T v by T v . We only need to prove that the invariant holds for T v when the current loop finishes, since all the other trees and their dominant partial solutions remain unchanged.
Since all the unprocessed edges connecting to T u are incident on u, the edge e is the only unprocessed edge connecting to T u . After e is processed, there is no unprocessed edge connecting to T u . Thus, all the unprocessed edges connecting to T v are incident on v.
We prove that the D * in the Combine subroutine contains all the dominant partial solutions of T v by contradiction. Assume this is not the case. Suppose the dominant partial solution (n , x , I ) ∈ D * , and it consists of some partial solution 
, and I u = I u . Therefore, denoting the partial solution consisting of (n v , x v , I v ), (n u , x u , I u ), and c e by (n, x, I), we have n ≤ n , x ≤ x , and I = I . Since (n , x , I ) is a dominant partial solution, it must be true that n = n , x = x , and I = I . On the other hand, according to Lemma 2, either (n, x, I) ∈ D * or there exists some partial solution in D * dominating (n, x, I). This contradicts that (n , x , I ) ∈ D * is a dominant partial solution. Therefore, the invariant holds for T v after the current loop.
The code on lines 11 to 16 searches for an optimal partitioning in the set D w . It is guaranteed to find one if there is one, which is stated in the following lemma.
Lemma 4: If there is a valid partitioning of T , then an optimal partitioning is presented in the dominant solutions D w .
Proof: If there is a valid partitioning, there must be an optimal partitioning. Suppose the partial solution (n, x, I) of T w is the optimal partitioning with the minimal x. We prove (n, x, I) ∈ D w by contradiction. According to Lemma 3, D w has all the dominant partial solutions of T w . If (n, x, I) ∈ D w , then there exists (n , x , I ) ∈ D w dominating (n, x, I ). Therefore, n ≤ n, x ≤ x, and I = I. Thus, (n , x , I ) is also valid. Since (n, x, I) is the optimal partitioning with the minimal x, we should have n ≥ n and x ≥ x. Therefore, n = n , x = x , and I = I , which contradicts that (n , x , I ) dominating (n, x, I).
On line 15, (n, x, I) is the triple of the optimal partitioning, and the set B contains information to reconstruct the optimal partitioning. The subroutine ReportPart, as shown in Fig. 4 , takes B as the parameter and recursively reports the optimal partitioning. The search on line 3 of the ReportPart subroutine takes a constant time since we implement D u as an array, and the position of element with the triple (n u , x u , I u ) in the array is stored.
The correctness of the algorithm is given by the following theorem.
Theorem 1: The RatioPart algorithm terminates in finite time and gives an optimal partitioning if there is a valid one.
Proof: The While loop on lines 6 to 9 terminates because the number of unprocessed edges is limited and decreases by one each time. The ReportPart subroutine terminates because the number of edges is limited, and the cut on each edge is reported exactly once. It is straightforward that the other parts of the RatioPart algorithm terminate so the whole RatioPart algorithm terminates.
A valid partitioning is a partial solution (n, x, I) of T w such that either I = 0 or (I = 1) ∧ (x ≤ 0). By searching all such partial solution in D w , the RatioPart algorithm gives an optimal partitioning if there is a valid one according to Lemma 4. In the current implementation, the order of the edges to be processed is determined statically in the RatioPart algorithm for simplicity. From an arbitrary node, a depth-first search (DFS) [19] on the routing tree is performed. Define the finishing time of edge (u, v) to be the finishing time of v where we assume that u is discovered prior to v. The edges are ordered according to their finishing times. For example, by a DFS of the tree in Fig. 1 starting from the node e, the nodes are discovered in the order e, g, a, b, h, c, d, i, and f and finished in the order a, b, g, c, d , h, f , i, and e. Thus, the order of the edges is (a, g), (f, i) , and (i, e). Note that it is possible to dynamically determine which edge to be processed next when there are multiple choices like the situation in Fig. 1 . Although the set of the dominant solutions may be different for different choices, Theorem 1 ensures that an optimal partitioning will always be found if there is one. A good heuristic may benefit both the running time and the storage requirement. We leave this as a direction of future research.
E. Complexity of the RatioPart Algorithm
The space complexity of the RatioPart algorithm is stated in the following theorem. 2 ). To evaluate the time complexity, we assume that both functions g and l take a constant time to compute. The time complexity is stated in the following theorem.
Theorem 3: The time complexity of the RatioPart algorithm is O(α|V | 2 ), where α is a factor depending on how to find a nonblocked position on a wire for a jumper.
Proof: We use the potential method [19] to evaluate the total running time of the Combine subroutine when it is used in the RatioPart algorithm. In the Combine subroutine, suppose line 5 takes O(α) time, where α is a factor depending on how to find the element in C e according to Lemma 2, i.e., how to find a nonblocked position on a wire for a jumper. For a forest F , define its potential to be
where N i is the number of the nodes in the tree T i . Suppose a forest F is obtained after applying the Combine subroutine to process edge (u, v) and combine T v and T u . Denoting the running time for this Combine run by t, we have
Since initially the potential is (A/2)α|V | and finally the potential is (A/2)α|V | 2 , the Combine subroutine takes at most
time when it is used in the RatioPart algorithm. In the ReportPart subroutine, every D u is searched, at most, once in the recursion. Therefore, line 16 in the RatioPart algorithm takes O(|V |) time.
All the other parts in the RatioPart algorithm take at most O(|V | 2 ) time. Therefore, the time complexity is O(α|V | 2 ). 
V. EXPERIMENTS
The experiments are conducted on a Linux workstation with dual 933-MHz Pentium III processors and 512-MB memory where the RatioPart algorithm is implemented in C++ and compiled with GCC 3.4. Since there is no previous work considering the upper bound of the antenna ratio with multiple sinks, we do not compare our results to others.
Two sets of benchmarks are used in the experiments. The benchmarks in the first set are four randomly generated ones containing 100, 1000, 10 000, and 20 000 gates, respectively, where the gates are randomly located in a 10 000 by 10 000 grid. The benchmarks in the second set are the four largest industrial benchmarks in the work [16] . For each benchmark, a Steiner tree is constructed as the routing tree using the algorithm in [15] . All the gate sizes are assumed to be one, and the exposed antenna area of a wire is computed as the wire length, which is the Manhattan distance between the two nodes it connects. The statistics of the benchmarks are shown in Table I . The column "name" shows the name of each benchmark where the benchmarks with names starting with "a" are the random generated ones and the benchmarks with names starting with "n" are the industrial ones. The column "# gates" shows the number of the gates in each benchmark. The column "# nodes" shows the number of the nodes on the routing tree, including the gates and the Steiner points. The column "SMT" shows the total wire length of the Steiner tree. The column "ratio" shows the antenna ratio of the routing tree when all the wires are exposed. For any antenna ratio upper bound larger than the number in this column, no jumper is needed.
We first run the RatioPart algorithm without obstacles. For each benchmark, six upper bounds of the antenna ratio are chosen representatively according to the "ratio" column in Table I . The results are reported in Table II . The "ratio" columns show the upper bounds used. The "# cuts" columns show the number of the jumpers inserted. The "time(s)" columns show the running time in seconds. It can be seen that the practical running times depend not only on the number of the nodes but also on the bound since different bounds will result in different numbers of partial dominant solutions to maintain on each tree. In Fig. 5 , the trend of the decrease of the running time with the increase of the antenna ratio is shown by plotting the number of jumpers inserted and the running time in seconds versus different bounds for the benchmark a10 000. On the other hand, the quadratic theoretical time complexity can be verified in Fig. 6 . There is one sample for each benchmark in the figure. Each sample is the one with the smallest bound among the six bounds for one benchmark. The slope of the best fitting line in the log − log plot shows a practical running time of Θ(|V | 1.85 ). We then run the RatioPart algorithm with obstacles. We randomly choose edges to be forbidden, which means that no jumper insertion is allowed on them. For each benchmark, we choose one representative bound, which is the second largest one among the six bounds in Table II . Scenarios with different percentages of forbidden edges are tested. The results are reported in Table III . The column "% ob" shows the percentage of the forbidden edges. The cells with "−" denote that no valid partitioning under the upper bound can be found with the obstacles presented. Note that the obstacle setting here is only for simplicity while the RatioPart algorithm can handle more general settings, e.g., the one in the work of Su et al. [13] where the obstacles can forbid inserting jumpers on parts of an edge.
VI. CONCLUSION
In this paper, we presented an optimal algorithm for antenna avoidance via jumper insertion under the upper bound of the antenna ratio. The algorithm is based on the dynamic programming on free trees. The experimental results confirmed the effectiveness of our approach. Future works include routing with antenna planning and heuristics for practical running time reduction. He is an Assistant Professor of electrical engineering and computer science with Northwestern University, Evanston, IL. Before he joined the faculty of Northwestern University, he was with the Advanced Technology Group, Synopsys, Inc. His research interests include very large scale integrated computeraided design, algorithm design, and formal methods.
Dr. Zhou served on the technical program committees of many conferences on very large scale integration (VLSI) and computer-aided design such as ACM Design Automation Conference and IEEE International Conference on Computer-Aided Design. He was a recipient of the CAREER Award from the National Science Foundation in 2003.
