Abstract| A fast and convergent iteration method for piecewise-linear analysis of nonlinear resistive circuits is presented. Most of the existing algorithms are applicable only to a limited class of circuits. In general, they are either not convergent or too slow for large circuits. The new algorithm presented in the paper is much more ecient than the existing ones and can be applied to any piecewise-linear circuit. It is based on the piecewise-linear version of the Newton-Raphson algorithm. As opposed to the Newton-Raphson method, the new algorithm is globally convergent from an arbitrary starting point. It is simple to understand and it can be easily programmed. Some numerical examples are given in order to demonstrate the eectiveness of the proposed algorithm in terms of the amount o f computation.
I. INTRODUCTION
DC analysis of nonlinear resistive circuits is one of the basic problems in the computer-aided design of electronic circuits. Various methods are available for the solution of this problem. These methods can be classied into two major groups. One is based on an iterative algorithm which is applied directly to the nonlinear circuit equations. The well-known method in this group is the Newton-Raphson method [1] { [3] . The second group is based on the piecewiselinear (PWL) analysis which has been investigated by many researchers due to its computational eciency [4] { [15] .
In the PWL analysis, a nonlinear resistive circuit can be described by f(x) = y (1) where f() is a continuous PWL mapping from R n into itself, x is a point i n R n and represents a set of chosen circuit variables and y is an arbitrary point i n R n which represents the inputs to the circuit. The operating region of every nonlinear element is divided into a nite number of segments. Hence, the space R n is divided into N linear regions bounded by h yperplanes where N is a very large number. The system of PWL equations in (1) can be expressed by the following set of linear simultaneous equations A l x + w l = y ; for l ; l = 1 ; 2 ; ; N (2) where A l is a constant n n matrix (called Jacobian matrix for convenience) and w l is a constant n-vector. They characterize the circuit in linear region l . T o nd all solutions of (1), one may solve n linear simultaneous equations in (2) for each o f N linear regions to nd x (l) and decide whether x (l) lies within the considered linear region, l . I f x ( l ) lies within l , i t i s a v alid solution. This method is conceptually simple and nds all existing solutions, but it is computationally complex. Recently, a n umber of authors have proposed various methods to decrease the number of linear regions, N, b y a sign test. One of these methods [4] requires more than O(Nn 2 ) m ultiplications. Moreover, the sign test is not a simple procedure. A more ecient method is proposed in [5] . Nishi [6] has proposed a method in which the numb e r o f m ultiplications required to nd all solutions of (2) is O(Nn ). Although the method developed in [7] seems to be the best, it is computationally impractical for large PWL circuits. For example, if the circuit contains 1000 MOS transistors each of which is modeled with 4 segments, then there are 4
1000
(approximately 10 600 ) linear regions. If the sign test requires at least one multiplication for each linear region, it will take m uch more than billions of years on today's supercomputers to nd the solutions by using these methods.
In this paper, we present a new algorithm, which w e call popcorn, shown to be more ecient than the existing algorithms of the same generality. This algorithm is globally convergent for a general class of PWL resistive circuits with no restrictions. It is simple and can be easily programmed. The method of PWL analysis of nonlinear resistive circuits is reviewed in section II. The popcorn algorithm is presented in section III. Some numerical examples are given in section IV to illustrate the eectiveness of the algorithm.
II. PIECEWISE-LINEAR ANALYSIS
In PWL analysis, the well-known technique due to Katzenelson [8] has been originally applied to the circuits with twoterminal elements which are strictly monotonic. The PWL approach w as further extended to include the resistive circuits of much broader class [9] { [15] . In particular, Fujisawa and Kuh [11] have shown that the Katzenelson's algorithm can be applied to (1) and it always converges to a solution as long as the equation has a unique solution. Fujisawa, Kuh, and Ohtsuki [12] have shown that if all the Jacobian matrix determinants detA l , l = 1 ; 2 ; ; Nin (2) have the same sign, then there exists at least one solution to the equation f(x) = y and the algorithm also converges.
This property is referred to as the sign condition. This restriction of the sign condition was later removed in the generalized Katzenelson's method [13] , [15] .
There exists also a PWL version of the Newton-Raphson method [2] . However, it is well-known that for the continuous case the Newton-Raphson method may not converge depending on the initial guess. The same situation may occur in PWL case, if the initial linear region is not close enough to the linear region of the solution. The divergence can be in the form of a cyclic repetition of two or more virtual linear regions. We h a v e observed that the PWL version of the Newton-Raphson method may not converge for some circuits, particularly with multiple solutions. We have tested this method 100 times on a 128-bit shift register circuit which contains 2580 MOS transistors using dierent initial linear regions. It has converged in only 22 trials, but the convergence speed was very high. Hence, the PWL Newton-Raphson method does not guarantee convergence, but if it does converge, it is extremely fast. We h a v e developed a new algorithm as described in the next section, by modifying the PWL Newton-Raphson method to avoid its major drawback, i.e., divergence.
III. THE POPCORN ALGORITHM
The new algorithm is described as follows: If a 0 k+1;j = a k;j then a k+1;j = a 0 k+1;j with probability 1 q Any other segment with probability q ; 0 < q < 1 If a 0 k+1;j 6 = a k;j then a k+1;j = a 0 k+1;j with probability 1 p Any other segment with probability p ; 0 < p < 1 4. Set k = k + 1. Go to step 2. A segment m a y not be chosen, albeit with a very small probability q, e v en though the present solution satises the limits of the assumed segment. If the solution does not satisfy the assumed segment, the segment in which the present solution lies is chosen with a high probability (1 p). With a small probability p any other segment i s c hosen. Here the other segments are chosen with equal likelihood. The segment selection procedure for each nonlinear device is independent of the other nonlinear devices. Note that, if p = 0 and q = 0 then the algorithm becomes identical to the PWL Newton-Raphson algorithm. For q = 0 , w e have constructed a counterexample circuit with no convergence. That circuit, shown in Fig. 1, contains 
algorithms fail for this circuit, unless the initial linear region happens to be the correct one.
The popcorn algorithm assures the convergence for any initial guess since the algorithm tries all of the linear regions eventually, u n til it converges. Having such a feature, it resembles the well-known simulated annealing algorithm without a cooling procedure [16] . The convergence proof is trivial, since the probability of visiting any linear region is nite. In the worst case, the algorithm visits all linear regions and convergence is always assured. This simple proof does not tell us how fast the algorithm converges, it merely shows that it is convergent for suciently many iterations. In each iteration, the PWL Newton-Raphson method selects a new linear region to be used in the next iteration and our algorithm makes a random perturbation on that linear region by means of the parameters p and q to prevent divergence. Obviously, these parameters should be appropriately selected to improve the speed of the algorithm. We h a v e made many experiments for dierent t ype and size of circuits by c hanging the values of p and q. The results obtained have been very encouraging as can be concluded from the numerical examples given in the following section.
IV. NUMERICAL EXAMPLES
We h a v e implemented the popcorn algorithm in C programming language and analyzed various CMOS, ECL and analog bipolar circuits. Let us describe the CMOS example circuits briey. Counter is a combinational circuit which nds the number of one's in a 128-bit input. The circuit lfsr is a linear feedback shift register which produces pseudo-random binary numbers. Sh128, is a 128-bit shift register circuit consisting of master-slave ip-ops. Pgen is a pulse generating circuit. Addcs circuit is a carry-select adder. The circuit rsync is used to produce a synchronization pulse. Add18 is an 18-bit adder circuit. Status is a 5-bit register circuit which can be loaded in series or in parallel. Sh5, is a 5-bit shift register circuit. The results given below are obtained using approximately 6500 hours of CPU time on a number of SUN Sparc-2+ workstations.
Let us dene q as equal to q times the number of nonlinear elements in a given circuit. The MOS transistors are modeled with 4 PWL segments representing the cuto, saturation, linear, and reverse saturation states. The average number of iterations, k, for the example circuits are shown in Fig. 2 as a function of p while q is kept constant at 0:005. The plots in Fig. 2 are obtained by taking the mean of more than 200 simulation results for every circuit at chosen values of p and q. The mean value does not change more than ve percent after 200 simulations have been performed. As it is seen from Fig. 2 , for all of the cir-cuits except for the combinationalcircuits such a s counter, addcs, and add18, k reaches a minimum around p = 0 : 2 and it increases sharply as the value of p goes to 0 or 0:5. For combinational circuits, k increases monotonically with p. F or 4-segment PWL MOSFET model, we can say that the parameter p can be safely set to a value between 0:1 and 0:3. The standard deviation in the required number of iterations is smaller than half of the mean in the range 0:1 p 0:3.
We h a v e also analyzed the example circuits using 9-segment PWL model for MOS transistors. In the 9-segment model, there are 4 segments in the linear region, and 2 segments each in the saturation and reverse saturation regions. The average of more than 200 simulation results for each circuit is given in Fig. 3 . It is observed that the results for both 4-segment and 9-segment models have similar characteristics. The number of iterations is approximately doubled for 9-segment model. As it is seen from Fig. 3 , for 9-segment PWL MOSFET model, the minimum occurs around p = 0 : 15 and the parameter p can be set to a value between 0:05 and 0:25.
In order to nd a suitable value for q, w e h a v e analyzed the same circuits by setting p = 0 : 2 and changing the value of q. Fig. 4 shows k as a function of q using 4-segment PWL MOSFET model. As it can be seen from Fig. 4 , for all of the circuits except for the tunnel diode circuit, k increases as q approaches unity. The tunnel diode circuit, however, needs a q value close to unity to converge quickly. Therefore, a compromising value of the parameter q can be chosen between 0:02 and 0:5. The standard deviation is not larger than half of the mean value in this range.
We h a v e c hosen some example circuits to make a p e rformance comparison between the popcorn, PWL NewtonRaphson and the Katzenelson algorithms. First, we h a v e used the circuit rsync which has multiple solutions. We have set p = 0 : 2 and q = 0 : 1 in the popcorn algorithm. The results for this circuit are given in Fig. 5 . The vertical axis in Fig. 5 represents the number of transistors which could not nd the correct segment at the corresponding iteration. When this number becomes zero, it means that the solution is found. It is seen from Fig. 5 that the PWL Newton-Raphson method and the Katzenelson algorithm fail in nding any of the multiple solutions. However, the popcorn algorithm has converged to one of the solutions in each trial. Second, we h a v e c hosen several CMOS, ECL and analog bipolar circuits. Table 1 gives k for these circuits by using three dierent algorithms. Opamp is a noninverting amplier circuit containing 741 operational amplier. The MOS transistors and BJT's are modeled with 4 segments while the diodes are modeled using 2 segments. In the popcorn algorithm, the parameter values are chosen to be p = 0 : 2 and q = 0 : 1. As it is seen from Table 1 , the speed of PWL Newton-Raphson method is better than the speed of popcorn algorithm. However, the Katzenelson algorithm is relatively slow compared to the popcorn algorithm.
V. CONCLUSIONS
An ecient algorithm for nding DC solution of large PWL resistive circuits has been proposed. The algorithm is an extension of the piecewise-linear version of the NewtonRaphson method. The main feature of our approach i s t o insert some randomness into the PWL Newton-Raphson method to guarantee convergence without sacricing the speed. The degree of randomness in the algorithm is controlled by the parameters p and q. W e h a v e found appropriate values for these parameters using the large number of trials on the example circuits. In the case of multiple DC solutions, the algorithm reaches to one of the solutions in each trial. This algorithm can also be adapted to the continuous case by modifying the Newton-Raphson algorithm. 
