Abstract-Power consumption has emerged as the premier and most constraining aspect in modern microprocessor and application-specific designs. Gate sizing has been shown to be one of the most effective methods for power (and area) reduction in CMOS digital circuits. Recently, as the feature size of logic gates (and transistors) is becoming smaller and smaller, the effect of soft-error rates caused by single-event upsets (SEUs) is becoming exponentially greater. As a consequence of technology feature size reduction, the SEU rate for typical microprocessor logic at sea level will go from one in hundred years to one every minute. Unfortunately, the gate sizing requirements of power reduction and resiliency against SEU can be contradictory. 1) We consider the effects of gate sizing on SEU and incorporate the relationship between power reduction and SEU resiliency to develop a new method for power optimization under SEU constraints. 2) Although a nonlinear programming approach is a more obvious solution, we propose a convex programming formulation that can be solved efficiently. 3) Many of the optimal existing techniques for gate sizing deal with an exponential number of paths in the circuit. We prove that it is sufficient to consider a linear number of constraints. 4) We generalize our methodology to include nonlinear delay models and leakage power as well. As an important preprocessing step, we apply statistical modeling and validation techniques to quantify the impact of fault masking on the SEU rate. Furthermore, we adapt our method to incorporate process variation and evaluate our gate sizing technique under uncertainty. We evaluate the effectiveness of our methodology on ISCAS benchmarks and show that error rates can be reduced by a factor of 100%-200% while, on average, the power reduction is simultaneously decreased by less than 6%-10%, respectively, compared to the optimal power saving with no error rate constraints.
in hundred years to one every minute [26] , resulting in a clear need for addressing the problem in systematic way. SEU faults arise from energetic particles such as neutrons from cosmic rays and alpha particles from packaging material, generating electron-hole pairs as they pass through a semiconductor device [32] . During the ICs' normal operation, these faults can be caused by electromagnetic interference. Transistor source and diffusion nodes can collect these charges, and a sufficient amount of accumulated charge may invert the state of a logic device, such as an SRAM cell, a latch, or a gate, thereby introducing a logical fault into the circuit's operation. Because this type of fault does not reflect a permanent failure of the device, it is termed soft error (SE) or transient fault (TF) .
Advances in microelectronic technology, which shrink IC size to the nanometer range while also reducing the power supply, are making electronic circuits increasingly susceptible to TFs. In fact, the reduction of the charge stored on circuit nodes, along with the decrease in noise margins, greatly increases the probability of voltage glitches temporarily altering nodes' voltage values [4] . Meanwhile, the continuous increase in ICs' operating frequencies makes the sampling of such glitches increasingly probable. Consequently, TFs will become a frequent cause of failure in many applications as the technology advances.
Power consumption has been recognized as the critical constraint in modern microprocessor and application-specific designs [5] , [17] . In addition, gate sizing has been one of the most effective methods for power minimization in CMOS digital circuits. Unfortunately, gate sizing requirements for power reduction and resiliency against SEU are contradictory. The possible tradeoffs between gate sizing and power consumption have been studied in [7] . We consider the effects of gate sizing on SEU and incorporate the relationship between power reduction and SEU resiliency, and we have developed a new method of power optimization under SEU constraints, which leverages convex programming to obtain provably optimal solutions. As an important preprocessing step and consideration, we apply statistical modeling and validation techniques.
Gate sizing is a timing optimization process in highperformance very large scale integration (VLSI) circuit design. In this design process, the size of each gate in a combinational circuit is properly tuned so that circuit area and/or overall power dissipation is minimized under specified timing constraints. Gate sizing or the similar problem of transistor sizing has been an active research topic in recent years. Many approaches have been proposed before [2] , [18] , [21] , [24] , [25] , [30] . Previous approaches that have taken power considerations into account 0278-0070/$25.00 © 2008 IEEE during transistor sizing include those in [1] , [16] , and [29] . The approach in [16] utilizes linear dependence between power and gate sizes; however, since it optimizes one path at a time, the approach may lead to suboptimal solutions.
A linear programming approach for exploring the power-delay-area tradeoff for a CMOS circuit is presented in [1] ; we use more accurate nonlinear logical effort delay models in this work. Another linear-programming-based approach is presented in [29] . Power optimization with convex programming is proposed by Menezes et al. [21] . In their work, timing constraints are constructed for every path in the circuit which can potentially generate a very large (exponential) number of constraints. In [22] , a piecewise convex programming has been proposed, which only targets power with no consideration for SEU and transient errors. In order to capture the effects of logic fault masking, we introduce resubstitution-based statistical methodology and techniques [10] , [9] for quantifying error propagation through logic circuitry. We also introduce a new formulation for gate sizing problem using convex programming. Our approach is different from previous ones because the number of constraints in our formulation is linear with respect to circuit size as opposed to the exponential size of constraints in previous work. At the same time, we impose a bound on SERs and evaluate the performance of gate sizing considering SEU. This paper is an extended version of our preliminary results published in [8] .
The rest of this paper is organized in the following way. First, in Section II, we go over the preliminaries and cover the models that we have used for delay and SEs. In Section III, we introduce the statistical methodology that we have incorporated to calculate logical masking probabilities. Our formulation and problem transformation are presented in Section IV. Sections V and VI include generalization of our method and incorporation of process variation. Finally, Section IV illustrates the simulation results on ISCAS '85 benchmarks.
II. PRELIMINARIES

A. Power and Delay Models
Power dissipation of gates in digital CMOS circuits is composed of dynamic and static components. In this section, we consider dynamic power consumption, and furthermore, in Section V, we will study leakage power as well. Dynamic power corresponds to the power dissipated in charging and recharging internal capacitors in every gate, which is given by
where C i is the effective switching capacitance of the gate i, which is a linear function of the size of the gate; f clock is the clock frequency; and V DD is the power supply voltage. Φ i simply represents the linear dependence of power on size since the capacitance of a gate is a linear function of size (width of the gate). The aforementioned sum is taken over all the gates in the circuit. Gate delay can be represented as a function of the internal capacitors of logic gates. We use the logical effort method to model the delay [28] , which is a linear delay model. Furthermore, in Section V, we generalize our methodology for nonlinear delay modes as well.
The delay d i of gate i can be written as
where p i is the parasitic delay of the gate and is independent of size; g i is the logical effort of the gate, which is intuitively the driving capability of the gate; and h i is the electrical effort (gain). h i is the size-dependent term in the delay
The aforesaid sum is taken over all the loads that gate i drives.
As stated previously, gate capacitance is linearly dependent on the size of the gate, and therefore, it is a function of size. To represent the dependence of delay on size, we rewrite (2) using a new function κ
where
The logical effort model is a simplified gate delay model which may not be very accurate for current circuit technologies. We have chosen this model to illustrate the concept of SER impact on power optimization and how we can address this issue. In Section V, we generalize our formulation using accurate power and delay models.
B. SEU
An SEU is an event that occurs when a charged particle deposits some of its charge in a microelectronic device, such as a CPU, a memory chip, or a power transistor. This happens when cosmic particles collide with atoms in the atmosphere, creating cascades or showers of neutrons and protons. At deepsubmicrometer geometries, this affects semiconductor devices at sea level. In space, the problem is worse in terms of higher energies. Similar energies are possible on a terrestrial flight over the poles or at high altitude. Trace amounts of radioactive elements in chip packages also lead to SEUs. Frequently, SEUs are referred to as bit flips.
A method for estimating SER in CMOS SRAM circuits was recently developed by Hazucha et al. [15] . This model estimates SER due to atmospheric neutrons (neutrons with energies > 1 MeV) for a range of submicrometer feature sizes. It is based on a verified empirical model for 600-nm technology, which is then scaled to other technology generations. The basic form of this model is
where F is the neutron flux with energy ≥ 1 MeV (in particles per square centimeter per second), A is the area of the circuit that is sensitive to particle strikes (the sensitive area is the area of the source of the transistors, which is a function of gate size) (in square centimeters), Q crit is the critical charge (in femtocoulombs), and Q S is the charge collection efficiency of the device (in femtocoulombs). 1 Cazeaux et al. [4] presents a very accurate model for Q crit and its dependence on gate sizes. In the following model, Q crit for the gate i is dependent on gate sizes as stated in
where Q crit min is the critical charge for minimum driver conductance, minimum diffusion capacitances, and minimum fanout gate input capacitance. Coefficients a i and b j are constant parameters that weigh the contribution to Q crit . The sum is taken over all gates driven by gate i.
As seen in (5), gate sizing has an effect on Q crit ; therefore, we use a function, Θ, to represent this dependence
Furthermore, the sensitive area to SEU, A, is linearly dependent on size
Substituting Q crit and A i in (5) gives us a nonlinear relationship between error rate and gate sizes for a given logic gate. It is important to notice that even if an SE is generated in a logic gate, it does not necessarily propagate to the output. SEs can be masked due to the following factors. 1) Logical masking occurs when the output is not affected by the error in a logic gate due to subsequent gates whose outputs only depend on other inputs. 2) Temporal masking (latching-window masking) occurs in sequential circuits when the pulse generated from particle hit reaches a latch but not at the clock transition; therefore, the wrong value is not latched. 3) Electrical masking occurs when the pulse resulting from SEU attenuates as it travels through logic gates and wires. In addition, pulses outside the cutoff frequency of CMOS elements will be faded out [11] , [26] . Therefore, we assign a probability ρ to each logic gate to indicate how likely a pulse resulting from SEU can survive to the end and cause an error in the output. The final error rate (λ) assigned to each gate i would be λ i = SER i · ρ i . In Section III, we introduce a methodology for statistically computing these probabilities.
C. System Lifetime and MTTF
In this paper, we are considering SERs as a measure for system failure. If the error rate in a system is λ, the mean time to failure (MTTF) is
Therefore, if an MTTF that is greater than a given value, such as Υ, is desired, it implies that λ ≤ (1/Υ). In digital circuits, since all gates are potentially prone to SEs, the total error rate of the circuit (Λ) is
Using (5), we can derive the following equation for the total error rate of a digital circuit:
III. STATISTICAL ANALYSIS OF GATE MASKING
Extensive statistical analysis was done on the circuits from the ISCAS '85 and '89 benchmarks to determine the impact that gate masking can have on the circuit-level SER. The first approach was to observe statistically what the impact of an error in a specific gate would have on the observed error in the circuit. In other words, we compared the global output that we observed with and without SEs in gates. From this analysis, we were able to determine the probability that an error in a specific gate could result in an error in the circuit.
The analysis was conducted by simulating the output values of the circuits for randomly generated input values. First, we simulated the function of the circuit for a statistically large number of times, using random independently generated input values for all the inputs. Then, we compared the output results of the proper functioning circuit with that of the circuit where an SEU had occurred or, in other words, where a bit had been flipped. As would be expected, because of gate masking, the effect of the flipped bit was not always realized at any of the global outputs. We carried out this simulation for every gate in the circuit for all the benchmark circuits.
Specifically, in our experimentation, we considered 2000 independently random input values. Of course, this is a small fraction of the actual number of possible input values, but experimentation over various runs with different input instances revealed a large correlation among runs. We verified our results by running various instances of the experiments to verify that, indeed, the results that we were obtaining were consistent. One such instance, benchmark c432, is shown in the graph in Fig. 1 . The graph shows the 15 different runs on the same benchmark On the other hand, we are not taking into account electrical masking. Electrical masking evaluation requires postlayout information, including interconnection characteristics. Gate sizing under study, in this paper, is a prelayout process. An interesting future work will be including layout (placement and routing) information and will utilize it to calculate electrical masking.
IV. PROBLEM FORMULATION
Intuitively, gate sizing problem can be stated as a timing management problem in such a way that, given a digital circuit with distinct constituting gates, the maximum tolerable power reduction of the circuit is desired while the timing constraints of the system is not violated and, at the same time, the SER of the circuit is bounded by a given value. Although these gates are often modeled as nodes in a directed acyclic graph (DAG), the problem of timing management on nodes is a special case of a more general budgeting on edges. In this section, we illustrate how the delay budgeting on nodes is transformed to delay budgeting on edges. Therefore, we focus on delay budgeting on edges throughout this paper.
In logic synthesis, circuits are usually modeled as a DAG G = (V, E) (see Figs. 2 and 3 ). In this model, nodes represent the logic gates, and edges stand for the precedence relation between them. We transform the given graph G into G in such a way that each node v in G is split into two nodes v 1 and v 2 and that an edge connects v 1 to v 2 . In the transformed graph, the new edges are basically the logic gates from the original graphs. Fig. 3 shows an example of such transformation. In order to have a single input and a single output, nodes s and t have been added to the graph; s is connected to all primary inputs, and all primary outputs are connected to t.
The delay of a path p = s, v 1 , v 2 , . . . , t from node s to node t is equal to the summation of the delays of each edge along the path. We use the terms "delay of a path" and "the distance between nodes s and t" interchangeably; here, the sum is taken over all the edges in path p.
From this point on, we use double indices since we are dealing with edges instead of nodes. In other words, since each node is replaced by an edge, indices represent the edges. For instance, assume that node w is split and that the new edge e ij is representing the node (see Fig. 4 ). In this case, instead of P w for the power consumption of this node, we use P ij to represent its power consumption.
The problem is defined as follows. Given a DAG G = (V, E), a timing constraint T , and an error rate constraint Υ minimize ∀e ij ∈E P ij (12) such that the delay of every path from s to t is less than or equal to T and that the error rate (caused by SEU) is less than Υ. P ij is the power consumption of the ijth edge in the DAG, which is a function of the capacitance of the gate. 2 The timing constraint can be stated as e ij ∈p k d ij ≤ T for every path p k from s to t. Note that the number of paths in a DAG is exponential in terms of the number of edges in the graph, so this formulation is not efficient. Throughout the rest of this section, we will convert it to a formulation with the same objective function that has a linear number of constraints.
Theorem 1: There is an optimal gate sizing solution on a DAG such that the distance between any node u and the output t is independent of the choice of the path taken between them and that this distance is unique.
Proof: Suppose that the claim is not true, i.e., there exists a node v where its distance to t through path P 1 is less than P 2 (see Fig. 5 ). Before getting to the proof, it is important to note that the edge e u 1 u 2 is a split node. In other words, it represents the gate u in the original graph and, therefore, the incoming edge to u 1 , i.e., e * is representing an actual edge from the original graph, not any gates.
Without loss of generality, we can assume that P 1 is shorter than P 2 . We claim that there exists an edge (e * ) in P 1 that can be slowed down and still not violate the timing constraint because P 2 is on the critical path from v to t. One immediate candidate for e * is the first edge in P 1 . Increasing the delay of e * by d P 2 − d P 1 will not cause a timing violation, and since it does not contribute in the cost function nor error rate constraint, the total power dissipation and error rate remain constant. This increase is made by assigning a "dummy" delay to e * . e * is on the outgoing edge from the gate v, which represents dependence and does not contribute to the total power consumption. Therefore, we can maintain the same objectives by equalizing the delay of P 1 and P 2 , and since this optimization problem has only one global minimum, there should exist an optimal solution in which the statement in Theorem 1 holds.
Theorem 1 still holds for more complex delay-power model. As long as the power dissipation of a gate is a nondecreasing function of gate size (which indeed is), the theorem holds. Therefore, more exact timing models can be used in our proposed formulation. The following observation is immediately inferred from Theorem 1.
Corollary: There is an optimal solution in which the delay of each path in the optimal solution from the primary input node s to the primary output node t is equal to T .
An optimal solution is which the objective (power) is minimized while all the constraints (timing and SER) are met. Now that the delay between every node to the destination in the optimal solution is independent of the path taken, let t i be a variable assigned to each node v i that represents its distance to t. A similar technique was proposed in [13] , which has resulted in an efficient integer delay budgeting algorithm. We call t i the distance variable of node v i . In other words, t i is the delay of the system from node v i to the output. Therefore, the delay and power consumption of each edge (node in the original graph) are represented, respectively, by
Thus, instead of having a constraint for each path from s to t, we construct the following constraints:
Equation (13) enforces that the delay assigned to each gate is greater or equal to its minimum delay (parasitic delay), while (14) assigns a nonnegative delay to those edges in the original graph that represent connection between gates. Equation (15) guarantees that the distance from s to t is less than or equal to the timing constraint T . Equation (15) can be interpreted as a minimum delay required for the virtual edge between s and t. This edge is also shown in Fig. 3 . The constraint in (16) is simply the lower bound on transistor width. Error rate can be bounded by the following constraint:
in which Υ is the desired upper bound on SER. All the timing constraints on paths are reformulated as edge constraints, and the optimization problem can be restated as (18) subject to the constraints in (13)- (17) . The objective in (18), along with the constraints stated in (13)- (17), forms a nonlinear optimization problem with linear number of constraints in terms of the size of the graph. In the next section, we modify and solve this problem using the convex programming method. Theorem 1 is also valid for circuits with reconvergent fanouts. Although there are dependences of edges across many paths, there exists a delay assignment on an edge to equalize delays on all paths passing through it.
A. Convexity of the Optimization Problem
Global optimization is the task of finding the absolutely best set of parameters to optimize an objective function. In general, there exist solutions that are locally but not globally optimal. Consequently, global optimization problems are typically quite difficult to solve; in the context of combinatorial problems, they are often NP-hard. In convex optimization problems, a locally optimal solution is also globally optimal. These include LP and QP problems where the objective is positive definite if minimizing (and negative definite if maximizing). Furthermore, NLP problems belong to the same class where the objective is a convex function if minimizing (or a concave function if maximizing), and the constraints form a convex set [3] . In this section, we will show that our proposed formulation yields a convex optimization problem.
Convex optimization problems are far more general than linear programming problems, but they share the desirable properties of LP problems. They can be solved quickly and reliably even in very large size. A convex optimization problem is a problem where all of the constraints are convex functions and the objective is a convex function while minimizing or a concave function while maximizing. With a convex objective and a convex feasible region, there can only be one optimal solution, which is globally optimal. Several proposed methods-notably interior-point methods-can either find the globally optimal solution or prove that there is no feasible solution to the problem.
Geometrically, a function is convex if a line segment drawn from any point ( x, f ( x)) to another point ( y, f ( y))-called the chord from x to y-lies on or above the graph of f . Algebraically, f is convex if, for any x and y, and any α between zero and one
The objective defined in (18) is a linear function of the variable W i and is therefore convex. To state the convexity of the proposed formulation, it is required to show that the feasible solution space created by the constraints is, in fact, convex. The constraints in (13) can be rewritten as
Since all the variables in (20) are positive, the constraint imposed by (20) is a posynomial expression. Posynomials are not convex in this format, but with a change of variables, they can be mapped to a convex space. If each variable x in a posynomial expression is substituted with e z , the resulting expression becomes an exponential convex function [3] . The constraint presented in (17) is a nonlinear function, which is generally very hard to handle. This function has both linear and exponential dependences on a variable, which results in a nonmonotone function. Each variable W i is linearly contributing in only one term of (17) (through the sensitive area corresponding to A i ) but is exponentially included in a few terms of same equation (Q crit for gate i and all fan-in gates). Therefore, not only are there more terms that include W i in the exponent, but their effect compared to the linear dependence of A on W i is much more significant. On the other hand, shrinking a gate size reduces both the power consumption and the sensitive area to SEU. Therefore, the exponential contributions of gate sizes and power dissipation are two contradictory factors, not the sensitive area.
This observation led us to modify (17) such that we assume an average value for each A i and change the constraint to exponential form which is convex. This modification and exponential transformation in the posynomial constraints keep the objective and other constraints convex, and the solution space, which is the intersection of all subspaces created by convex constraints, is itself convex. In the simulation section, after calculating gate sizes, we use the actual resulting gate sizes to recompute total error rate, and report the correct numbers.
The significance of our optimization methodology lies within two facts. First, the problem size is linear in terms of the circuit size, which makes the problem generation (constraints and objective) a linear-time process. Second, since the problem is convex (both in simple and accurate delay-power model), finding the optimal point in solution space can be done efficiently.
B. Simulation Flow
In order to analytically evaluate our methodology, we first compute the delay and power of each benchmark and use the delay value T as the timing constraint for the optimization process. The outputs of the optimization process are gate sizes and distance variables. We use gate sizes to compute the total power consumption for comparison with the initial circuit. As discussed in Section IV-A, to evaluate SEU resiliency, we recompute SER and use this value in our report. Further information is provided in Section VII.
V. GENERALIZED DELAY AND POWER MODEL
In this section, we expand our results for the cases in which more accurate delay and power models are presented. Generally speaking, power dissipation of VLSI circuits is composed of dynamic and leakage power
We did study dynamic power before in this paper. In this section, we study leakage power and include it in our gate sizing methodology. Leakage power can be modeled according to
where I sub is the subthreshold current which is accurately described in [12] I sub = A · e Vgs−V th 0 −γVs +ηV ds
where A is the size-dependent term
where c ox is the gate oxide capacitance, μ is the bias mobility, W and L are the width and length of the gate, respectively, k is the Boltzmann constant, T is the absolute temperature, and q is the charge on the electron. Now, we merge the size-independent terms of subthreshold current and V DD into a single function Ψ and restate (22)
As seen in (25) , using this model, there is linear dependence between leakage power and size. The generalized objective of our formulation can be restated as
Remember that double indices are used since, in the transformed graph, each gate is represented by an edge.
As for the delay model, we consider the nonlinear delay model as opposed to the linear Elmore delay. A very accurate nonlinear delay model can be found in [31] in which gate delay can be represented as
where L represents the output load which is the sum of output wiring and gate loading
All a z , b z , c z , d z , and e z are real-value constants. Using the aforementioned model, timing constraints can be rewritten as
which is equivalent to
In the presence of the more general delay model, SER constraints remain intact. Constraints in the form of (30) are also in posynomial form and, therefore, with the same change of variable, can be transformed into standard convex format. Since the objective in (26) is a linear function of size, the optimization problem will be a convex optimization which maintains the same properties as discussed in Section IV-A.
A. Complexity of Optimization Problem Using General Models
An immediate concern using general delay and power models is its effect of the optimization complexity. Fortunately, this model does not effect the complexity of the problem for the following reasons: the number of constraints in the general formulation is the same as before. In other words, more accurate models do not enforce new constraints. Furthermore, the objective has the same number of terms as seen (26) . Therefore, asymptotically, the generalized problem formulation has the same running time.
VI. PROCESS VARIATION: STATISTICAL OPTIMIZATION REFINEMENT
Process variation has become an inevitable characteristic of modern CMOS circuits, particularly in sub-100-nm processes [19] , [20] . Process parameter variations may result in large variability in circuit delay affecting the yield [14] , [27] . Statistical timing analysis and gate sizing methods try to address optimization techniques considering process variations. Many of these techniques are based on adapting a known deterministic method to consider variability [6] . Process variation is not the primary focus of this paper, but in order to signify the problem and show the effectiveness of our method, we incorporated a similar approach as in [6] to address variability issues. The idea behind this approach is to dynamically change the timing constraints on the circuit and reassign gate sizes with the new constraints to meet timing and/or yield constraints.
In this approach, new timing constraint is presented in statistical way. We rewrite (15) as
where θ is the tunable bound on timing violation probability, which is set according to the desired yield. d c = t s − t t is the circuit delay which is a random variable. d c can be modeled as a normally distributed random variable [6] . Equation (31) enforces the critical path delay to be less than T with probability larger than θ. Algorithm 1 summarizes the high-level approach in the SEU-aware gate sizing with yield consideration.
Adapting SEU-aware gate sizing to process variation and yield 1: Solve optimal gate sizes with timing constraint = T , initialize T = T 2: Extract circuit delay distribution (pdf) through statistical timing analysis 3: while Prob(t s − t t ≤ T ) < θ do 4: Refine the timing constraint T , T ← T − δ 5: Reassign deterministic gate sizes using the proposed convex programming method 6: Recalculate the statistical characteristics of the circuit delay, i.e., μ and σ end while Compute the power consumption and SER for the final assignment In Algorithm 1, δ is the reduction step in the new timing constraint. The new timing constraint, T − δ, is set such that if the delay of the resulting circuit has a mean that is equal to T − δ, then Prob(t s − t t ≤ T ) ≥ θ. The new timing constraint can be calculated as
where s is a normalization factor and φ −1 (z) is the inverse of the normal distribution function with z ≡ (x − μ)/σ and dz = dx/σ for arbitrary distribution. If the standard deviation of a circuit after gate sizing remained intact, s = 1 would be the correct choice for the timing constraint adjustment. However, as the circuit goes under sizing, the standard deviation also changes, and setting s = 1 may overconstrain the problem. For evaluation purposes, we set s = 0.7, which was obtained by multiple simulation runs. It is common to set θ 84% which concludes that the mean is smaller than the timing constraint by 1σ or θ 97% which concludes that the mean is smaller than the timing constraint by 2σ.
VII. SIMULATION RESULTS
We used the MOSEK convex optimization tool [23] to solve the proposed formulation on ISCAS benchmarks. We applied our technique on 65-nm technology with the assumption that the process variation has Gaussian distribution with σ/μ = 5%. For each benchmark, we calculate the total delay of the circuit, T , with initial size values and then use that delay as the timing constraint for the optimization problem. Each benchmark has a total error rate, Λ, associated with its initial size which we used to impose constraints on the error rate. For each circuit, we minimize the power consumption with four different bounds on the error rate, Υ, starting with the no bound on error rate, error rate of the initial circuit, i.e., Λ, reduced rate by a factor of 100%, and, finally, reduced error rate by a factor of 200%. Figs. 6(a) and 7(a) show the power consumption reduction versus the bound on error rate for combinational circuits. A point (x, y) in these graphs means that power dissipation has been reduced by y%, while the total error rate has been decreased by at least a factor of x% compared with the error rate in the initial circuit.
It can be observed that an average of 47% power saving can be achieved without any constraint on the error rate for these benchmarks. Obviously, power saving percentage depends on the initial design which we compare our results with. If an original design is far from optimal, then the power saving ratio becomes larger. Therefore, the significance of the results is that we can achieve optimal power reduction in the presence of SER constraints. This is the reason why we have not compared our results with any other power optimization method since power reduction through gate sizing is a well-known fact, and it is important to know how bounding SERs can be integrated in gate sizing. We ran the same optimization with process variation. Figs. 6(b) and 7(b) summarize these results. The timing constraint is set such that the circuit delay is less than the constraint with probability ≥ 97%, which means that d c ≤ T + 2σ T . The average power reduction is about 34% for these benchmarks under process variation.
Furthermore, in order to get a feeling of the running time of the proposed methodology, Table I summarizes the running time of the optimization problem for the set of benchmarks. The first column is the benchmark name, and the second column contains the number of gates in the circuit (each DFF is counted as one gate). The third column is the CPU time in seconds for the solver to find the optimal solution for the case with no bounds on SER, while the fourth column is the CPU time for the case when we impose an upper bound on the error rate. Timing results also validate the fact that our linear size model and the efficient convex programming method are a fast process, and runtime grows almost polynomially with the circuit size.
VIII. CONCLUSION
In this paper, we have introduced a new formulation for gate sizing that targets power optimization, resiliency against SEUs, and timing constraints simultaneously. As a preprocessing step for optimization, we developed a statistical modeling and validation technique that quantifies the impact of fault masking in combinational logic. We formulated the problem as a convex optimization problem of linear size as opposed to the previous convex programming approaches which could potentially be exponential in size. We further generalized our methodology to include nonlinear delay models and leakage power as well. The MOSEK convex optimization tool was used to evaluate the proposed approach on ISCAS benchmarks. We were able to minimize power dissipation for a given timing constraint and various upper bounds on error rates caused by an SEU. An important practical result was that convex-programming-based gate sizing can simultaneously reduce power consumption and improve SEU resiliency. Our simulation showed that different circuits from various benchmarks behave differently when carrying out power optimization while considering SEs. To further generalize the method, we used our method in combination with statistical techniques to consider process variations. As future work, we propose the examination of the question of designing circuits such that, through gate sizing, optimum power savings can be achieved while enforcing low error rates.
