In this paper, we present an accurate but very fast soft error rate (SER) estimation technique for digital circuits based on error propagation probability (EPP) computation. Experiments results and comparison of the results with the random simulation technique show that our proposed method is on average within 6% of the random simulation method and four to five orders of magnitude faster.
Introduction
Soft errors are intermittent malfunctions of the hardware that are not reproducible [4] . These errors, which occur more often than permanent errors, arise from Single Event Upsets (SEU). These SEUs, in turn, arise from energetic particles, namely neutrons and alpha particles. Soft Error Rate (SER) for a device is defined as the error rate due to SEUs. So far, memory elements have been more susceptible to soft errors than the combinational logic. However, analytical models predict that the SER in the combinational logic will be comparable to that of memory elements by 2011 [6] . The first step in developing soft error reliable designs with minimum performance and area penalties is to accurately estimate system SER and the contribution of each component to the overall soft error vulnerability.
The error rate of a circuit node, ni, is broken into three terms and computed as RSEU (ni) × P latched (ni) × P sensitized (ni) [3] . RSEU (ni) is the bit-flip rate at node ni which depends on the particle flux, the energy of the particle, type and size of the gate, and the device characteristics. P latched (ni) is the probability that an erroneous value on node ni is captured in a flip-flop. P sensitized (ni) is the probability that node ni is functionally synthesized by the input vectors to propagate the erroneous value from the error site to primary outputs (POs) or flip-flops (FFs). Estimating the last parameter is the most time-consuming part since in this step, several random vectors are applied to the circuit inputs to determine the Error Propagation Probability (EPP) from an error site to outputs. All previous SER estimation methods use the random vector simulation approach [2, 3, 4, 6] . The SER estimation time of a node in large circuits exponentially increases with the size of the circuit. Hence, SER estimation of larger circuits becomes intractable with these techniques.
In this paper, we present a new EPP computation technique based on circuit topological traversal and signal probabilities. This paper is organized as follows. Sec. 2 presents our EPP computation technique. Sec. 3 presents the experimental results. Finally Sec. 4 concludes the paper.
Gate-level SER Estimation
We consider all circuit nodes as possible error sites. In our approach, we first extract the structural paths from each error site to all reachable outputs. Then, we traverse these paths to compute the propagation probability of the erroneous value from the error site to reachable primary outputs or flip-flops. An on-path signal is a net on a path from the error site to a reachable output. An on-path gate is defined as the gate with at least one on-path input. Finally, an off-path signal is a net that is not on-path and is an input of an on-path gate. These three are shown in Fig. 1 .
For EPP calculation, as we traverse the paths, we use signal probability for off-path signals and use our propagation probability rules for on-path signals. The signal probability (SP) of a line l indicates the probability of l having logic value "1" [5] . If there is only one path from the error site to an output, the error propagation probability from an on-path input of a gate to its output depends on the type of the gate and the signal probability of other off-path signals. In the general case in which reconvergent paths might exist, EPP from the error site to the output of the reconvergent gate depends on the polarities of the propagated error on the on-path signals, as well. To address this issue, we need error propagation rules for reconvergent gates. First, we define the following parameters:
• Pa(Ui) and Pā(Ui) are defined as the probabilities of the output of node Ui being a andā, respectively. (a is an erroneous values andā is the inverted of a). In other words, Pa(Ui) (Pā(Ui)) is the probability that the erroneous value is propagated from the error site to Ui with an even (odd) number of inversions.
• P1(Ui) and P0(Ui) are the probabilities of the output of node Ui being 1 and 0, respectively. In these cases, the error is blocked and not propagated. Note that for an on-path signal Ui, P (Ui) = Pa(Ui) + Pā(Ui) + P1(Ui) + P0(Ui) = 1, while for an off-path signal Uj, P (Uj) = P1(Ui)+P0(Ui) = 1. Since we have considered the polarity of error propagation, this will take care of reconvergent fanouts. EPP computation rules for elementary gates are shown in Table 1 .
Consider the example shown in Fig. 1 . Assume that an SEU with sufficient energy hits gate A. After computing P (E) = 1(ā), P (G) = 0.7(ā) + 0.3(0), and P (D) = 0.2(a) + 0.8(0) 1 , we follow these steps for EPP calculation.
Gate
Rule 
Figure 1. EPP calculation on reconvergent paths
For a general case, the following algorithm shows how we can extract and then traverse all paths from a given error site to all reachable outputs and how we apply the EPP rules as we traverse the paths.
For every node, ni, do:
1. Path Construction: Extract all on-path signals (and gates) from ni to every reachable primary output P Oj and/or flip-flop F F k using the forward Depth-First Search (DFS) algorithm [1] .
2.
Ordering: Levelize signals on these paths using the topological sorting algorithm [1] .
3. EPP Computation: Select the on-path gates in a topological order, from the error site to reachable outputs, and apply propagation rules (Table 1) for EPP computation. Using a topological order enable us to compute EPP in just one pass (linear time complexity).
After completing the above steps, Pa(P Oj) and Pā(P Oj) are computed for every P Oj reachable from ni. P sensitized (ni) is calculated as follows:
, where k is the number of outputs reachable from ni.
Experimental Results
The proposed approach was implemented and applied to ISCAS'89 benchmark circuits using a DELL Precision Table 2 shows the results for our systematic approach as well as the random simulation for selected circuits (due to space limitations). For larger circuits, a limited number of gates of the circuits are simulated due to exorbitant run time of the random-simulation method. The speedups of our approach are reported with and without including the SP computation time in the total execution time. When SP time is excluded, the speedups are 4-5 orders of magnitude. When included, our approach is still 2-3 orders of magnitude faster than the random simulation method. The accuracy of our approach versus random-simulation is 94%, in average.
Conclusions
In this paper, an accurate error propagation probability computation technique for soft error rate estimation has been developed. The proposed approach leverages the signal probability calculation, which is already used in other steps of the design flow. This technique can be used to identify the most vulnerable components to be protected by soft error hardening techniques.
