Abstract -A new improved method for calculation of fault coverage with parallel fault backtracing in combinational circuits is proposed. The method is based on structurally synthesized BDDs (SSBDD) which represent gate-level circuits at higher, macro level where macros represent subnetworks of gates. A topological analysis is carried out to generate an efficient optimized model for backtracing of faults to minimize the repeated calculations because of the reconvergent fanouts. The algorithm is equivalent to exact critical path tracing, however, processing the backtrace in parallel for a group of test patterns. Because of the parallelism, higher abstraction level modeling, and optimization of the topological model, the speed of fault simulation was considerably increased. Compared to the state-of-the-art commercial fault simulators the gain in speed was several times.
I. Introduction
Fault simulation is a widely used procedure in the digital circuit design flow. Test generation, fault diagnosis, test set compaction serve as examples of application of faultfree/fault simulators. Accelerating fault simulation would improve all the above-mentioned applications.
Parallel pattern single fault propagation (PPSFP) [1] has been widely used in combinational circuit fault simulation. Many proposed fault simulators incorporate PPSFP with other sophisticated techniques such as test detect [2] , critical path tracing [3, 4] , stem region [5] and dominator concept [4, 6] . These techniques have reduced further the simulation time.
Although the concepts of dominators and stem regions achieve substantial speed up in fault simulation, there are still two shortcomings [7] : some regions are repeatedly simulated [1, 4] , and inefficiency for circuits with high depth [2, 5] . Further improvement was reached in [7] where the key idea was to simulate the circuit in the forward levelized order and to prune off unnecessary higher level gates in the early stages.
Another group of methods like deductive [8] , concurrent [9] and differential fault simulation [10] are based on logic reasoning rather than simulation. Deductive fault simulation [8] can be faster than parallel fault simulation as their complexities are O(n 2 ) and O(n 3 ), respectively [11] , where n is the number of logic gates in a circuit. There is no direct comparison between the deductive and concurrent fault simulation techniques, however, it is suspected that the latter is faster than the former because concurrent fault simulation only deals with the "active" parts of the circuit that are affected by faults [12] . Differential fault simulation [10] combines the merits of concurrent fault simulation and single fault propagation techniques, and was shown to be up to twelve times faster than concurrent fault simulation and PPSFP.
The fault analysis methods based on reasoning (deductive, concurrent and differential simulation) are very powerful since they collect all the detectable faults by a single run of the test pattern. What they cannot do is to produce reasoning for many test patterns in parallel.
The Critical Path Tracing (CPT) consists of simulating the fault-free circuit and using the computed signal values for backtracing all sensitized paths from primary outputs to primary inputs, to determine the detected faults. The trace continues until the paths become non-sensitive or end at network primary inputs. Faults on the sensitive (critical) paths are detected by the test. CPT is also based on reasoning, and can process by a single simulation run all the faults, however it works exactly only in the fanout free circuits. A modified critical path tracing technique that is linear time, exact, and complete is proposed in [13] . However, the rule based strategy does not allow simultaneous parallel analysis of many patterns. Parallel critical path tracing in fanout free regions combined with parallel simulation of stem faults was investigated in [14] . In [15] the concept of parallel critical path tracing was generalized for using it beyond the fanout free regions.
Compared to [15] where the topological model for backtracing of faults was created for all the outputs separately as a set of submodels, in this paper, a joint topological model is created for the whole circuit to avoid unnecessary repetition of the same calculation procedures. To achieve high simulation speed, the network of macros rather than gates is used. To make it possible to rise from the lower gate level to the higher macro level, the macros are modeled by structurally synthesized BDDs [16] .
The organization of the paper is as follows. In Section 2, a general description of the method for parallel critical path tracing beyond reconverging fanout regions is presented. Section 3 explains the topological pre-analysis of the circuit and proposes a method for optimized building of the simulation model. Experimental results are discussed in Section 4, and Section 5 summarizes the paper.
II. Critical path tracing in a reconverging fanout region
The parallel gate-level critical path tracing inside the fanout free regions (FFR) is straightforward. Let us have an arbitrary macro with a Boolean function:
If a stuck-at fault is detected at the output y of the macro, then for every input the fault on it is also detected iff
The Boolean derivative is a Boolean formula that can be calculated in parallel for many test patterns. In order to extend the parallel critical path tracing method beyond the fan-out free regions we use the concept of partial Boolean differentials [17] .
Let us have a converging gate (or macro) F of the converging fanout region depicted in Fig.1 , and represented by a Boolean function
Assume that the input variables x 1 ,…,x i of the gate F are connected to the fanout stem x via macro level network represented by functions
Here X i are subsets of variables. The function of the full convergent fanout region with a common fanout stem x and output y in Fig.1 can be represented now as a composite Boolean function
Consider the full Boolean differential [17] of the gate or subcircuit F )) ( ),..., (
Here, by dx we denote the change of the value of x because of the influence of a fault at x. According to the interpretation of dy [17] , dy = 1 if some erroneous change of the values of arguments of the function (1) causes the change of the value of y, otherwise dy = 0.
In the following we show that the dependency of the output variable y of a reconverging FFR on the fanout stem x can be represented by a Boolean differential equation which in its turn can be transformed to a very efficient exact procedure for critical path tracing beyond the convergent FFRs.
Fig. 2. Nested reconvergencies in a circuit
Theorem 1: If a stuck-at fault is detected by a test pattern at the output y of a subcircuit represented by a Boolean function y = F(x 1 , …, x i , x j ,… x n ), then the fault at the fanout stem x which converges in y at the inputs
The proof of Theorem 1 is given in [15] . From the formula (3), a method results for generalizing the parallel exact critical path tracing beyond the fan-out free regions. All the calculations in (4) can be carried out in parallel because they are Boolean operations.
Consider now a reconverging fanout circuit in Fig.2 represented by a composite function y = F y (x,z,X y ) where z = F z (x,X z ), whereas X y and X z are the subsets of variables that are not depending on x.
Theorem 2. If a stuck-at fault is detected by a test pattern on the output y of a subcircuit with two nested reconvergencies:
where x 1 is depending, but X y and X z are not depending on x, then the fault at the common reconverging fanout stem is also detected iff
The proof of Theorem 2 is given in [15] . The formula (4) can be used for calculating the influence of the fault at the common fanout stem x on the output y of the converging fanout region by consecutive calculating partial Boolean differentials, first d x F z , and then d x y.
It is easy to see that the result of Theorem 2 can be iteratively generalized for an arbitrary configuration of nested reconvergencies. On the other hand, according to Theorem 1, the dependencies between fanout stems and fanout region outputs (reconverging nodes) can be represented by full Boolean differentials and thereafter
8A-6
transformed into fast computable critical path tracing procedures.
III. Generation of the model for fault tracing
The proposed exact parallel path tracing fault analysis is carried out in the following sessions:
• topological pre-analysis of the circuit to create a model for fault tracing along the critical paths and subcircuits; • parallel simulation of a given set of test patterns to calculate the values of all variables of the circuit; • parallel fault backtracing on the topological model created during the first session.
The topological pre-analysis to create a model for fault backtracing is carried out only once to serve all the next sessions of the procedure.
Compared to the previous method [15] where the topological model for backtracing of faults was created for all the outputs separately as a disjoint set of submodels, in this paper, a joint topological model is created for the whole circuit to avoid unnecessary repetition of the same calculation procedures during the third session of the fault analysis.
The topological analysis of the circuit consists of three procedures:
• creation of the Reonvergency Graph (RG) of the circuit, • creation of the whole calculation model of the circuit.
A. Reconvergency graph
The first procedure of the topological analysis is carried out in direction from primary inputs to primary outputs of the circuit. By this procedure, all the fan-out stems and all the reconvergent fan-in nodes of the circuit will be found.
As the result of the procedure, a graph G = (N,Γ) is created which represents a skeleton of the circuit. Let N be the set of nodes in G, and Γ the mapping in N, where
• Γ(x) ⊂ N is a set of successor nodes of the node x ∈ N, • Γ -1 (x) ⊂ N is a set of predecessors of the node x ∈ N,
• Γ -1 (x i ) ⊂ N is a predecessor node connected to the input i of the node x∈ N, • Γ*(x) ⊂ N is a transitive closure of Γ(x), and
The nodes of the graph belong to the following subsets:
where FO is the set of all fan-out stems, FI is the set of all reconvergent internal fan-in gate outputs, and OUT is the set of all primary outputs of the circuit. Denote by RO ⊆ FO the subset of all fan-out nodes which reconverge and by RI ⊆ FI ∪ OUT the subset of all reconvergent fan-in nodes. The procedure of path tracing in direction from inputs to outputs with creating of subsets RI(n) is illustrated in Table I . 
8A-6
An edge (x,y) in the reconvergency graph between two neighbour nodes x and y represents a signal path of the circuit through gates without fanouts. During the topological analysis, all the paths in the circuit are traced, and the found reconvergencies are fixed in the form of subsets RI(x). Before creating a joint topological model of the whole circuit for fault tracing purposes, the next step is to build an ordered set N* of all the nodes in G. The subset N* is built by the following procedure:
1. First, all the outputs x ∈ OUT are included into N*, and ordered in an arbitrary way. 2. Each node x from the remaining subset of nodes FO ∪ FI ⊂ N will be included into N* as soon as Γ(x) ⊂ N* will be valid. The ordered set N* of nodes of the skeleton graph of the circuit will be the basis for creating the model for fault backtracing in the circuit according to the proposed method. This strategy allows to create an optimized calculation model by the breadth-first processing of nodes.
B. Creation of the optimized calculation model
In the third procedure of the topological analysis we start the creation of the calculation model Φ(N*) for the ordered set of nodes N* of the given topology graph G = (N,Γ). Each node x ∈ N in a graph G represents a node variable in a circuit to be fault simulated in parallel for a set of test patterns t = (t 1 , t 2 , … t τ ). Denote by x = (x(t 1 ), x(t 2 ), … x(t τ )) the vector variable for representing the values of the node x at the set of test patterns t.
The model Φ(N*) is a sequence Φ(N*) = (φ 1 , φ 2 , … , φ q ) of vector formulas YX = φ k (y,x), k = 1,2, … , q for calculating the vector derivatives ∂y/∂x along the signal path (x,y) in the original circuit, which corresponds to the edge in the graph G between the neighbouring nodes x and y, where x ∈ Γ -1 (y). All φ k (y,x) in Φ(N*) are accompanied also with formulas to calculate the detectability of faults in the cones with top on y outside the topology graph G. For two consecutive paths (z,y) and (x,z) where z ∈ Γ -1 (y) and x ∈ Γ -1 (z) according to the rules of calculating Boolean derivatives, we get the topological formula
Introduce for x ∈ N as a sensitivity vector
which expresses the sensitivity of outputs of the circuit to the erroneous values of x, so that for each k = 1,2, …, τ, where OUT x {y⏐ y ∈ OUT, x ∈ Γ -1 (y) }. Let us call the vectors YX in (6) partial sensitivity vectors used for expressing the detectability of faults at particular outputs y ∈ OUT.
If we have calculated the sensitivity vector Y for y ∈ N, we can calculate also all sensitivity vectors for z ∈ Γ -1 (y) by the corresponding formulas YZ. This procedure can be continued for all x ∈ Γ -1 (z) in their turn according to (5) . For the fan-in nodes y ∈ RI with reconvergency from a node x ∈ RO we specify all the inputs y k of the node y which have paths to x and create the corresponding formulas Y k X for calculating the Boolean derivatives ∂y k /∂x. Let us call the values of Y k X as activity vectors for the paths (x,y k ). The activity vectors are needed for calculating the Boolean derivatives by formulas (3) or (4).
Activity vectors express the sensibilities of selected internal nodes y to faulty signals in respect to other internal nodes x ∈ Γ* -1 (y) whereas the sensitivity vectors directly express the detectability of faults at the primary outputs of the circuit. Now the We start to create the formulas φ k in Φ(N*) starting from the output nodes in the beginning of the ordered set N*. The following procedures will be carried out. Procedure 1. For each y ∈ OUT -RI we put into Φ(N*) the formulas YX where x ∈ Γ -1 (y) for calculating the sensitivity vectors X. Each formula is extended by the procedure of calculating derivatives ∂y/∂z for all the nodes z in the fan-out free cone of the original circuit with the root in y.
Procedure 2. For each y ∈ OUT ∩ RI we create two formulas for each reconverging input y k with respect to x ∈ Γ 1 (y): for calculating the partial sensitivity vectors YX, and for calculating the activity vectors Y k X. The procedure of using formulas YX and Y k X can be optimized to avoid repeated calculations along the same path.
Processing of nodes in the topology graph will go on according to the order in N*. For each y ∉ RO, the Procedure 2 will be used for calculating both the sensitivity and activity vectors.
Example 3. An iterative use of the Procedure 2 for calculating ∂t(B 2 )/∂t(H) will create the activity formula B 2 H=B 2 G∧G4∧4H and for calculating ∂t(B)/ ∂t(H) the sensitivity formula BH=BG∧G4∧4H.
Whenever a node x ∈ FO where RI(x) ≠ ∅ is reached then for all y ∈ RI(x) the topological formulas φ k are created in a form of (3) by Procedure 3 or iteratively, in a form of (4) by Procedure 4.
Procedure 3. Consider a subgraph G(x,y) of the reconvergency graph G, consisting of the nodes
8A-6
where x∈ RO, y ∈ RI(x), N(x,y) ∩ RI(x) = y, and y j ∈ Γ*(x) for all of the inputs j = 1,2,…,i of the node y. In this case, there are no nested reconvergencies on the paths from x to y, and for calculating ∂y/∂x the formula (3) can be used. Denote the Boolean expression for calculating ∂y/∂x by fault backtracing in G(x,y) from y to x based on the formula (3) as
The arguments Y i X of the function F in (7) represent the formulas for calculating ∂x i /∂x in (3) . If the path (x,y i ) traverses several nodes in G(x,y) then the component Y i X in (7) will be calculated according to the rule of the consecutive paths (5).
Example 4. As an example, we represent the partial sensitivity vector B3 in Fig.3 for calculating the Boolean derivative ∂B/∂3 according to (7) , by the topological formula
which according to (3) corresponds to the expression (4) and (5) we calculate ∂y k /∂x, as
and ∂y/∂x according to the Procedure 3. If the subgraph G(x,z) has further nested reconvergencies we have to repeat Procedure 4 iteratively.
At each current node y ∈ N* the formulas φ k (y,x) for all x ∈ Γ* -1 (y) are created as Boolean expressions along the paths (x,y) in the circuit.
Example 5. The full calculation model Φ(N*) for the topology graph in Fig.3 is presented in Table II . The formulas marked by "*" are sensitivity vectors, whereas all other represent activity vectors.
IV. Experimental results
Table III presents the fault simulation results for the circuits of ISCAS'85 benchmark family and combinational versions of selected circuits of ISCAS'89 family (column 1) to compare different fault simulators: exact critical path tracing [13] (column 2), two state-of-the-art commercial fault simulators from major CAD vendors (columns 3, 4), our previous parallel critical path tracing method [15] (column 5), and the proposed new method (column 6). The simulation times were calculated for the sets of random 10000 patterns. The time needed for topology analysis is included and is negligible compared to the gain in speed which is 2.5 times compared to the previous best method. Compared to the commercial tools C1 and C2, the average gain in speed is 41.1 and 5.8 times, respectively.
All the experiments were run on a 366 MHz SUN Ultra60 server using SunOS 5.8 operating system except the experiments for the known exact critical path fault simulator where the data are taken from [13] . The experiments in [13] were run on a 2.8 GHz Pentium 4 computer with Windows XP. The last column clearly shows the gain in speed that was achieved by optimizing the topological model by the method proposed in the paper.
8A-6

V. Conclusions
A method has been proposed for macro-level parallel exact critical path fault tracing for combinational circuits or scan-path designs. Differently from the known critical path tracing approaches, a method is proposed to create ordered topological model for parallel fault backtracing. The model is based on the full Boolean differential, which allows generalization of the parallel critical path fault tracing beyond the reconvergent fanout stems. Considerable increase in the speed of fault simulation was shown compared to the two state-of-the-art commercial fault simulators, in average 5.8 and 41.1 times, respectively.
The proposed method allows creation of the full fault table for a group of test patterns with a single pass of the algorithm.
