Abstract-In this paper, we present a concept of compacting the error traces generated by pseudo-random/random simulations. The new shorter error trace not only decreases the time of user's debugging process but also reduces the simulation time required to verify the bug fixes. Two algorithms CET1 and CET2 are presented to perform the task of compacting the error trace. Both algorithms first use an efficient approach to eliminate the redundant states to generate the unique states of the error trace. Then, CET1 build the connected graph of these unique states by computing the reachable states by one cycle for each unique state, and then apply Dijkstra's shortest path algorithm to find out the shortest error trace in the connected graph. Compared with CET1, CET2 computes the reachable states by one cycle for those unique states, when they are needed in Dijkstra's shortest path algorithm to find the shortest error trace. After finding the shorter trace, the corresponding input/output test vectors are generated. The experimental results show that both algorithms can reduce the length of error traces dramatically for most cases using reasonable memory. For cases required longer CPU time to find the shortest trace, CET2 is up to 37 times faster than CET1.
I. INTRODUCTION
The increasing complexity and circuit size of designs have made function verification task as one of bottlenecks in VLSI design cycle. In recent conferences, many panelists claimed that functional verification takes about 60%-70% of design cycle. The tasks of functional verification includes detecting errors in designs and finding the causes of the errors (debugging process). To detect errors in designs, verification engineers and designers may generate test vectors manually, write test benches using tools, and write properties. Currently, formal verification tools checks whether the design satisfies their properties. If not, a set of counter-examples ("error traces") can be generated to debug the design. Usually, formal verification tools can use techniques [4, 9] to generate good error traces to make the debugging process easier. Test vectors and test benches are mainly used in simulators and emulators to detect errors in designs. If errors are found, error traces are written out for debugging process. In general, these error traces produced by random/pseudo-random test benches contain very long cycles and many redundant states. It is very This work was supported by the National Science Council, R. O.C., under contract no. NSC89-2215-E009-120, while both authors were at National Chiao-Tung University in Taiwan. difficult for users to debug using these traces and to take long simulation time to verify the bug fixes. Thus, we are inspired to develop the technique to compact error trace generated by random/pseudo-random test benches.
The main reason for compacting error traces generated by pseudo random simulations is that they may contains very long cycles and many redundant states. In the debugging process, users need to find the causes of bugs from the information in the error trace. Thus, The longer cycles of simulation trace are, the more time users spend to debug. Moreover, after finding the causes of bugs and fixing them, users perform the same simulation run to verify the bug fixes. Then, users put the simulation run into the daily regression suites to prevent the same bugs happen again. Therefore, shorter error traces not only reduce the simulation time required to verify bug fixes, but also reduce the simulation time for daily regression.
A digital circuit can be formulated as a finite state machine (FSM) and the circuit behavior can be viewed as the sequences of state traversal. The error trace contains all information of a sequence of state traversal from an initial state to an error state. The problem of compacting the error trace can be defined as follows: Given a digital circuit design and its simulation error trace information containing the initial state and the error state, the problem is to find other error trace as shorter as possible.
Our approach to solve compacting the error trace is based on the following three techniques: First, an efficient technique is used to eliminate the redundant states and to generate the unique states of the error trace. Then, the connected graph of these unique states is generated by computing the reachable states by one cycle for each unique state. The Dijkstra's shortest path algorithm is applied on the connected graph to find out the shortest error trace. Finally, the corresponding input/output test patterns are automatically generated for the new error trace. This algorithm is named as CET1. In experiments, we found out that not all of the unique states are needed to compute their reachable next states. Thus, algorithm CET2, modified from CET1, only computes the reachable states by one cycle for those unique states, when they are needed in Dijkstra's shortest path algorithm to find the shortest error trace.
To the best of our knowledge, techniques to generate counterexamples in symbolic model checking done by Clarke et al. [4] and Jin et al. [9] are the closest related work to ours. Their approaches can only be used in symbolic model checking, while our approach can be used for any error trace generated by simulation or formal verification tools. In general, test vector generation has been the subject of many efforts in testing and function verification areas. For instance, formal verification-based techniques to derive a set of function test vectors for simulation have been reported by Benjamin et al. [2] , Geist et al. [6] , Gupta et al. [7] and Ho et al. [8] . However, these approaches are generating test vectors for detecting bugs in designs. Our work is to generate shorter test vectors from the initial error trace for debugging process.
The rest of this paper is organized as follows: In section II, the definition of the problem is described. Section III describes our approach to solve the problem. The experimental results are shown in section IV. Section V describes our conclusions and future work.
II. PROBLEM DEFINITION
A digital circuit can be formulated as a finite state machine (FSM) and the circuit behavior can be viewed as the sequences of state traversal. The error trace contains all information of a sequence of state traversal from an initial state to an error state. First, we observe some characteristics of the error trace. Figure 1 shows the state transition diagram of a simple finite state machine and an initial error trace. The error trace takes 9 cycle time to reach the target error state. However, notice that three of the passed states,
, and
, are traversed twice. In fact, it just passes through 6 unique states totally, and hence the shortest trace takes 6 cycle time at most to reach the error state in the worst case. If state 5 rather state 4 was chosen as the next state when the state 3 was traversed first time, the duplicate state traversal can be avoid. However, we can't know which next state should be chosen to avoid the duplicate state traversal until the whole error trace is traversed. It is a quite popular circumstance to have many redundant states in the error traces generated by random/pseudo-random simulation. Thus, we want to find a shorter trace to reach the same target error state from the same initial state. The problem can be defined as follows: Given a circuit design and its error trace containing the initial state and the error state, the problem is to find the shortest error trace from the initial state to the error state and then to generate the corresponding input/output test vectors of the new trace. In addition, we have to make the following assumptions on the input designs:
1. Completely synchronous digital circuits.
2. Global reset signal always available.
3. no 3-state latches or registers.
III. OUR APPROACH This section presents our approach for compacting the error trace. The basic idea of our approach is based on the concept of eliminating the redundant states in the error trace and finding the shortest path among the remaining unique states in the trace. 
Add edges with weight 1 between a and states in Finally, according to the shortest path, input/output test patterns are generated by function GenIO(). The following tasks are performed for every state transition 
A. Finding Unique States in Initial Error Trace
In order to quickly find out the duplicate state in the initial error trace, we build one data structure,called non-duplicate tree, as shown in Figure 3 , based on the assumption that each latch or register just could contains one of the two values, 0 or 1, at any time. The left branch represents encoding 0, the right one represents encoding 1, and the height of the tree represents the number of state bits. While each path from the root node to each leaf node represents one state appears on the error trace. During the non-duplicate tree construction, if the traversed path according to state encoding has already existed, we can affirm the state has been reached previously, and then ignore it to deal with the next state continuously. Otherwise, it means the state has not been reached previously, and we should add some required nodes and branches to construct the non-duplicate tree. This non-duplicate tree is represented by Algebraic Decision Diagrams (ADD) [1] where the leaf nodes contain the unique state identification numbers. BDDs are generated to represent these unique states. Moreover, the length of the shortest error trace will not longer than the number of unique states, since each state on the shortest error trace should be traversed at most once. What is the time complexity of this method? With N states on the error trace and b state bits, the time complexity is q . B. Computing 1-cycle reachable states 
C. Finding the Shortest Path
After building the connected graph 1 with weighted edges, the task is to find the shortest error trace from initial state to the target error state . Dijkstra's shortest path algorithm [5] , shown in Figure 5 , is a very suitable approach for our requirement. First, the graph We found that most of the time for algorithm CET1 is spent on computing the 1-cycle reachable states for each unique state and these reachable states information is needed to perform function Relax() in Figure 5 . If we can avoid performing function ComputeNS() as many as possible, our algorithm can have better performance. Thus, we reschedule the time to perform function ComputeNS() into Dijkstra's shortest path algorithm, as shown in Figure 6 . Algorithm CET2 rearranges the tasks in line 4-7 of algorithm CET1 into the new order shown in line 4-10 of Figure 6 . With this modification, function ComputeNS() will not be performed for those states which shortest distance from is greater than the shortest distance from to , since they will not be extracted from z . In worst case, CET2 will have the same performance as CET1.
D. CET2 Algorithm

IV. EXPERIMENTAL RESULTS
We have implemented our CET1 and CET2 algorithms in C++ language with the CUDD (Colorado University Decision Diagram) package [11] . We applied our algorithms on twelve designs in ITC'99 benchmarks. The circuit information about those designs is given in Table 1 . The first and second columns list the number of circuits and the original functionality respectively. The third column is the total number of gates in the designs. The numbers of the primary inputs and outputs are given in the fourth and fifth columns respectively. The last column is the total number of registers in the designs. For each of these designs, we generated several error traces with different lengths and our algorithms on them. The experiments were performed on a 1.4 GHz AMD Athlon machine with 2GB main memory.
Circuit Original Functionality
Gates Table 2 shows the experimental results using CET1 algorithm. The circuit name and trace number is shown in column 1. The number of states of the initial error trace is shown in column 2. Column 3 shows the number of unique states in the initial error trace. In CET1, the number of unique states in the trace is the number of states needed to compute their reachable next states, and is the upper bound of the final result. From the results, case "b1-1" has the lowest percentage (7.3%) and several cases have no redundant states in the traces. Column 4 shows the number of states in the shortest error trace found by CET1 algorithm. For most of the cases, the length of the error trace is reduced dramatically. For instance, the length of error trace is reduced from 40061 to 17 in case "b8-1". For cases in circuit "b4", CET1 can found the shortest error traces within 6% of the length of their initial traces, which have almost no redundant state. However, for case "b5-2" and "b12-1", the length of their error traces can not be reduced at all. For case "b5-1", the length of the error trace is reduced by only 1 state. CPU time and memory usage of CET1 algorithm, reported in columns 5 and 6, respectively, is proportional to the number of unique states shown in column 3, because for each unique state, CET1 compute its reachable next states to build the graph and then apply Dijkstra's shortest path algorithm to find the shortest error trace. For most cases, except cases in circuit "b4", CET1 can generate the shortest error trace within 300 seconds and 100 MB memory usage. For cases in circuit "b4", we found most of the CPU time and memory of CET1 spent on computing the reachable next states. Table 3 shows the CPU time and memory usage of CET1 and CET2 algorithms for cases in circuit "b4" and case "b11-2", which requires longer CPU times to find the shortest error trace. For other cases, CET2 has the similar CPU time and memory usage as CET1 shown in Table 2 . CET2 found the same length of shortest error trace as CET1 did. In general, CET2 used less memory usage to achieve the same quality as CET1 did, while CET2 improve the CPU time dramatically for cases in "b4". The CPU time improvement is contributed by that the number of states to compute the next states in CET2 is reduced from the number of unique states to the number states with length not greater than the length of the shortest error trace. For instance, in case "b4- 4 
V. CONCLUSIONS AND FUTURE WORK
We have shown the concept of compacting the error traces generated by pseudo-random/random simulations and two algorithms CET1 and CET2 to perform the task of compacting the error trace. Two algorithms CET1 and CET2 are presented to solve this problem efficiently. Experimental results show that both algorithms can reduce the length of error traces dramatically for most cases using reasonable memory. For some cases, our approach can not reduce the length of error traces, since they are the shortest one already. For cases required longer CPU time to find the shortest trace, CET2 is up to 37 times faster than CET1. However, there are still many ways to improve our algorithms. First, we would like to explore other heuristics to speed up our algorithms by reducing the number states to compute the reachable next states. Second, we would like to reduce the length of the shortest error trace as shorter as possible. Currently, we only compute one-step successors in our algorithms. In some case, we can not reduce the length of the error trace. Thus, one-step successors may not be enough to achieve shorter trace. For examples, in Figure 7 , State 7 is one of the next states of state 1, but is not one of unique states. Thus, our algorithms doesn't include state 7 in the reachable states. However, there exists one path passing through state 7 to reach the target error state in 3 cycles, and our algorithm can only find the path with length 4 cycles. In order to find shorter error trace, we plan to add the length parameter into function ComputeNS() to find -cycle successors. Thus, edges among unique states may contains different edge weights from 1 to . With this modification, we will be able to find shorter error trace. However, the more cycles we compute the reachable states, the shorter error trace we can find and the slower performance we have to suffer. Thus, can not be too large.
