We present a new approach to automatic test pattern generation for very large scale integrated sequential circuit testing. This approach is more e cient than past test generation methods, since it exploits knowledge of potential circuit defects. Our method motivates a new combinatorial optimization problem, the Tour Covering Problem. We develop heuristics to solve this optimization problem, then apply these heuristics as new test generation procedures. An empirical study comparing our heuristics to existing methods demonstrates the superiority of our approach, since our approach decreases the number of input vectors required for the test, translating into a reduction in the time and money required for testing sequential circuits.
Introduction

Overview
Very large scale integrated sequential circuit testing is necessary to insure the reliability of circuits. Since circuits contain from thousands to millions of parts and connections, testing of sequential circuits is currently a time-consuming and expensive process, which needs to be made more e cient. Even small reductions in time invested for testing lead to savings in the manufacturing process.
At the termination of a typical manufacturing process thousands of chips that contain the physical representation of a circuit need to be tested for functional and logical correctness. For each unit produced the testing should determine if the chip is faulty or not.
Very large scale integrated sequential circuit testing thus attempts to pinpoint problems with a circuit, determining whether faults are present. However, this task becomes di cult when the circuit cannot be examined internally. If the circuit is viewed as a black box to which we can apply input and then observe output, but never see the internal workings, then determining exactly whether the circuit is faulty is challenging.
One way of testing a circuit (or the corresponding chip) is to try out all possible inputs and compare the resulting outputs to the outputs of a fault-free circuit. This process is very time consuming, and yet it works only for combinational circuits, those that have no feedback mechanism (see Figure 1(a) ). In a sequential circuit there is a feedback mechanism which implies that it is not su cient to try all possible inputs, but also all possible values of the memory elements have to be tried in conjunction with the inputs. Consequently the testing of sequential circuits is far more complex than the process of testing combinational circuits, which is in itself a di cult problem.
In an attempt to reduce the complexity of sequential circuit testing, internal testing methods have been devised to \open up" the black box. Techniques such as Scan Path, or Level Sensitive Scan Design, which employ shift register latches to control and observe all internal states PW] can be used. Since these methods are costly, time-consuming, and potentially damaging to the circuit, they are avoided unless absolutely necessary.
Our goal here is is to propose a more e cient method for testing circuits that does not monitor the internal state at every step, yet still exploits knowledge of the circuit interior in order to come up with a faster test. Thus, our approach is useful for testing circuits for which the physical design is known. The information about the circuit is represented as a list of potential defects and the corresponding resulting performance errors.
In the evaluation of testing methods the amount of time required to create the test vectors is of secondary importance since the creation of the test vectors is done once for each circuit design. The e ciency is measured in terms of the number of generated test vectors, as this number is proportional to the amount of time required to test a given circuit chip. The manufactured circuits are tested with the same set of test vectors repeatedly (and commonly thousands or even millions of times), and therefore the number of test vectors is of prime importance.
Testing of circuits can be viewed as comprising of two main tasks. One is the generation of inputs that can lead to external manifestation of existence of faults. The other is the compression, or compaction, of this set of vectors. These two tasks are usually viewed as one and the attempt is to generated a compacted list of test vectors. Our approach allows for the separation of these tasks. It can work with any test generation procedure and produce a faster test than would have been produced by the test generation alone.
The approach we use is proved faster than commonly used approaches on the set of benchmark circuits ISCAS-89. These circuits are considered the standard bechnmark for algorithms on sequential circuits. Real circuits are substantially larger than the benchmark circuits, yet the methods used are the same as for the small ones, typically combined with a heuristic decomposition of the large circuit to a collection of smaller ones. The method we propose can similarly be used in conjunction with such decomposition and is likely to be 3 The testing method we develop is based on modelling the testing problem as a Tour Covering problem. In this problem the input is a directed graph adn a set of elements. Each arc has an associated subset of elements it covers. The problem is to nd hte shortest length tour that traverses arcs which joinly cover all the elements. As we point out, this model has some de ciency and does not represent fully all aspects of the testing problem. We correct for those de ciencies in the heuristic procedures devised for solving the problem in the testing context.
Preliminaries
Figure 1(b) presents a schematic description of a sequential circuit. This type of circuit has a feedback mechanism. Logic values in the next state lines are stored in memory units and then fed back into the circuit as the present-state values in the next time frame. These memory units are also referred to as latches or ip-ops (denoted by FF in Figure 2 ). Figure  2 presents one time frame of a sequential circuit. The values carried by the latches along with those assigned to the primary inputs determine uniquely the outputs and the next state of the circuit.
The gates of the circuit in Figure 2 represent the well known boolean functions, AND, OR and NOT. For 2 (or more, if the gates are concatenated) 0 ? 1 inputs the AND function outputs 1 if and only if all inputs are 1. The OR function outputs 1 if and only if at least one input is 1. The NOT function has a single input and it reverses 1 to 0 and 0 to 1.
The model best suited for describing the behavior of a VLSI sequential circuit is the nite state machine, a ve-tuple, M = (S,J ,O,v,w) Gr]. S is the set of the states of the machine, J is the input alphabet, O is the output alphabet, v is the next-state function, v:S J ! S, and w is the output function, w:S J ! O. Logic values in the circuit's present state lines correspond to the machine's current state, input applied to the circuit is identical to the machine's input.
The nite state machine associated with a circuit de nes its State Transition Graph (STG) which conveniently and concisely encodes information about all possible states, their allowable inputs, and reachable next-states M], Gh]. Nodes in the graph correspond to logic states in the circuit, while arcs represent transitions between states. Thus, each node has outgoing arcs representing transitions resulting when allowable input is applied to that state. Arcs are labeled with this input, as well as the output produced when this input is applied. Movement from one state to another in the State Transition Graph simultaneously relates to transitions of the nite state machine and to the functioning of the circuit. We will refer occasionally to states as nodes and to transitions as arcs.
Each input vector applied to a circuit at a given state, produces an output and a transition to a di erent state. While the output is externally visible, the resulting state is invisible. This invisibility of the state plays an essential role in all testing methods for sequential circuits.
For further clari cation we digress at this point to explain the notion of don't cares. Suppose the inputs include q 1 ; : : : ; q n as the primary inputs, and q n+1 ; : : : ; q m as the memory latches values (the state). Suppose q 1 ; q 2 are inputs to an OR gate and q 1 = 1. Then no matter whether q 2 is 0 or 1 the outgoing line from the gate will carry the value 1. In this case we say that q 2 assumes a don't care value which is denoted by X. Similarly if q n+1 ; q n+3 are inputs to an AND gate and q n+3 = 0 then the output line from the gate is always 0 and q n+1 = X. By the same token many of the state values can be don't care values. Consequently each state can be viewed as one of an equivalence class of states that have the same pattern of transitions given the same inputs (or the same equivalence class of inputs). This view of each state as a set and each input vector as a set, exploiting the existence of don't cares, is signi cant in reducing the complexity of dealing with large circuits. We will refer to each state and input vector as a singleton, whereas actually each may represent a set of states or input vectors.
While we do not make explicit use of the notion of don't cares, this is an important aspect of the subroutines we use for test vector generation per faults and for fault simulation. Faults in the circuit can a ect its behavior, either by altering the output produced in a state or by a ecting transitions out of the state, sending the circuit to the incorrect nextstate. With the single stuck-at model, the fault model we work with, a fault is present when a circuit wire's logic value is corrupted so that it is xed either at value zero or one. It is further assumed that only one such fault is ever present at a time in a circuit PW]. This standard assumption may lead to invalid tests as is manifested in the veering o phenomenon that we address in detail in Section 5.
An excitation vector for a fault is an assignment of values to the circuit's present state lines and primary inputs, which produces faulty output or causes an incorrect next-state transition when the fault is present. An excitation state is the present state part of the excitation vector. A sequence of input vectors that takes the machine from the reset state to an excitation state while the defect is present is known as a justi cation sequence, while the process of sending the machine to the excitation state is referred to as justifying the state Gh] . A test sequence for a fault is a sequence of input vectors, which when applied to the faulty circuit, produces output values di erent from those the fault-free machine produces. If a fault alters output produced at an excitation state, the justi cation sequence concatenated with the input part of the excitation vector form a test for the fault. However if a fault alters the excitation state's next-state but not its output, the justi cation sequence alone does not form a test for that fault, since the e ects of the defect cannot be observed at the excitation state. Additional input vectors must be applied, until the e ects of the fault are propagated to the output lines. Such a sequence of input vectors applied at the excitation state which propagate the e ects of the fault to the output are termed a di erentiating sequence Gh] . Concatenated to the justi cation sequence and input part of the excitation vector, these form a test.
We refer to the sequence beginning with the excitation state and ending with the state ending the di erentiating sequence as ED path. The sequence beginning at reset followed by the justi cation sequence and the ED path we call JED path. In order to clarify why the ED path may consist of more than one arc (transition) consider the example in gure 4. That example depicts a JED path, which is a test for fault f, as followed in a fault-free circuit versus the same path followed in the presence of fault f. At the point where the excitation state is reached the behaviour of the circuit di ers in the faulty circuit from that in the fault free circuit. Figure 4 shows the two paths that are followed in both circuit. The faulty one follows the dotted transitions while producing outputs O j that are identical to the outputs produces in the fault-free circuit until the di erentiating transition leading to D. On this nal arc of the path the output of the fault-free circuit is O q whereas the output of the faulty circuit is O j . So although the fault is present, it is not manifested prior to reaching state D.
The task set by the testing problem is to traverse the state graph with a minimum number of input vectors so that for each potential defect one of its excitation states is visited, and immediately thereafter its corresponding di erentiating sequence is applied. For the circuit of Figure 2 one possible test checking for defects A through F starts at reset, or state 00, and consists of the input vector sequence 1-0.
There is more than one dimension to determine the quality of a test procedure. It ought to be fast and require on the average a small number of input vectors per fault checked. Another factor determining the quality of testing is that a large fraction of faults (ideally all) are detected. It has been observed empirically that any crude approach to testing will be able to test a small fraction of the faults very quickly (with small number of test vectors). As more and more faults are checked for the number of input vectors needed to detect the next fault tends to go up. Our goal is therefore to devise a test procedure that is both fast and detects a large proportion of the set of faults compared to other test procedures.
Literature Review
Di erent approaches have been taken for conducting VLSI sequential circuit test generation without prying into the internal workings of the circuit, and thus by justifying states and checking output. The most popular methods, due to its simplicity, is the random approach.
8
The random approach is to generate sequences of input vectors composed of zeros and ones at random. These input vectors are applied to faulty circuits while the output is observed. If this output di ers from that which a fault-free circuit would produce, as determined from the STG, the vector forms a test for the defect in the circuit. Random tests do manage to check for some defects. However, they are not e cient, at least not after a small fraction of faults has been detected. (Our observation is that this fraction is about 20% .) It is possible, using information about the circuit structure, to design smarter tests which check for more faults using less input vectors.
Ghosh presents such a test procedure Gh]. His method nds several paths, each starting at the reset state and traveling through the STG, checking for a fault. His approach justi es and di erentiates states one at a time, starting at the same initial reset state before each justi cation sequence. Because of this strategy of looking for input vectors which check for faults one at a time, we refer to his method as the Single-fault approach. The idea of repeatedly resetting the circuit (or the machine) to this initial state before looking for a sequence for each defect proves wasteful, however. In addition to adding distance to the test vector, resetting every time adds cost to the testing process since the machine must be switched o then on again to get back to the initial state. As we show later, it is frequently bene cial to not to restart and look for an entirely new test sequence for each fault. Instead, simply adding a few input vectors to an existing sequence could take the machine a few states further and check additional faults. Such an approach would be less costly and timeconsuming than generating a new test sequence for each defect, not using information from previous defects. Gosh has demonstrated however, and our empirical studies have con rmed that, that his approach is substantially more e ective than the random approach.
F uredi and Kurshan proposed a theoretical method that views test generation as a requirement to traverse every arc in the graph FK]. As such, they solve the Chinese Postman Problem, or the problem of nding a minimum length tour that traverses every arc in the graph, de ned on the STG graph. (We present a formal de nition of the Chinese Postman Problem in Section 3.) We refer to such an approach as a Chinese Postman approach. This strategy has several shortcomings. By requiring that every arc in the STG is traversed this approach traverses more than one edge for each defect and also edges that do not di erentiate any defect at all. The method is also prone to error due to the phenomenon of veering o . This happens when one gets the right output, but it is produced at the wrong state. More explicitly, the test procedure is based on the logic description of the fault-free circuit and depends on the machine being in the correct state at every step. Since faults may cause incorrect next-state transitions, the machine may not always move into a planned state, but to a di erent state, due to a faulty transition. Then it has veered o and it may not be possible to justify all excitation states at once. Although the circuit may appear, deceptively, to produce the correct output vector this happens only because it has veered o , and the output at this new state just happens to coincide with the correct output at the planned state.
Another shortcoming of the method of F uredi and Kurshan is that it requires the entire STG to be generated in advance. Even for small size circuits, with only 10-15 memory latches, this is impossible. For these reasons this approach is not implementable as is. The concept however is useful and we incorporate it in our approach.
Aho, Dahbura, Lee, and Uyar employ an improvement on the Chinese Postman approach. They seek to cover only edges that cover defects, i.e., those that correspond to di erentiating vectors. The problem of covering a subset of edges with a single tour of minimum length is called the Rural Postman Problem, and this approach to testing we call the Rural Postman approach. The application of this problem which Aho et al. discuss comes up in a related, but di erent context. They generate tests for checking the conformance of a protocol implementation to its speci cation, a problem closely related to sequential circuit testing ADLU]. Their problem, however, requires that all of a certain subset of transitions of a Finite State Machine be checked, while our problem must only check selected transitions, since a fault can have more than one ED path. For the testing problem, their approach has the shortcoming of traversing a di erentiating sequence for a fault even after that fault may have already been identi ed. Again this approach is not practially implemetable for the same reasons as the previous one -the veering o and the explicit generation of the STG.
There are also more general test generation methods, appropriate when a list of potential defects and di erentiating vectors is not available. Such is the case when testing implementation of logical design, or testing protocols. Lee and Yannakakis present such a testing method taking as input two machines, A and B, the speci cation machine, and the implementation machine respectively LY]. They construct an input sequence which distinguishes the two machines. This sequence is of length O(pn 4 logn) where n is the number of states in the STG and p is the number of input lines. That is, if machine B contains some defect, their test will nd it in time proportional to more than a fourth degree polynomial of the size of the state space. Because Lee and Yannakakis's method works with machines about which little is known their method is not constraining and thus has wide-ranging applications. It does however require the traversal of all edges in the graph (at least once), as it does not assume the availability of information about the circuit and its potential defects. It is hence prohibitively slow and inappropriate for checking physical circuits.
Among these existing methods only the random and single fault methods are practical for circuits of moderate size. We will therefore compare our approach to these two methods.
Overview of Paper
Our approach seeks to traverse at least one di erentiating sequence for each defect, so that the number of input vectors applied (and hence the number of edges traversed) is as small as possible. To achieve this end we examine combinatorial aspects of the test generation problem.
We begin by presenting in the next section the relevance of the set covering problem to the testing problem. A discussion on the similar features of the VLSI sequential circuit testing to the crew scheduling problem implies that similar empirical approaches may be used for solving both problems. In the following section we introduce a new combinatorial optimization problem, the Tour Covering problem, in which aspects of the Rural Postman are combined with set covering aspects of the testing problem. Three heuristics for Tour Covering are introduced in the fourth section, including a greedy heuristic that generates test sequences online. The following section, Implementation for Sequential Circuit Testing, relates this optimization problem back to sequential circuit testing, compensating for the veering o phenomenon. Section 6 presents results from an empirical study which applies three new testing heuristics to benchmark circuits. Data from these three methods is compared with data from the Single-fault approach and the random approach. Finally, the last section presents conclusions from the study and plans for future research.
The Set Covering Approach
The set covering problem can be viewed as a simpli ed version of the testing problem. Consider the path created in the STG graph by traversing from the reset state the justi cation sequence -excitation state -di erentiating sequence for a given fault. Recall that such path is termed JED-path to distinguish it from a path starting with the excitation state -di erentiating sequence, which we call an ED-path. A path usually detects a collection of faults.
For a path P j (of either type) we denote the collection of faults detected by the path as F(P j ) F where F is the set of all faults. We will consider the cost of a path to be the number of vectors in the path, or in other words, the length of the path jP j j. A path can therefore be viewed as a set that covers a subset of the faults. With this view in mind we can formulate the problem as a set covering problem using the notation a ij = 8 > > < > > :
1 if fault i 2 F(P j ) 0 otherwise: 12 and the variable x j is set to 1 if path P j is selected for the testing and 0 otherwise. The set covering problem is then formulated as, (SC) min P m j=1 jP j jx j s.t. P m j=1 a ij x j 1 i 2 F x j 2 f0; 1g j = 1; : : : ; m
Consider now the case when the paths are JED paths and hence their length is calulated from reset. In that case the optimal solution to (SC) is an upper bound on the length of the test sequence. This is because it is possible to concatenate another ED path to detect more faults, but this formulation does not allow for that. In that sense, this formulation emulates an optimization version of the single-fault method. After detecting some faults and reaching a di erentiating state it returns to reset.
The case of using ED paths is more intricate. The (SC) formulation then ignores the additional cost required to get from the end of one path (the di erentiating state) to the beginning of another (the excitation state). In that sense the optimal solution provides a lower bound. On the other hand, various ED paths may be overlapping, and hence their combined length is lesser than actually stated. For these reasons the value of (SC) when we consider ED paths is neither a lower nor an upper bound.
One way to remedy that is to check for all possible overlaps. This in itself may be too time consuming to be practical. However, one can heuristically just check for overlapping pairs and if nd any, add the union of the two paths to the set of candidate paths. Assuming that the number of overlaps is not substantial, this can be a good`approximate' lower bound.
Checking, to a limited extent, for overlaps is useful also in the case we use JED paths. There we can check for the overlap of ED paths and add the shortest justi cation sequence to the concatenation of the overlapping paths. This will result in an improved (i.e. lower in value) upper bound. The ultimate improvement would be to check for all possible sucecssive routings of ED paths and use their union as additional paths for the (SC) model. This, however, is too time consuming to be practical.
An additional aspect that is ignored in the set covering model is the veering o e ect. In addition to these shortcomings, the Set covering is a well known NP-complete problem. As such, even this simpli ed version of the testing problem is intractable.
The set covering instances that we consider have some speci c features. The number of sets is typically extremely large, as there are many di erent paths that could potentially test the same defect. In fact, it is practically infeasible to enumerate all such potential paths. A set covering problem with the same features comes up in the context of the crew scheduling problem for airlines. There, the sets are all the possible individual crew schedules, each speci ed in terms of the ight legs that it includes. The number of such schedules is typically exponential. In practical solution methods developed for the crew scheduling problem, the set of potential schedules is generated as needed (the column generation technique), rather than being given with the initial input. Such an approach is most appropriate for our application as well, in which context it means that test vectors are not speci ed in advance.
There exist some good recent methods for solving the crew scheduling problem. In particular, we believe that a column generation technique, such as the one developed for the crew scheduling problem by Anbil, Barnhart and Johnson, ABJ], could result in a small collection of test sequences that forms a comprehensive test for the testing problem. We do not pursue this direction here, but consider it very promising for future research.
The Tour Covering Problem
The tour covering problem introduced in this section is a better model of the testing problem compared to the set covering. It is still a simpli ed model of the problem as will be explained later. The tour covering problem combines the set covering aspects of the testing problem 14 with the requirement that edges must be traversed along a contiguous path which starts at the reset state, or along a tour if the path also terminates at reset.
A relevant problem here is the Chinese Postman Problem (CPP), that was initially solved by Edmonds and Johnson, EJ]. In (CPP), we are given a directed graph (the undirected version is irrelevant for our discussion), with each arc having some weight, representing its length, associated with it. The goal is to create a tour that traverses all the arcs so that the total length of the tour is minimum. This problem is solvable in polynomial time, by reducing it to a transportation problem, as discussed in the next section.
A more closely related problem is the Rural Postman Problem (RPP). Here there is a speci ed subset of the arcs in the graph that need to be traversed in a tour. The other arcs may or may not be included in the tour. Although a minor variation of the (CPP) algorithm nds, in polynomial time, a collection of tours of minimum length traversing all edges in the subset at minimum length, the problem of creating a single tour of minimum length is NP-complete ADLU]. (Indeed, it is easily seen that the Traveling Salesperson Problem is reducible to it by setting each subtour as a node in a graph).
As in (RPP), in the testing application we need to traverse only a subset of the edges, namely those that detect faults. Moreover, we need to traverse only one of the set of many paths that can detect a certain fault. This is the problem we call the Tour Covering problem, and it is de ned formally as follows: TOUR COVERING PROBLEM Instance: A directed graph, G = (N; A), a universal set F and subsets of F associated with each arc a j 2 A, F(a j ). Problem: Find a shortest length (not necessarily simple) cycle which starts at a prespeci ed node and travels through the graph, traversing a subset of the arcs A A such that a2 A F(a) = F.
The Tour Covering problem is related to set covering, since we seek to cover all of the n 15 elements. Also it generalizes the Rural Postman Problem which requires nding a shortest tour traversing a certain subset of the arcs.
The Tour Covering problem when used in the context of testing has faults detected associated with arcs. In fact the detection of faults requires traversing a sequence of arcs in an ED path -from excitation to di erentiation. Another aspect which is not addressed in the Tour Covering formulation is the veering o e ect. In an ideal testing procedure the length of the tours should be bounded to reduce the likelihood of veering o . In that sense a collection of a small number of tours each beginning and ending at reset, is preferred to a single tour. This issue will be discussed in detail in the implemetation section (Section 5). The additional number of vectors required for the multitour covering still justi es the added certainty that veering o is avoided.
Heuristic Algorithms for the Tour Covering Problem 4.1 The Chinese Postman Approach
Suppose one applies a Rural Postman algorithm to solve the tour covering problem as follows. Consider each arc a that has a nonempty set F(a) associated with it to require traversing. The union of these arcs is then the subset of arcs that need to be traversed. In that case the solution tour covers all these arcs although many may not need traversal as the set that they cover has already been covered by other arcs. To avoid that, we generate in our heuristic one subtour at a time, and then update the remaining set of elements to be covered on the remaining arcs in the graph.
When the CPP algorithm is applied to the Rural Postman Problem the result is in general a collection of subtours of total minimum weight among all possible such collections that cover all edges in the speci ed subset. Although such a result is inappropriate as a solution to the Rural Postman Problem, it is better suited for the purposes of the Tour Covering problem than a single tour. In our heuristic we will use the CPP algorithm and then select among all the subtours the one subtour that contributes most to the covering, where the contribution is measured in terms of the number of new elements covered per unit weight of the subtour. At each iteration a subtour is selected in this way. Then the collection of elements to be covered is updated. Those covered by the selected subtour are removed from the collection. When terminating the heuristic returns the collection of selected subtours, which together cover all n elements.
The heuristic calls for a routine CPP which generates a collection of tours covering the arcs in the subset at minimum total cost. That algorithm's description is given later in this section.
CPP Heuristic for Tour Covering
Input: G = (N; A); F, F(a) F; 8a 2 A. A = fa 2 AjF(a) 6 = ;g. An initial state v 0 2 N.
Step 0: (Initialization) i = 0.
Step 1: i i + 1.
If F = ; return(T) the set of subtours, STOP. 
End
Each subtour generated covers at least one element, as otherwise it could have been omitted while reducing the cost of the solution, thus contradicting the optimality. Hence, after at most n iterations the algorithm terminates.
For the testing problem we place in A all the arcs (transitions) that belong to ED paths generated for all faults. It is then possible that a subtour generated will not inculde the full ED path and hence will not test for the faults that this ED path detects for. For this reason the set of faults that is actually detected by a tour is not simply the union C k on each subtour as in Step 1 of the algorithm. Instead we fault simulated to identify the actual set of faults detected by each subtour. Details about fault simulation are given in Section 5. Note that it is also possible that more faults can be detected by a subtour compared to the union of sets of faults detected by each arc (or path) in the tour. These will be identify as well by fault simulation.
For completeness, in the remainder of this section we present our implementation of the CPP algorithm in L] used to obtain the collection of subtours. Given a speci ed subset of arcs to be traversed, the algorithm rst determines which additional arcs to include in the subtours. It then constructs the subtours on these edges. We consider next the problem of which additional edges to include in the tour. We form a collection A pot of potential additional arcs to be considered for the tour. Then a transportation problem is used to determine which arcs from this collection are actually included in the tour.
Initially A pot is empty. Then we add to A pot the shortest path from v i to v j , for each pair (v i ,v j ) of imbalanced nodes with e(v i ) > 0 and e(v j ) < 0. Note imbalances with respect to A pot of all intermediate nodes on a path are zero, since a path adds exactly one incoming and one outgoing arc to each. Thus including any of the paths in the tour would not disturb the imbalances of these nodes with respect to the tour. To determine which of the paths to include, we solve a transportation problem on the bipartite graph G B = (N pos ; N neg ; A b ) which has bipartition N pos ; N neg and arc set A B , consisting of arc (v i ; v j ) for each pair (i; j) such that e(v i ) > 0 and e(v j ) < 0. Arc (v i ; v j ) has cost equal to the length of the shortest path from v i to v j to in G. Node v i has supply equal to e(v i ).
Let A RP be the nal set of arcs to include in the Rural Postman tour. This set consists of arcs in A as well as edges in shortest paths whose arcs have positive ow in the transportation problem's solution. The number of copies of arcs on the shortest path between node v i in N 1 and node v j in N 2 that are contained in A RP is equal to the value of the ow on arc (v i ; v j ).
Since nodes in N are balanced with respect to A RP , subtours covering arcs in A RP exist. They can be formed by starting at any node in N 0 which is the tail node of a arc and following the arcs along the path of that arc. Once an arc is included in the tour, one of its copies is removed from the set A RP . As long as all of the edges in a arc have a positive number of copies remaining in A RP , the arc is active. When the head node of the arc is reached, arcs along any active arc are followed, one copy of each is removed from A RP , and the process continues until a node is reached with no adjacent active arcs. At this point, a subtour is complete and is placed in the collection of subtours T. Any node in N 0 which is the tail node of an active arc is selected, and the process continues, forming another subtour. Subtours are constructed until no active arcs remain.
Formally, the CPP algorithm is given as follows. It calls a subroutine shortest path which takes two states, v i and v j as input, then uses a breadth-rst search to nd the shortest path between these two states. It also calls transportation, which solves a transportation problem on the given graph and returns X, an (N pos N neg )-dimensional vector of ows, and eulerian tour, which nds and returns an Eulerian tour on the given graph. The output is T the collection of subtours that traverse and cover all arcs in A RP .
CPP Algorithm
Input: G = (N; A) and A A.
Step 0 Step 3:
T eulerian tour (G RP ) return(T).
End
Nearest Neighbor Approach
The second heuristic for solving the Tour Covering problem is the Nearest Neighbor heuristic. The input is, as before, the set F of n elements to be covered and the subset of arcs A each of which covers a nonempty subset of elements.
The nearest neighbor heuristic creates a subtour by appending to a given path starting at an initial node (initial state or reset) the nearest arc in A. Finding the nearest arc is determined by calculating the shortest path from the tail of the current path to the head of each arc in A. The shortest among those shortest paths is appended to the path along with the selected arc. The process is then repeated after updating the set of elements remaining to be covered. If the path folds on itself, it has created a subtour. The process continues nevertheless till all elements of F are covered.
In the algorithm description there is a function covered elements(B) = a2B F(a). For the testing problem this function is evaluated using fault simulation.
Nearest Neighbor Heuristic for Tour Covering
Input: G = (N; A) ; F, v 0 2 A, A A ; F(a) F; 8a 2 A. 
End
When implementing the nearest neighbor for the testing problem, the arcs in A get replaced by the ED paths with their starting state -the excitation state -and the end state -where the fault has been di erentiated. We use in this heuristic the terminology of states instead that of head and tail of arcs and paths.
22
The Greedy Approach
An approach to Tour Covering that is on-line has the advantage that it is not necessary to enumerate all possible paths covering an element ahead of time. Instead elements can be considered one at a time and a path to cover the element can be generated as needed. The Greedy Heuristic uses an on-line approach.
This heuristic maintains a set F of elements to cover. In the i th iteration a subtour is formed as follows. The heuristic generates an arc (a ED path) to cover the rst remaining element in F. It then appends the shortest path from the current state to the starting state of the generated arc and then that arc to the tour. At this point the heuristic checks which other elements are covered by the tour or some part of the tour. It removes these elements and the rst element from F. Then it generates a path to cover the next element in the updated F, adds the shortest path between the last point in the tour and the rst state in the generated path to the tour, then adds the generated path to the tour. It checks which additional elements are covered by the new part added to the tour and removes these and the rst one from F. The heuristic continues in this way, until N i arcs (paths) have been generated and appended to the tour created in the i th iteration. Then the shortest path from the last state in the tour to the speci ed initial state is added to the tour, completing it. The purpose of closing the tour is to return to the reset state and thus avoid the increasing likelihood of the veering -o e ect. Elements which this nal path covers are removed from F, completing the i th iteration. For the generation of the test vectors for the given faults in the implementation, we used the program SIS, developed and maintained by Logic Synthesis Group at University of California, Berkeley. The fault simulation is also done using slightly modi ed SIS subroutines.
Greedy Heuristic for Tour Covering
Fault simulation is an essential part of the testing method. It is the process of comparing the transitions and outputs of a fault free circuit and a circuit that contains one or more faults by simulating the functioning of such circuits. For a given sequence of vectors we compare the output generated of the fault free circuit to the circuit containing one fault at a time. Any fault that produces di erent output from the one of hte fault free circuit is thus detected by this sequence of vectors.
As input to our heuristics we supply the list of potential faults as the set of elements to cover, and their JED paths that detect them. For each JED and ED paths delivered by SIS we apply fault simulation to identify the entire collection of faults identi ed by that vector sequence. This collection is typically including many more faults than the one for which the sequence was generated.
The initial state is set to the reset state of the circuit. Then the subtours produced by the heuristics are subtours in the STG which de ne sequences of input vectors to apply to the circuit in order to detect the faults.
When implementing our heuristics an important point at which testing di ers from the Tour Covering model is the veering o phenomenon. If a fault which causes an incorrect next state transition is present in the circuit, the heuristics may not design valid tests. Consider for example a pair of faults fa; bg that are present so that in the presence of a the ED path devised for b veers o and still produces the correct output (while being at the wrong state). Similarly, the ED path devised for a veers o in the presence of b and still produces the correct output. Consequently, with both these faults present our tests are not valid -the circuit appears to be free of these faults, but it is not. This error results from the generation of the JED paths baseed on the assumption that only a single fault is present.
To protect against veering o we apply fault simulation ( PW]) to a tour while checking that in the presence on any potential fault there is no deviation from the correct state transition.
The alternative to fault simulation to check for veering o is to check after every state transition that the tested (physical) circuit is in the correct state. Performing such check is a costly process that reduces the speed and e ciency of the overall testing procedure. By comparison, the fault simulation is done in the procedure that devises the overall test, but does not a ect the time required for the actual testing.
The fault simulation tends to be more conservative in that it may determine that certain test are invalid, but in practice they are valid as the fault that causes the veering o is not present. As a result we may reject good sequences or subtours that can perform a valid test for a large collection of circuits. Nevertheless, in our empirical experience we observed that the tours generated never veered and hence no subtour was rejected due to veering o .
For the Chinese Postman Heuristic, all subtours produced in each iteration are fault simulated, so that the number of faults actually detected by the subtour is known before the best subtour is selected based on this information. With the Nearest Neighbor Heuristic, before each sequence is added to the tour the sequence is fault simulated to check that it detects the fault. Any other faults that it detects are removed from the fault list at this time. The Greedy Heuristic fault simulates each tour after it is constructed, also using Fault Simulation for the union of the faults covered to check that each fault which should be detected by the tour is indeed detected. Also, as with the Nearest Neighbor approach any additional faults which the tour covers are removed from the fault list.
In the greedy heuristic we can control the length of the tour generated. As mentioned before, the density of detected faults -which is the average number of faults detected per input vector -tends to be initially very high and even a short sequence of input vectors can cover a signi cant number of faults. After a test is found for a collection of faults these checked faults can be removed from consideration. Thus the number of additional faults detected by later sequences decreases as more faults are detected and removed from the set and the density goes down. For this reason we set for the Greedy Heuristic that initially it constructs tours from a small number of sequences. The length of the tours in terms of the number of sequences it contains is increasing as more tours are generated and more faults are detected. We chose to form the i th tour using i 2 sequences, thus setting N i equal to iter 2 in the Greedy Heuristic. As a result the tours produced initially, when the risk of veering o in the presence of almost all faults is high, tend to be short whereas later on when the number of remaining faults is small we can a ord to use very long tours without substantial risk of veering o .
In all of our heuristics it is necessary to nd shortest paths between states in the STG. Our code employs a breadth-rst search to nd these shortest paths, as edges in the STG are of unit length. For very large circuits our implementation does not enumerate the entire STG, due to memory and CPU time constraints. In this case paths found by breadth-rst search are not necessarily the shortest ones. For all the circuits used in the current study, which are of small size, complete graphs were generated.
Another issue concerns the transportation problem used in the CPP Heuristic for Tour Covering. To solve this problem, we used a capacity scaling minimum cost ow algorithm described by Ahuja, Magnanti, and Orlin AMO]. Such an algorithm, employing capacity scaling, is appropriate for our needs since it can e ciently handle networks in which the nodes have small imbalances, compared to the number of nodes in the graph.
Another implementation detail involves constructing subtours in the Chinese Postman Heuristic. If any subtour included in the nal test does not go through the reset state, it is necessary either to patch this tour to another subtour which does pass through reset, or to link this subtour directly to the reset state. After observing that paths connecting a subtour to reset typically had unit length, we simply linked each subtour directly to reset, when reset was not already included in the subtour. sequence per fault were also done using these heuristics. Results from both are presented.
Each table includes the number of input vectors each heuristic used and the number of faults it covered. The number of testable faults in the circuit, which is the number of faults less the number of redundant faults, is also given, as well as the percentage of testable faults the heuristic detected. Also included is the ratio of faults covered per input vector. When the random approach required more than 5000 input vectors, the table records 5000+ input vectors and the ratio is omitted. Results from the Chinese Postman Heuristic are labeled as CP, those from the Single-fault approach as SF, those from the random approach as RN, those from the Greedy Heuristic as GR, and those from the Nearest Neighbor Heuristic as NN. Runs for the Chinese Postman and Nearest Neighbor heuristics which considered one ED path are labeled as sing; those considering more than one are labeled as mult.
We also experimented with combining the heuristics in a multiple-stage approach to test generation. First we investigated using three stages, one for each new heuristic, and conducted two sets of runs, the rst set using the Greedy Heuristic in the rst phase, the Chinese Postman Heuristic in Phase II and the Nearest Neighbor Heuristic in the last phase. The second set used the Nearest Neighbor approach rst, then the Chinese Postman, with the Greedy Heuristic last. This ordering was inferior in terms of the quality of the results to the ordering using greedy rst. We also experimented using just the greedy and nearest neighbor (a two stage approach) and got some better tests using this approach. Results from these studies are presented in the next three tables. NN indicates the Nearest Neighbor Heuristic, and CP the Chinese Postman Heuristic.
We chose to use either the Greedy or Nearest Neighbor Heuristic in the rst phase since it is important that initial tours do not have a large number of input vectors, as just a small number detect a considerable percentage of the faults. The Greedy Heuristic is then appropriate for the rst phase since it constructs short initial tours. With the Nearest Neighbor Heuristic we can limit the length of a tour, cutting it o after a few sequences are added, thus keeping it short. The rest of the faults can then be tested in another tour, which starts at reset. We also observed that after most of the faults have already been detected, the Chinese Postman Heuristic tends to generate only one subtour which detects all of the remaining faults a multiple number of times, and therefore at high cost. For this reason we chose not to use the Chinese Postman Heuristic in the third phase.
Two sets of runs were performed with each circuit. In the rst set tours were constructed in Phase I until at least 30 % of the faults were detected. (A new tour was constructed in entirety in Phase I if less than 30 % had been detected at that point.) In the second set of runs tours were found in Phase I until at least 40 % were detected. When the rst phase used the Greedy Heuristic, the rst iteration alone often detected more than 40 % of the faults. Likewise, when Nearest Neighbor was run in the rst phase, the rst vector alone often detected more than 40 % of the faults. In this case the two sets of runs show the same percentage of faults detected in the rst phase. For both sets of runs, after the Chinese Postman Heuristic detected 30 % of the faults in the second phase, test generation switched to the third phase. In cases where this would result in both runs producing the same set of tours, the rst run switched to the third phase after detecting less than 30 %, in order that the two runs produced di erent sets of tours. The percentage of faults actually detected by the Chinese Postman Heuristic varied from circuit to circuit, depending on the subtours produced.
Next we experimented with a two-stage approach to test generation, using just the Greedy and Nearest Neighbor Heuristics. The rst set of runs used the Greedy Heuristic in the rst phase, while the second set considered the Nearest Neighbor Heuristic rst. The results for the rst ordering is presented in Table 3 .
In the column showing the number of input vectors used, asterisks appear next to an entry when the heuristic used the least number of input vectors among all heuristics for this circuit. Likewise, asterisks appear in the column showing the ratio of faults covered to number of input vectors when this ratio was greatest among all ratios for this circuit.
The Three Stage Approach using the Greedy Heuristic in the rst phase gave the best results for nine of the thirteen circuits, the Two Stage Approach for two circuits, including one tie with the Three Stage Approach, and each pure heuristic gave the best result for one circuit: the Greedy Heuristic for circuit s386; the Chinese Postman Heuristic for circuit cse; and the multiple excitation state Nearest Neighbor Heuristic for circuit s1238. In every case, the heuristic with the best performance in terms of the number of input vectors used also had the best ratio of faults covered per input vector for the circuit.
The success of the Three Stage Heuristic indicates that it is worthwhile to combine heuristics in a multiple-stage approach to test generation. Such a strategy takes advantage of the fact that many faults are easy to detect, covering them as inexpensively as possible at the start. In addition it avoids constructing long, costly tours at the end, when most of the faults have already been covered, yet still applies optimization techniques in the second phase, setting it apart from a pure Nearest Neighbor or Greedy Heuristic.
Since both the multiple-stage runs and the pure heuristics which were successful considered more than one excitation state per fault, it seems worthwhile to invest the computer time to locate additional excitation states and di erentiation sequences for faults before constructing test sequences. Note that the random approach used more than 5000 input vectors for all but three of the circuits, and for all but the rst, very small circuit, required more vectors than any other approach.
We observed that many states function as an excitation state for more than one fault, or function both as an excitation state for one fault and as the end state of a di erentiation sequence for another fault. Such overlapping makes it possible and even convenient to continue to append the test for the next fault to the end of the test for the current fault, indicating that constructing tours detecting more than one fault is a worthwhile approach. The results con rm this observation, as our heuristics indeed show a signi cant improvement over the Single-fault approach.
Notice that our heuristics allow the exibility of charging for di erent transitions di erent costs. In particular it may be of interest to allocate an additional charge to the reset 30 transition.
Conclusions and Future Work
We have presented new approaches to test generation, improving upon past e orts. When applied to sequential circuits, our heuristics construct sequences of input vectors which efciently check for the presence of circuit defects. Further, our methods are useful in that they exploit the knowledge of the physical plan of the circuit, and are thus more applicable to circuit manufacturing environments.
In expanding the work we plan to incorporate a new procedure for generating test vectors for faults at the third phase. The purpose is to create full coverage of all faults. At the start of the third phase, after our heuristics have generated tests for most of the faults, it is often di cult to nd short tests for the remaining faults. We may need to include additional features in our heuristics in order to test for this last batch of faults while still maintaining low average density of vectors per fault. One such prodcedure we consider is the technique of Product Machine Traversals reported by CHS].
We plan to investigate further the cuto point for switching between the heuristics. A preliminary study has shown that the results could be further improved by choosing the percentage of faults covered by each heuristsic properly. In order to have conclusive results the experiments will be run on very large circuits.
After researching these ideas, we plan to apply our method to real size circuits, and investigate modi cations that will be particularly applicable for such circuits.
An approach proposed but not fully investigated is the set covering/ crew scheduling model for testing. The exibility of crew scheduling problems in handling paths in the network render this model particularly attractive and promising for future research. 
