Abstract. We present a new technique for uniquely identifying a single failing vector in an interval of test vectors. This technique is applicable to combinational circuits and for scan-BIST in sequential circuits with multiple scan chains. The proposed method relies on the linearity properties of the MISR and on the use of two test sequences, which are both applied to the circuit under test. The second test sequence is derived from the first in a straightforward manner and the same test pattern source is used for both test sequences. If an interval contains only a single failing vector, the algebraic analysis is guaranteed to identify it. We also show analytically that if an interval contains two failing vectors, the probability that this case is interpreted as one failing vector is very low. We present experimental results for the ISCAS benchmark circuits to demonstrate the use of the proposed method for identifying failing test vectors.
Introduction
As feature sizes of integrated circuits (ICs) shrink and designs increase in complexity, built-in self-test (BIST) and scan design are being increasingly accepted as industry-wide test solutions [6] . The combination of scan design and BIST, commonly referred to as scan-BIST, is now especially common. A scan-BIST technique typically applies a large number of pseudorandom patterns to the circuit under test via internal scan chains. The responses are then captured by the scan chains and compacted to a short signature in a multiple-input signature register (MISR) [3] . The properties of MISRs and their effectiveness for response compaction have been studied extensively in the literature [12, 18, 21, 25, 28] . However, a drawback of this approach is that the compact signature provided by the MISR provides only limited information for fault diagnosis, either to determine (failing) test vectors that produce errors on observable outputs or to identify errorcapturing (failing) scan cells.
Fault diagnosis is essential not only for the identification of manufacturing defects, but also for yield learning. In addition, fault diagnosis during silicon debugs is useful for design error detection. The cost of diagnosis is usually determined by the total time required for test application, identification of failing vectors and scan cells, fault localization, and failure analysis. For a scan-BIST scheme involving millions of vectors and tens of thousands of scan cells, the diagnosis and failure analysis time can be extremely high. Due to the large numbers of test vectors and scan cells, it is often difficult to determine a small set of failing vectors and failing scan cells that can be used for failure analysis. The diagnostic accuracy provided by a typical scan-BIST scheme is therefore too low to be useful in practice [26] . Hence, there is a need for scan-BIST schemes that offer high diagnostic accuracy, i.e., they provide a small set of failing vectors and failing scan cells, without requiring excessive test application time, processing time, and additional on-chip hardware.
Diagnosis in a scan-BIST environment can be broadly classified as either time diagnosis or space diagnosis. Time diagnosis refers to the identification of the set of failing testing vectors in a BIST session. Space diagnosis refers to the identification of failing (error-capturing) scan cells. A number of space diagnosis techniques have been proposed in the literature [4, 5, 23, 29] . These techniques provide a small set of candidate failing scan cells out of the tens of thousands of scan cells in large ICs.
A more difficult problem in scan-BIST diagnosis is that of identifying the set of failing vectors. This is because the number of test vectors applied to the circuit under test in a typically BIST scheme is usually much larger than the length of a scan chain. As a result, even fewer practical techniques are available today for rapidly identifying a small set of candidate failing vectors. Early work on failing vector identification was based on the analysis of LFSR sequences [20] , and the use of cycling registers [24] and error-correcting codes [11, 17] . An alternative approach that does not require intermediate signatures was presented in [1] . These techniques suffer from several drawbacks, including limitations on error multiplicity and restriction to a single-bit response stream [20] , diagnostic aliasing [11, 17, 24] , and high overhead [1, 11] . A promising matrix-based approach for identifying failing vectors is presented in [7] . However, it makes several simplistic assumptions, e.g., it limits the number of erroneous channels driving the MISR and the number of errors per MISR channel. Recently, a method based on the combination of cycling registers and pruning techniques was proposed for failing vector identification [15] . While this approach is useful for narrowing down the set of candidate failing vectors, it suffers from the drawback that it cannot identify all failing vectors.
An elegant analytical technique for identifying failing vectors is described in [13] . This method takes advantage of the availability of efficient public-domain numerical software tools; the computation time for this method can be reduced through the use of appropriately-chosen signature register polynomials. A potential drawback of this approach however is that it may identify a large number of candidate failing vectors. The principle of superposition is first used in [8, 9] for identifying failing vectors. Failing vector identification is also the focus of [14] , where a MISR and a second device consisting of T-flip-flops are together used to determine the location of a single error vector in a stream of n response vectors of m bits each, as well as the m-bit error vector. While the MISR generates a signature based on signature analyis, the second output compactor consists of T-flip-flops that are not connected to each other. Since only a single test vector is assumed to fail, the modulo-2 difference between the signatures of the T-flip-flops for the correct test response and the erroneous response is equal to the error vector E corresponding to the test response sequence. To determine the position of the failing vector in the test sequence, the initial state of the m-bit MISR is chosen to be the state that is reached after the test response stream of n m-bit vectors is compacted, followed by a sequence of 2 m − 1 − n m-dimensional all-0 vectors provided as input to the MISR. With this initial state, the state E is reached after N cycles of the all-0 vector is applied to the MISR, where N denotes the position of the failing vector in the test sequence.
In other related prior work, erroneous frames of scan cells can be identified [30] , where the ith frame consists of the ith scan cells of all N scan chains. The scan chains are connected to two MISRs, one of which is programmable, and in every repetition of the test set, the programmable MISR has to implement a different primitive polynomial. In order to identify up to t erroneous frames, 2t signatures for the complete test set are needed. In addition, to localize the failing scan cells, every scan chain must be considered separately. For a large number of scan chains, this approach imposes time overhead, as well as area overhead due to the need to modify the scan-BIST architecture.
A recent technique for failing vector identification relies on the use of overlapping intervals of test vectors [19] . An interval is a set of consecutive test vectors. A separate BIST signature is determined for each interval to determine is pass/fail status. An advantage of this approach is that all failing vectors are included in a reduced set of candidate vectors for failure analysis. The overlap allows effective pruning the set of candidate failing vectors with low hardware overhead. An interval that does not contain a failing vector can be omitted from the set of candidate failing vectors. Despite these advantages, the interval-based approach of [19] suffers from the drawback that a larger number of non-failing vectors are often included in the set of candidate failing vectors.
In this paper, we present a new technique for uniquely identifying a single failing vector in an interval for combinational circuits and for sequential circuits with multiple scan chains. As in [14] , this approach is based on an algebraic analysis of the signatures captured in the MISR during a BIST session. The algebraic analyis is based on the well-known theory of shift-register sequences as described in detail in the literature [3, 16, 27, [31] [32] [33] . Instead of demonstrating how to apply this methodology in practical situations, the emphasis of this paper is on the theory underlying the method. The analysis relies on the linearity properties of the MISR, and on the use of two test sequences, where the second test sequence is derived from the first in a straightforward manner. The same test pattern source is used for both test sequences. If an interval contains only a single failing vector, the algebraic analysis is guaranteed to identify it. In addition, we show that if the interval contains two (or more) failing vectors, the probability that this pair of failing vectors is misinterpreted as a single failing vector is extremely low. The proposed method can be used with a standard STUMPS-like BIST architecture.
The proposed approach is based on a classification of faults on the basis of their ease of detection using pseudorandom patterns. This approach is not intended for the "easy faults" that are detected by a large number of test patterns. The benefit of the proposed approach for such faults is limited because they require very short intervals. The focus here is on faults that cause only a few failing vectors within an interval. The identification of these failing vectors and the underlying faults imposes an onerous burden on the failure analysis engineer. We show through simulation experiments that the ISCAS benchmark circuits contain a large number of such single stuck-at faults, therefore the proposed analysis technique for identifying single failing vectors within an interval can reduce diagnosis time significantly. Once a failing interval is identified, the test vectors in the interval can be either applied from the tester or from a BIST pattern source. A set of appropriate interval lengths can be determined through preprocessing on the set of faults that are targeted by this approach. In addition to facilitating effect-cause analysis, the proposed approach can also benefit cause effect analysis [10] . For example, the knowledge of candidate failing vectors can improve the resolution provided by a compact fault dictionary [22] .
An inefficient approach to identify a failing test vector in a failing interval is to apply the vectors in that interval, and bypass the MISR and download all the test responses captured in the scan chains. To see what this entails, consider a design with 500 scan chains and 200 scan cells per scan chain. If the interval contains 200 test vectors, the total volume of test response that must be used to identify a failing test vector is as large as 20 Mbits. Even though diagnosis is done off-line, this amount of test data volume for a typical design imposes a severe burden on ATE memory for on-tester diagnosis, and workstation memory for off-tester diagnosis. In the proposed method, the failing vector is identified by examining only the MISR signature at the end of the test session, and even though the test patterns are applied twice (as explained in the next section), the volume of test response data is only 1 Kbits.
The organization of the paper is as follows. In Section 2, we describe the algebraic technique for identifying failing vectors in an interval for combinational circuits. In Section 3, we extend the algebraic analysis to a full-scan BIST environment with multiple scan chains. In Section 4, experimental results are provided for the ISCAS-85 and the ISCAS-89 benchmark circuits. Finally, Section 5 presents conclusions.
Failing Vector Identification Using MISR Signatures
In this section, we describe the algebraic analysis technique that allows us to determine the failing vector in an interval from the signature generated by the MISR. We consider combinational circuits here, and extend the analysis to full-scan designs in the next section. Suppose a sequence of test vectors t 1 , t 2 , . . . t n is applied to the circuit under test; see Fig. 1 . (The masking logic is described later.) We refer to this sequence of k-bit vectors as an interval of length n. Let the corresponding sequence of m-bit test response vectors be y 1 , y 2 , . . . y n , and let the MISR signature for this response sequence be denoted by
Let the initial state of the MISR be 0, the all-zero vector.
Next, suppose that a fault ϕ in the circuit under test changes the fault-free response y i to the faulty response y i ⊕ e i , where e i is the error vector. Since we are assuming that only one test vector fails in an interval, all the other test response vectors are fault-free, i.e. e j = 0, j = i. The erroneous signature corresponding to an error for the ith test vector, denoted S(1 → n, i, e i ) is given by
where S 1 (i, e i ) is the difference between the MISR signature obtained for the fault-free circuit and the MISR signature for the faulty circuit with the test sequence t 1 t 2 . . . t n . Let the state transition matrix of the m-bit MISR be A. For the simplicity of presentation, we assume here that the MISR has a primitive feedback polynomial and that the inverse A −1 of the state transition matrix A exists. The next-state equation of the MISR, as described in [3, 16] , is given in general terms by
where z(t), z(t + 1) and y(t) are m-bit vectors. Note that for notational convenience, we are using both y(t) and y t to refer to the m-bit test response vector at time instant t. Therefore, we have
Next we apply every test pattern twice to the circuit under test, i.e., the test sequence is now
n . This can be implemented by clocking the circuit twice as fast as the test pattern source. The mask control signal generates the alternating sequence 1010 . . . 1010. For this repeated sequence of test vectors, the fault-free response of the circuit under test is y 1 y 1 y 2 y 2 . . . y n y n , and the input sequence to the MISR is y 1 0y 2 0 . . . y n 0. The corresponding MISR signature is S(y 1 0y 2 0 . . . , y n , 0) = S (10 → n0) .
If the test vector t i fails due to a fault in the circuit, the test response of the circuit is y 1 y 1 y 2 y 2 . . . y i−1 y i−1 y i ⊕ e i y i ⊕ e i , y i+1 , y i+1 , . . . , y n y n , and the corresponding signature is 
Combining Eqs. (2) and (3), we get
which implies that
In Eq. (4) above, A is known for the MISR, and the observed differences in MISR signatures S 1 (i, e i ) and S 2 (i, e i ) for the two test sequences are also known. Thus the value of i can be easily determined from this equation. Once i is determined, it can be used to determine e i . Note that under the assumption that the MISR has a primitive feedback polynomial, we can always use the state space of the MISR to determine for arbitrary values of S 1 (i, e i ) and S 2 (i, e i ) a value of i, 1 ≤ i ≤ 2 n − 1, such that Eq. (4) is satisfied. However, here we restrict i to lie between 1 and n, i.e. 1 ≤ i ≤ n, since we are interested in the failing vector within the interval that produced the error. If no solution exists, we conclude that the erroneous signature in this case cannot be attributed to the presence of exactly one failing vector in the test sequence. Instead, the erroneous signature in this case must have been produced by more than one failing vector. In this way, we not only determine the failing vector, i.e., the location of the error in the test response sequence, but we also identify the error e i , i.e., the outputs of the circuit under test that are in error.
Equation (4) suggests that matrix multiplication and inversion operations are necessary to determine i and e i . This is however not necessary, and the computation can be speeded up significantly if we utilize the nextstate and previous-state functions of the MISR [3, 16] . From the feedback structure of the MISR, we can easily determine the next state and previous state for every flip-flop in the MISR. Next, we set S 2 (i, e i ) to be the initial state of the MISR, and depending on the value of the index i −n −1, we calculate an appropriate number of next states or previous states. For example, if i = n, we calculate the previous state; on the other hand, if i = 1, we calculate the nth previous state and check to see if it equals S 1 (i, e i ).
We now illustrate the failing vector identification procedure using a small example as shown in Fig. 2 . The circuit under test has four inputs x 1 , x 2 , x 3 , x 4 and three outputs y 1 , y 2 , y 3 . The outputs are connected via masking logic to the three parallel inputs of the MISR. The MISR has three stages and it has a primitive feedback polynomial. The circuit under test has three gates numbered 1 to 3.
Consider a stuck-at-0 fault on the line connecting the output of Gate 1 to an input of Gate 2. under test, as well as the accumulated signatures are shown in Table 2 . The accumulated signature for the fault-free circuit is 101 and the accumulated signature for the faulty circuit is 100. We therefore obtain S 2 (i, e i ) = 101 ⊕ 100 = 001. According to Eq. (4), the index i is determined with n = 5 as follows: Instead of multiplying the vector 011 repeatedly by A, we can use the simple shift register equations for the state components under input 0. The state successor equations are:
With the initial state (110), we easily compute the state sequence (110), 111, 011, 001 and for 6 − 3 = 3, the states (001) T and A 6−i (110) T are equal, and i = 3 is uniquely determined.
Next we compute the error vector e 3 , which is determined as e 3 = A 3−5 S 1 (3, e 3 ) = A −2 (110) T . Instead of matrix multiplication, we can again use the shift register equations for the state components of the MISR.
The state predecessor equations are:
Recall that failing vector identification using the above procedure is guaranteed if a single test vector produces an error. We next determine the condition under which the case of two failing vectors is incorrectly diagnosed as a single failing vector. 
Since k and e k are known from the diagnosis procedure, we now determine e i and e j using the following equations:
Since the feedback polynomial of the MISR is assumed to be primitive, the matrix
n − 1 is invertible and for fixed i and j, the error e j is determined uniquely from (6) and the error e i is then determined uniquely from (5). For each of the (
) pairs of i and j, 1 ≤ i < j ≤ n, i, j = k, the corresponding error vectors e j and e i are uniquely determined by the Eqs. , where m denotes the number of outputs in the circuit under test. If all errors are equally likely, the probability p 2→1 that a pair of failing vectors is misinterpreted as a single failing vector is given by
Although the assumption that all possible errors are equally likely is not realistic in practice, we conclude that it is extremely unlikely that two failing vectors will be misinterpreted as a single failing vector using this technique. If a fault is easy to detect by pseudorandom inputs, then even in short intervals we can expect several failing vectors. Therefore, we apply the proposed methods only to faults that are not easy to detect using pseudorandom patterns. for this interval. If no solution is found to exist with 1 ≤ i j ≤ l, we ignore this interval and conclude that a higher-positional error occurred. If there is a solution with 1 ≤ i j ≤ l, we assume that a one-positional error occurred since the probability that a higher-positional error is misinterpreted as a one-positional error is extremely low. 
Failing Vector Identification for Scan-BIST
In this section, we extend the analysis of Section 2 to scan-BIST. We assume that the circuit under test has L scan chains, each of length M, that drive an L-bit MISR; see Fig. 3 . To fill the L scan chains of length M with a test pattern X i , the test pattern generator TPG generates L pseudorandom sequences 
and
respectively. The state of the MISR is denoted by z = z 1 z 2 . . . z L . According to (8) and (9) 
Thereby we assume that only the test response Y i ⊕ E i under the test input X i is erroneous. For all the other test inputs X j , j = i, the corresponding test responses Y j , j = i are assumed to be correct. The error E i , i.e., the difference in the test responses for the test pattern X i can be represented in matrix form as follows:
Since the MISR is a linear automaton with the linear state transition function (1) and with the initial state 0 we have:
where 0 denotes an M by L matrix of all 0s.
Next, we apply a second, modified test sequence to the circuit under test, derived in a straightforward fashion from the first test sequence. The goal here is to provide an all-0 L dimensional column vector (0) input to the MISR for one clock cycle immediately after each test response Y i , 1 ≤ i ≤ n, is shifted in M clock cycles into the MISR. The second test sequence can be easily generated by use of the same test pattern source as for the first test sequence. The masking logic is used to provide every M clock cycles an additional 0 input to the MISR and at the same time, freeze the scan clock using simple gating logic. This is done immediately after the capture cycle of the scan operation.
The signature of the MISR for the second test sequence is given by 0, 0, . . . , E i , 0, . . . , 0, 0) .
We now assume that if the input E i is applied to the MISR with initial state 0, the MISR is driven to the state z(E i ). This implies that the first test sequence give us
. From the second test sequence, we get
Hence we conclude that
From Eq. (10), we can easily determine i, i.e., the failing vector in the BIST sequence. An admissible value of i must lie between 1 and n. As in Section 2, it is not necessary here to carry out matrix multiplication and inversion in order to determine a value of i that satisfies this equation. Instead, we can use the next-state and previous-state functions to reduce the amount of computation. We consider the MISR with initial- state  S(0, 0, . . . , 0, 0, E i , 0, 0, 0, . . . , 0, 0) and determine if the state S(0, . . . , 0, E i , 0, . . . , 0) can be reached for different values of i.
As in the case of a combinational circuit under test we next ask under what conditions can a fault ϕ in a full-scan sequential circuit, which is identified by the proposed method as a one-positional error under the test pattern X i , is actually a two-positional error under the test patterns X r and X s .
If the two errors E r and E s under the test patterns X r and X s are erroneously interpreted as a single error E i under the test pattern X i then   S(0, . . . , 0, E r , 0, . . . , 0, E s , 0, . . . , 0)   = S(0, . . . , 0, E i , 0, . . . , 0) , which implies that
, and
have to be valid. The error z(E i ) and the error position i are determined by the proposed diagnosis method, and in the following discussion these values are assumed to be known. We take into account the fact that for a MISR with a primitive feedback polynomial, the matrix (I ⊕ A m ), 1 ≤ m ≤ 2 L − 1 here, is invertible. It is then easy to show that for every pair r, s, with 1 ≤ r, s ≤ n, z(E r ) and z(E s ) are uniquely determined from z(E i ) and i. Thus, if a one-positional error E i under the ith test pattern X i is identified by the proposed method, a total of ( n−1 2 ) two-positional errors can be erroneously interpreted as this one-positional error. The number of all possible two-positional errors is (
where L is the number of scan chains and M is the number of scan cells in a scan chain.
If we assume that all the possible two-positional errors are equally likely then the probability p 2→1 , that a pair of failing vectors is misinterpreted as a single failing vector is given by
which is very small for realistic values of L and M. In practice, the assumption that all the two-positional errors are equally likely is unrealistic. Nevertheless, we expect the the probability that a two-positional error is erroneously interpreted by the proposed method as a one-positional error to be extremely small. Based on these results, we are able to identify a failing vector for a fault ϕ in the circuit under test in the following way.
We consider R test intervalsn X 
Experimental Results
We first present experimental results for the ISCAS-85 combinational benchmark circuits. We identified a set of faults for each circuit that are not easily detected by a small number of pseudorandom patterns. These faults are of interest to us and the proposed method is targeted toward them. In order to eliminate the easy todetect faults that are not of interest to us, we applied 100 pseudorandom vectors to the circuit under test as a pre-processing step. We only retained those faults that were not detected by these patterns. We injected these faults into the circuits one at a time, and collected the MISR signatures in each case for various interval lengths, varying from 100 to 500, and for a total of 1000 pseudorandom patterns. The algebraic analysis technique was then applied to the differences in the MISR signatures. If a one-positional error occurs in at least one of the intervals, the corresponding failing test pattern (and the corresponding error vector in case of a combinational circuit) can be uniquely determined.
The results are shown in Table 3 . Columns 1 and 2 list the name of the circuit and the number of faults that are not detected by the first 100 (pre-processing) pseudorandom test patterns. Columns 3, 4, 5, 6, 7 list the number of hard-to-detect faults (out of those listed in Column 2) that cause exactly one failing vector in at least one interval. These are the faults for which the failing test vector and the corresponding error vector can be diagnosed by the proposed method. Also listed are the percentages of the hard-to-detect faults that are correctly diagnosed by the proposed method. For an interval length of 100, we can uniquely identify the failing vector, which we refer to as the diagnosis of the corresponding fault, for over 90% of the hard-todetect faults. In many cases, we can diagnose over 95% of these faults. This percentage drops as expected for longer intervals; nevertheless, we can diagnose a significant number of faults even for an interval length of 500.
The experimental results for the full-scan versions of the ISCAS-89 circuits are shown in Table 4 . For these circuits, only the failing vector in an interval can be identified; the corresponding error vector cannot be obtained from the algebraic technique presented in Section 3.
In our experiments, we do not need to explicitly to model the scan chains. We apply a sequence of 1000 test patterns to the circuit under test. If the circuit is combinational, we apply these test patterns to the inputs of the circuit and we observe the outputs of the circuit.
In the case of a sequential circuit with one or more scan chains, we apply the pseudorandom test patterns to the primary inputs and to the secondary inputs of the combinational part of the circuit, and we observe the corresponding primary and secondary outputs.
As discussed in Section 2, the proposed method requires very little hardware overhead. No modification is made here to the traditional STUMPSlike BIST architecture. The overhead is limited to m 2-input AND gates and a small finite-state machine that generates an alternating sequence.
Conclusions
We have presented a new technique for uniquely identifying a single failing vector in an interval of test vectors. This approach is based on an algebraic analysis of the signatures captured in the MISR during a BIST session, and it is aimed that faults that are not detected by a small number of pseudorandom test patterns. The analysis relies on the linearity properties of the MISR, and on the use of two test sequences, where the second test sequence is derived from the first in a straightforward manner. The same test pattern source is used for both test sequences. We have shown algebraically that if an interval contains only a single failing vector, the analysis technique is guaranteed to identify it. In addition, we have shown through a probabilistic analysis that if an interval contains two failing vectors, the probability that this pair of failing vectors is misinterpreted as a single failing vector is extremely low. Experimental results for the ISCAS benchmark circuits show that the proposed approach can be efficiently used to uniquely identify failing vectors in a BIST environment.
Michael Goessel is a full professor of computer science and the head of the fault-tolerant computing group at the University of Potsdam in Germany. He is the author and coauthor of numerous papers in the areas of non-linear systems, parallel memories and fault tolerant systems design, including the books "Memory-Architecture and Parallel Access" (together with B. Rebel and R. Creutzburg), Noth Holland 1994, and "Error Detection Circuits" (together with S. Graph), McGraw-Hill, 1993 
