Research has shown that single stuck-at fault (SSF) Ndetect test sets are effective for detecting defects not modeled by the SSF model. Experimental results showed N-detect coverage is a good metric for determining test quality. In this paper, we examine the test quality of pseudo-random Built-In-Selj-Test (BIST) patterns by quantifiing the relations between their N-detect coverage and test length. We theoretically derive bounds on the minimum test length of pseudo-random patterns required to achieve a given N-detect coverage. For faults with high detectability, the expected test length for N-detection is around N times the expected test length for single detection. However, for faults with low detectability, the expected test length for N-detection can be NlogN times the expected test length for detecting the fault only once; this increases the test length significantly. We also introduce the idea of effective detectability which is important for analyzing the effectiveness of BIST techniques for detecting real defects.
Introduction
Built-in Self Test (BIST) allows an integrated circuit (IC) to test itself [McCluskey 851 . By embedding the test pattern generators, an IC can be tested at-speed to reduce the test time. In addition, timing-related defects and unmodeled defects are more likely to be detected. By embedding the output response analyzers, BIST eliminates the need for expensive test equipment to capture the output responses precisely and rapidly. Moreover, BIST can be used for testing the IC in the field.
Most BIST architectures use pseudo-random test pattern generators, such as linear feedback shift registers (LFSR), to perform test pattern generation. The quality of pseudo-random patterns is evaluated by calculating their stuck-at fault coverage. The relationship between the expected fault coverage and test length for psuedo-random patterns has been derived in [McCluskey 881. Defect-based testing is an emerging paradigm in VLSI test in which real production defects are targeted and test techniques are developed to detect these defects (rather than faults). For more details of the defect-based testing paradigm, the reader is referred to the lecture series in the proceedings of the International Test Conference, 2000 [ITC 001. However, very surprisingly, there is no publication which quantifies the effectiveness of BIST patterns in detecting *Sun Microsystems Menlo Park, California real defects. Almost all commercial tools and publications report the single stuck-at fault coverage to quantify the effectiveness of BIST patterns. In this paper, we address the problem of quantifying the defect coverage of pseudorandom BIST patterns.
Although the single stuck-at fault model is a widely used fault model, it is not accurate and does not model the behavior of production defects very well [McCluskey 001.
Test vectors generated using the stuck-at fault model do not necessarily guarantee the detection of all defective parts. In order to increase the defect coverage of the single stuckat fault test vectors, N-detect single stuck-at fault test vectors were considered. In an N-detect single stuck-at fault test set, each single stuck-at fault is detected by different test vectors at least N times, or the maximum number of times the fault can be detected. By detecting each stuck-at fault many times, the chance that other unmodeled defects (often called surrogates) are detected increases significantly [Pomeranz 981 . In the Murphy test chip experiment, all the 116 defective chips were detected by single stuck-at test sets that detect each single stuck-at fault 3 or more times; however, the best 100% stuck-at test set missed 2 defective chips [Ma 951 . It was later observed that all N-detect test sets have higher transition fault coverage than other 100% SSF test sets. In [Grimaila 991 , it was reported that the defect level was improved by 1288dpm, after applying N-detect single stuck-at test sets. Therefore, N-detect single stuck-at fault coverage is a better metric for defect coverage compared to the traditional stuck-at fault coverage.
In this paper, we examine the test quality of pseuclorandom test patterns using the N-detect coverage as the metric. We derive theoretically the bound on the expected length of pseudo-random test patterns for detecting a stuckat fault N times by different test vectors. We also present simulation results for several circuits. Our theoretical bounds and simulation results show that pseudo-random test patterns can achieve good N-detect coverage; however, under certain circumstances the test length can be very large. For detecting a single stuck-at fault N times, the expected test length may be between N and M o o times the expected test length for detecting the fault only once.
For example, to detect a fault 20 times, the test length ran increase to approximately 30 times the test length required to detect the fault only once.
As part of our analysis technique, we introduce the concept of effective detectability of a single stuck-at fault. This concept is extremely important while grading BIST patterns to quantify their defect coverage. This paper is organized as follows. Section 2 briefly describes why N-detect stuck-at test sets are effective for detecting real defects. In Sec. 3, we derive the theoretical bound on the expected length of pseudo-random test patterns required for detecting a given single stuck-at fault N different times. We also introduce the idea of effective detectability and discuss the implications of our results for BIST architectures and techniques in Sec. 3. Section 4 presents simulation results for several circuits. Section 5 concludes this paper. Suppose that a defect causes a resistive gate-to-source short (G-S short) in an NMOS transistor as shown in Fig.1 . For certain values of the resistance of the short, this G-S short can be detected only by the patterns (WXY) = (01 1) or (WXY) = (101). A 100% SSF test set will not detect the G-S short if it only consists of the pattern (WXY) = (001). However, in a test set in which each SSF must be detected at least three times, (also called a 3-detect single sruck-ar fault test set or SSF 3-detect test set), all three of these patterns must be included. Therefore, a SSF 3-detect test set is more likely to detect the G-S short of Fig. 1 compared to a 100% SSF test set, even though the G-S short can not accurately modeled as a SSF fault. 
Stuck-at Fault N-detect Tests
G-S short l W l X l Y l Z l z/1 I G-Sshort 1 0 1 0 I 1 I 0 I detect I escape 0 1 1 I 1 I 1 I detect I detect
Expected Length of N-detect Test Patterns for Pseudo-Random BIST
In this section, we derive upper and lower bounds on the expected test length of pseudo-random patterns for detecting a given single stuck-at fault N times by different test patterns. To model a realistic BIST environment, we assume that the number of vectors applied is much less than the number of all possible input combinations that can be obtained from the LFSR. This assumption is realistic and justified [McCluskey 881. As explained in [McCluskey 881, this assumption allows us to treat the BIST test patterns as random patterns although they are actually pseudo-random. In Appendix A, we derive the expected test length of pseudo random patterns for detecting a fault N times without these assumptions.
Suppose that we have an m-input combinational logic circuit C. Let M (= 2 ) be the total number of all input combinations that can be applied to this combinational logic circuit C. Consider a single stuck-at fault f in C. Let us suppose that the detectability of the fault f is k, i.e., k is the total number of input combinations that detect the fault f. We apply random patterns at the inputs of C and try to detectfby N different input combinations, where N 5 k.
The expected number of random patterns that we must apply to detect the fault f once is equal to -. This is because, the probability that the fault f will be detected by the first pattern is -; the probability that it will detected
by the second pattern is (1 --) -, and so on. Hence, the expected test length is
be a random variable which represents the test length to detect f using N different random patterns.
E[g(N)] is the expected value of g(N)
. We can write the following recurrence relation:
where p N = Intuitively, the above recurrence relation is true because, for detecting a fault using N different test patterns, we must first detect it with N-1 different test patterns; for the Nth detection, none of the previous N-1 test patterns can be used. The above recurrence relation can be solved as follows:
This is because, When the value of N is very close to the value of k, the following relation holds:
M k Note that, as discussed before, -is the expected test length for detecting the faultfonce.
The bounds derived in the previous paragraph can be explained intuitively. Since the patterns applied to the input of the circuit are random, some patterns may appear more than once before all possible patterns appear at the inputs. Suppose a fault is detected by a pattern which has appeared already; the number of times this fault is detected does not increase but stays the same. Hence, the expected test length for detecting a fault N times by di'fterenr test vectors is more than N times the test length for detecting the fault only once.
Let us look at an actual circuit to understand the significance of the above bounds.
Consider a combinational circuit with 14 inputs. Hence, M = 16,384. Let us suppose that there is a faultfin the circuit such thatf is detected by 96 input combinations. Therefore, k = 96. This is not an unrealistic example because, the ALU 181 circuit has 14 inputs and there is a fault which is detected by only 96 input combinations [McCluskey 881. The expected number of random patterns that we must apply to detect the faultfonce is equal to 171. Next, let us suppose that we want to detect this faultf 5 times (i. Next, consider the following scenario. Suppose that, out of the 14 inputs of the given combinational logic circuit, values on only 12 inputs structurally affect the excitation and propagation of faultf This means that, if we trace from the fault site o f f to the primary outputs in the given logic network and then trace back from the primary outputs to the primary inputs, then we find that the primary outputs that can be affected by fault f depend on only 12 primary inputs. This is shown in Fig. 2 . This means that, the values on the remaining 2 inputs do not affect the: detection of the fault f structurally. If we have a gate-to-.
source short created by a gate-oxide short or an open defect at the fault site off, we must be cautious and try to deteci: the fault f by applying different input combinations on the: 12 inputs that really structurally affect the excitation ancl propagation of faultf. Continuing from our earlier example, suppose that the detectability offis 96. In this scenario, we really have 24 (rather than 96) different choices of inpuf combinations to detect the fault f. Thus, the effecrive. detectability of the fault f for defects like gate-to-source shorts or opens at the fault site is 24.
Note that, defects like opens or gate-oxide shorts do not create extra dependence of any node in the circuit under test (CUT). Hence, the set of inputs of the CUT that affect the excitation and propagation of single stuck-at faults ancl contribute to the effective detectability of single-stuck-at faults for these detecting these kinds of defects can bt:
found easily by traversing the given network. In contrast, for defects like interconnect shorts (shorts among different nodes in the given logic network), the problem of calculating the effective detectability of single stuck-at faults is more tricky. This is because, defects like interconnect shorts can create extra dependence on input!; which cannot be predicted without the knowledge about the actual defect. This problem can be addressed by using inductive fault analysis tools to generate a list of bridging defects that are highly probable.
By now, it must be clear that the value of effective detectability of a single stuck-at fault f depends on the defect type under consideration. This is clearly a new approach and introduces a "defect" component in the fault grading of BIST patterns. Let I be the set of all binary combinations that can be applied to the inputs that structurally affect the excitation and propagation of fault f in a circuit C. The efective detectability, d, of the faultfis defined as the number of members of I that can detectf. If a combinational logic circuit has m inputs and the excitation and propagation of a fault f structurally depends on the values of only t inputs, then k = 2 d. In that case, our earlier equations are changed in the following way: The above recurrence relation can be solved as follows: have not yet studied which combination of BIST techniques is most effective and it is outside the scope of this paper. Although the results presented in this section are derived from theoretical calculations, we present simulation results in Sec. 4 to show that the above characteristics are true for real circuits in a pseudo-random BIST environment.
Our above analysis shows the relationship between the test lengths for single detection and N-detection of a single fault. However, in an actual BIST environment many single stuck-at faults have to be detected. Hence, the question is: what happens to the overall test length? Just like the case of single detection [McCluskey 881, for Ndetection, the overall test length is dominated by the expected test length for the hard-to-detect faults with very low values of detectability and effective detectability.
Simulation Results
In this section, we present simulation results for a couple CUTS on the Murphy chip and the ELF35 chip [Ma 95 ] [Li 991 . The Murphy chip is a 0.7um CMOS chip designed to evaluate the effectiveness of various test techniques. The ELF35 is a follow-up chip to the Murphy chip. It was manufactured in 0.35um technology and is currently being tested. Figure 3 shows the simulation flow. We first created the SSF dictionaries for all CUTs. Exhaustive patterns were applied to the CUTs to determine their detectability profiles. BIST test sets were generated by LFSRs with various initial seeds. Each BIST test set was then analyzed to count the number of times that each SSF was detected. 
Detectability Number of faults

Pseudo-Random to Binary Translator
This CUT translates a pseudo-random sequence into a binary sequence. There are 398190 SSFs in this CUT. There are many faults with low detectability. Table 2 shows the detectability profile for faults with detectability less than 10. Table 3 shows a set of pseudo-random test fault simulation results using different seeds for the LFSR. Each entry represents the average ratio of test length for Ndetection over I-detection over all faults of a given detectability value. The results in Table 3 show that the test length increases more rapidly than linear as N approaches k, the detectability fault. As discussed earlier, we expect the test length to increase at a rate greater than N as N approaches k. Table 3 validates the predicted bound.
For each entry, we show the bound calculated using our theoretical derivation in Sec. 3 within parenthesis. The upper bound was calculated using the actual expression for the expected test length for N-detection derived in Sec. 3. It can be seen that our theoretical predictions match the simulation results. 
Controller
This control circuit is part of a DMA controller. There are 7502 equivalent stuck-at faults in this CUT. The results in Table 4 also show that the test length increases linearly with the number of detections. This is because the number of detections, 1,3, and 5 , are all small comparing with the detectability, 12, 20, 50, and 100. For each entry, we show the bound calculated using our theoretical derivation in Sec. 3 within parenthesis. The results match our theoretical predictions.
Number of faults
Seed 1
Conclusions and Future Directions
In this paper, we analyzed the effectiveness of pseudo random testing for detecting real defects by using the concept of single stuck-at fault N-detect coverage as a metric to determine the quality of the pseudo-random test patterns. We introduced the idea of effective detectability and theoretically derived bounds on the minimum test length of random or pseudo-random patterns required to achieve a given N-detect coverage. For faults with high values of effective detectability, the expected test length for N-detection is around N times the expected test length for single detection. However, for faults with low effective detectability, the expected test length for N-detection can be MogN times the expected test length for detecting the fault only once; this increases the test length significantly. possible M = 2" patterns, i.e., the LFSR has been modified to include the all-0 pattern.
Theorem 1: The probability that a fault of detectability with k is detected exactly x times ( x s , xSk) in a pseudorandom test sequence of length L is
Proof: In pseudo random testing, each test vector is chosen with equal probability out of a "pool" that initially contains Therefore, their ratio is the probability that the fault is detected exactly x times in the pseudo-random test sequence.
Q.E.D.
Theorem 1 shows that the number of times that a fault is detected in a pseudo-random sequence is a hyper-geometric random variable. The next theorem derives the expression of the expected length of a pseudo-random sequence for detecting a fault N times.
Theorem 2: For a fault of detectability k (k>=N) the probability that a fault is not detected N times until the L-th vector in a pseudo-random sequence, i.e., the L-th pattern is one of the N detecting patterns, is Proof: The first term is directly adopted from Theorem 1 with x=N. It represents the probability that the fault is detected N times in a pseudo-random sequence of length L.
c;xc,M_-,k N
CL" L
X-
It is the sum of the probabilities of two excluding events:
1. the L-th pattern is a detecting pattern and 2. the L-rh pattern is not a detecting pattern. It is the probability of event 1 that we are trying to derive. The percentage of event 1 is the conditional probability that the L-th pattern is a detecting pattern when there are N detecting patterns in the L patterns. The conditional probability is NIL. Q.E.D. Combining Thm 1 and 2, the expected length E(L) of a pseudo-random sequence for detecting the fault N times is E ( L ) = C P r ( L e n g t h = L ) x L = L
