Abstract We present a very effective on-line interconnect built-in-self-test (BIST) method I-BIST for FPGAs that uses a combination of the following novel techniques: a track-adjacent and a switch-adjacent (also called "mirror adjacent") pairwise net comparison mechanism that achieves high detectability, a carefully designed set of only five net-configurations that cover all types and locations of wire-segment and switch faults, a 2-phase global-detailed testing approach, and a divide-and-conquer technique used in detailed testing to quickly narrow down the set of potential suspect interconnects that are then detail-diagnosed. These techniques result in I-BIST having provable detectability in the presence of an unbounded number of multiple faults, very high diagnosability of 99-100% even for high fault densities of up to 10% that are expected in emerging nano-scale technologies, and much lower test times or fault latencies than the previous best interconnect BIST techniques. In particular, for application to on-line testing, our method re-quires £ ¥ ¤ roving-tester (ROTE) configurations to test an entire ¤ § ¦ ¤ FPGA, while the previous best online interconnect BIST technique re-quires ¤ © configurations. Thus, I-BIST is an order of magnitude more time-as well as power-efficient, and will scale well with rapidly increas-ing FPGA device sizes that are expected in emerging technologies.
Introduction
An FPGA consists of an array of programmable logic blocks (PLBs) interconnected by a programmable routing network, consisting of wire-segments and switches, which we refer to as interconnects. Current as well as future double and single-digit nanometer FPGAs are susceptible to permanent or hard faults, both during fabrication and field operation. In order to isolate these faults using the re-programmability feature available in many FPGAs, each fault needs to be detected as well as diagnosed, i.e., located. Built-in self-test (BIST) approach to test FPGAs, wherein the test circuit is placed within the FPGA, has gained popularity due to its advantages of at-speed and low-cost testing. There are two types of BIST techniques, offline and online. In offline testing, the FPGA is tested while no application circuit is executing on it, while in online testing, the FPGA is tested with the application circuit mapped to and executing on it. In applications such as life critical medical appliances, space hardwares and remote installations, where the circuit mapped on the FPGA needs to run continuously with minimal or no interruption, online testing is a must [11] . Figure 1 (a, b) shows how online testing is performed by testing only a small part of the FPGA at a time using a ROving TEster (ROTE). Each ROTE comprises of multiple built-in selftesters (BISTers) (Fig. 1[c] ), each formed of a test pattern generator (TPG), which applies test vectors to two identically configured cells under test (CUTs) in the case of PLB testing, and two sets of wires-under-test (WUTs) in the case of interconnect testing. The outputs of the CUTs/WUTs are then compared by
This work was supported in part by NSF grant CCR-0204097.
an output response analyzer (ORA), that checks for any discrepency in the two outputs, indicating a presence of fault(s). 
columns) is under test using a ROTE, while the remaining part of the FPGA performs the circuit function. Once testing of this part is completed, the ROTE is reconfigured to test another part of the FPGA, as shown in (b). (c) A Built-in self-tester (BISTer) tile.
Since interconnects occupy approximately 90% of the FPGA area, the chances of faults occuring in interconnects are much greater than that for PLBs. However, although there are many BIST techniques for PLB testing in FPGAs ( [4, 6, 12, 13, 17] are some recent ones), there are very few BIST techniques for FPGA interconnects [1, 2, 8, 9, 14, 15] . Table 1 enumerates the salient features of these interconnect BIST techniques 1 . As implied by the table, none of the existing interconnect BIST techniques possess provable detectabilty in the presence of multiple faults, which can be quite frequent in emerging FPGA technologies. Thus none of these techniques are effective for the test and diagnosis of interconnect faults in nano-scale FPGAs.
In this paper we present 
Interconnect-BIST (I-BIST) Technique
Before we explain our I-BIST technique, we discuss which interconnects will be tested under any ROTE area in online testing.
ROTE configuration
FPGAs have wire-segments of lengths one (single), two (double) and j (long), spanning both horizontally as well as vertically across the FPGA. A ROTE normally occupies n or o columns/rows in the FPGA. Hence to test both horizontal and vertical long lines we require a Horizontal-ROTE (H-ROTE) as well as a Vertical-ROTE (V-ROTE) [3] . Figure 3 (a) shows the interconnects (wire-segments and switches) that will be tested under a particular V-ROTE position. As shown in Fig. 3(a) , each V-ROTE area includes three channels (a single vertical or horizontal line in Fig. 3(a) represents an entire channel 2 ). All vertical wires of these three channels plus the single and double length horizontal wires under the ROTE area will be tested for faults. All the switches in the middle channel (channel n ) will also be tested. For channels , and, the horizontal long wires are tested in H-ROTE. We refer to the set of interconnects tested under a particular V-ROTE (H-ROTE) position as the V-set (H-set). Also, we refer to the set of v 'th wiring tracks in each of the three vertical channels under the ROTE area along with their associated horizontal wire-segments and switches, as track-set v . We next discuss the general idea of the I-BIST technique. I-BIST is discussed in this paper for single-length track segments but is easily extended to multiple-length track segments.
Main Concepts of I-BIST
The main objectives of an effective BIST technique are: 1. To maximize diagnosability, even in presence of multiple faults. 2. To reduce test time (i.e., fault latency), and especially the dominant component of test time, the configuration time, i.e., the time to set LUT or switch configuration bits. Hence the main objective is to minimize the number of configurations required for testing interconnects. The secondary objective is to reduce the number of test vectors, used in each configuration. The I-BIST technique achieves these objectives using a twophase test-and-diagnosis scheme, wherein, in the first phase, 2 A channel is a set of tracks, each formed of multiple wire-segments and switches, spanning either horizontally or vertically across the entire FPGA. global testing, the possible fault locations are isolated to a small set of interconnects, termed the suspect set, using very few configurations. In the second phase, detailed testing, the actual faulty interconnects are identified from among the suspect set. We next discuss the basic concepts for testing interconnect faults that we use in both the testing phases. In our approach, nets formed of multiple interconnected wire-segments and switches belonging to the same track-set, are configured ( Fig. 4[a] ). For the global testing phase the nets are configured to span the entire length of the FPGA in the V-ROTE area (entire width of the FPGA in the H-ROTE area), while in detailed testing, which is composed of a series of more specific targeting of smaller interconnect sets, the nets span smaller sub-lengths of the FPGA. Two adjacent nets act as a WUT pair and are tested by passing n -bit test vectors from the TPG. Their outputs are compared using an ORA configured as a n -bit XOR. We now discuss how each type of interconnect fault, present in a WUT pair, are detected by our approach. Note that complementary-bit test vectors
applied to a fault-free WUT pair will result in the ORA output of , while for identical-bit test vectors
, the ORA output will be . 
(or °
) are required to identify all possible stuck-at fault patterns in 
Figure 4: Basic mechanism of the I-BIST technique. (a) and (b) The two stages of testing,TPG-top and TPG-bottom, required for detecting bridge, stuck-at and stuck-open faults (combinedly referred to as non-SSC faults). (c) Mirror net pair testing required for switch-stuck-closed (SSC) fault. (d) Spanning switches between nets

Stuck-open fault:
A bridge or stuck-at fault in a WUT pair will result in an erroneous value on the entire net(s) in the WUT pair, irrespective of the location of the fault. However, a stuckopen fault in either a wire-segment or a switch will only be detected if that wire-segment or switch is present on the part of a net between the TPG and the corresponding ORA, because unlike stuck-at or bridge faults, a stuck-open fault will not result in an erroneous value on the entire net but will only isolate the portion of the net that lies between the point of its occurrence and the sink of the net. E.g., in Fig. 4(a) , a stuck-at-S fault on wire-segment T ³ will result in the entire net . Stuck-open fault(s) present on either or both the nets in the WUT pair will result in an incorrect ORA output for at least one of the two test vectors & and in at least one of the two stages, and hence will be detected. We will refer to the two stages of testing as TPG-top and TPG-bottom.
Switch stuck-closed (SSC) fault: Unlike the above mentioned faults, an SSC fault cannot be detected by including the corresponding switch on a net, as the switches included on any net are deliberately kept closed. The only fault a stuck-closed switch can produce is that it will bridge the two wire-segments that it spans (i.e., can connect). Thus, forming nets within a channel, as shown in Figs. 4(a, b) , will not aid in detecting SSC faults, since no two nets in the same channel are spanned by a switchbox swich. In order to detect SSC faults we introduce the concept of mirror nets. Mirror nets are mirror-image nets in different channels that share at least one common switch box and are spanned by an open switch in each common switchbox. In other words, these nets are formed in such a way that an SSC fault in the switch(es) under consideration will result in bridging of these mirror nets. E.g., as shown in Fig. 4(c and . It follows from the construction of our testing technique for the WUT pair that if there are faults in either net in it and/or a stuck-closed fault in any spanning switch between the two nets in the WUT pair, the ORA output will be different than (
). Note that due to the application of identical-bit as well complementary-bit test vectors, there are no masking (aliasing) faults between the two nets in the WUT pair across all test vectors.
X
Thus, the basic mechanism of the I-BIST technique, discussed above, is able to detect all types of faults in a pair of nets and requires only three test vectors
. We next discuss how this mechanism is used in the global and detailed testing phases of the I-BIST technique.
Global Testing
The aim of the global testing phase is to first detect faults, and then isolate the possible fault locations to a small number of suspect interconnects, thus making the task of the detailed testing phase simpler. Also this aim needs to be achieved in a minimum number of configurations as mentioned in objective Figure 5 shows one of the five configurations of the global testing phase, that we will use to explain our technique. As shown in Fig. 5 , the basic mechanism discussed in Sec. 2.2 for a pair of adjacent and mirror nets, is simultaneously applied to all adjacent and mirror net pairs in the ROTE area using the same TPG to generate the replicated test vectors that are applied to all these compared net pairs but using distinct ORAs to evaluate their outputs. We call a set of similarly configured nets in each channel a net-set. Hence in Fig. 5 , the nets 
Simultaneous adjacent-&-mirror pair comparison technique
b a d c d e f e d g d h 8 ì p a d c d e d e d g d h 8 q b a d c d e d e d g d h C r s g d t v u d w E g d x $ u $ y d q d s g d t v u d w E g d x $ u $ y d r d i i i i i i s g d t v u d w 1 g d x v u $ y d E i d d e f g i h j ¥ k k l m n o b p q r t s s b u v w y x E z d { E { E |{ E } w d | d w d E d d d E d d )
U v
shown by continuous lines form another net-set. In order to detect interconnect faults, two types of net comparisons are performed:
1. Intra-net-set comparison: Each pair of adjacent nets in a net-set is compared by an ORA configured as a n -bit XOR. E.g., in ). We will refer to the ORAs used in intra-net-set comparison as intra-net-set ORAs. ). We will refer to the ORAs used in inter-net-set comparison as inter-net-set ORAs. With regards to the configuration details discussed above, we next describe how the design of the five configurations is derived on the basis of the possible interconnect faults. Note that, as discussed earlier, to detect non-SSC faults on a particular interconnect, the necessary and sufficient condition is to include this interconnect on a net in at least one configuration, while to detect SSC faults in a spanning switch, the necessary and sufficient condition is that each of the two compared nets from each of the two net-sets should contain exactly one of the two wiresegments on either side of the spanning switch. ; similarly for detecting stuck-closed faults in switches p ³ q u r of channel n , the wire-segments on the east and the west side of the switch should be contained in different nets. The design in which both these requirements can be met is the one corresponding to configuration and is indicated by the dense lines in Fig. 6(a) . As switches , hence stuck-closed faults in these switches will also be detected.
In order to detect stuck-closed faults in the remaining switches of channel n , i.e., switches Fig. 6(b) . Thus, configurations and n together can detect all switch faults of channel n .
In order to detect stuck-closed faults in switches , configuration 3 (Fig. 6(c) ) is required.
Stuck-closed faults in switches Once the nets required to detect switch stuck-closed faults are designed (shown by dense lines in Fig. 6(a) -(e)), the remaining task is to extend the nets in each of the configurations in such a way that it passes through each interconnect in at least one configuration, as per the necessary and sufficient condition of detecting non-SSC faults. This is shown by the lighter (arrowed) lines in Fig. 6(a) -(e), and fully shown for configuration in Fig. 5 .
THEOREM 2 Any number of fault(s) in the V-set (i.e., set of interconnects under a V-ROTE area) will be detected by the SAMP technique across the five configurations of the global testing phase.
Proof: From the above discussion, the five configurations of the global testing phase include all the wire-segments as well as all switch-box switches in both connected and spanning modes in the ROTE area. Thus all wire segments and swtitches are tested for all possible faults, and from Theorem 1 they will be detected irrespective of their numbers.
X 4 Detailed Testing
Detailed testing is the second phase of interconnect testing (I-BIST) wherein we diagnose faults in the nets detected as faulty in either of the five configurations of the global testing phase.
For further reference, with respect to a particular track-set, we will refer to a configuration of the global testing phase as a failed configuration if at least one of the intra-net or inter-net ORAs in that configuration has an output of fail. The global testing phase with one or more failed configurations for a particular track-set will be referred to as failed global testing phase, else it will be refer to as passed global testing phase for that track-set.
One simple approach to performing detailed testing is to form a suspect set that consists of all the interconnects in the ROTE area minus the interconnects of the nets corresponding to the passed global configurations. Each element of this suspect set is then diagnosed by comparing it with a fault-free element. This (for north-south) indicates the wire-segments' locations (at the north and south ends of the switch) that the switch can connect.
High-level view of the divide-and-conquer approach of the approach, however, is inefficient in terms of the fault latency, as the size of the suspect set may be large. E.g., a single stuck-at fault on a horizontal wire-segment will lead to a failure in all the five configurations and all the interconnect components of the corresponding track-sets will be the elements of the suspect set. We will refer to this approach of diagnosing the elements of the suspect set as the flat intersection approach. In order to overcome this problem of high detailed test time (or fault latency), we propose another approach based on the divide-and-conquer (D&C) paradigm. In this approach, in case of a failed global testing phase corresponding to a particular track-set, we iteratively divide the ROTE area into two halves and apply the failed global test configuration(s) on the two subsets of the failed track-set located in these two ROTE sub-areas. E.g., as shown in Fig. 7(a) , track-set or the spanning switches between them), is divided into two sub-track-sets each belonging to one of the two ROTE sub-areas Î and Ï , and then configuration 1 along with the SAMP technique are applied simultaneously to the two sub-track-sets (Fig. 7(b) ). The sub-track-set(s) that fail are further divided into two halves and the above procedure is repeated (Fig. 7(c) ) until the size of the ROTE sub-area spans two rows, the minimum area required to apply the global test configurations. Also, as was implicit in the above example, a sub-track-set in a particular sub-area needs to be tested for only those configurations that failed in the global testing of its root track-set. E.g., in Fig. 7(a) , if track-set n failed for configurations and n only, then the two sub-track-sets need to be tested for configurations and n only. This is because the components of track-set n that correspond to passed configurations (o -Y ) of the global testing phase are guaranteed to be fault-free.
The nets in each sub-track-set in the D&C approach are also tested in the same way, using intra-net-set and inter-netset ORAs, as in the global testing phase. However, if in a failed ) of these track-sets results in a pass output, then it implies that there is no wiresegment bridge fault between these two nets or any other interconnect fault in any of these two nets. Hence in detailed testing (i.e., in the D&C approach) the nets formed in each ROTE subarea are tested only with inter-net-set ORAs for testing inter-net pairs (j After the fault(s) are confined to the minimum possible ROTE sub-areas (i.e., n -row-length sub-areas), the interconnect components in each failed configurations of these sub-areas constitute the final suspect sets. Note that the switches that lie between any two sub-areas do not get diagnosed while diagnosing the interconnects of the two sub-areas, and hence these switches are also included in the final suspect sets. The elements of the suspect sets of each of the minimal sub-areas are then tested by comparison with a fault-free element. However, since the number of components in the minimal sub-area is far less compared to that of the entire ROTE area, the size of the suspect set is small. Thus the fault latency of D&C approach is much smaller compared to that of the flat intersection approach. 
Simulation and Analytical Results
×
tracks/channel, each track formed of single-length wire-segments, was functionally simulated in C++. Different types of interconnect faults were inserted randomly in the wire-segments and switches. Table 3 gives the fault coverage (% of faults that are correctly diagnosed) results for different fault densities (% of interconnect blocks that are faulty. The internal fault density of each interconnect block was randomly selected from Table 3 affirms the high-diagnosability of our I-BIST technique. fault latencies of the two approaches to detailed testing, divide and conquer (D&C), and the flat intersection approach. The results clearly indicate that D&C is a better approach in terms of fault latency; fault coverages are the same for both techniques. configurations-all these add up to much lower test times as well as power consumption for I-BIST compared to the technique in [1] .
Conclusions
We presented a new BIST technique, I-BIST, for testing interconnects of an FPGA, which was theoretically proven to to be able to detect any number of mutiple faults, and empirically shown to have a very high-diagnosability even at high fault densities of up to 10%. Our BIST technique is applicable to both offline as well as online testing (testing concurrently with the circuit operation). To the best of our knowledge, this is the first BIST technique that alleviates the limited diagnosability of only one fault per WUT (wires-under-test) of previous interconnect BIST techniques. I-BIST is thus a very effective interconnect testing technique for current as well as emerging nano-scale FPGAs that are expected to have high fault densities. The fault latency or test time of our technique is also much lower than those of previous interconnect BIST techniques as demonstrated in Table 2 , and should be scalable with the large FPGA device sizes expected in emerging technologies.
