Designing a testable circuit is often a two step process. First, the circuit is designed to conform to the functional specifications. Then, the testability aspects are added,. By taking the test strategy into account during the synthesis of the circuit, the overhead due to the test features can be reduced. We present a synthesis-for-scan procedure, called beneficial scan, that orders the scan chain(s) during logic synthesis to minimize the area and pelformance overhead due to the scan-path by sharing the functional and the test logic. The results show that circuits synthesized with beneficially-ordered scan chains consistently have smaller area and are easier to route than circuits with traditional MUXed flip-flop scan-paths.
Introduction
Scan chains are widely used to improve the testability of sequential designs. A scan chain provides direct access to the internal nodes, greatly improving the controllability and observability of the circuit. It is then no longer necessary to generate test patterns for a sequential circuit, a difficult. and time consuming task, since the scan chain effectively turns the sequential circuit into a combinational circuit during testing.
There are a variety of scan techniques in use, each with its positive and negative aspects [l] . Scan designs, in general, have some costs associated with them. These costs are:
1. Additional circuitry added to make bistables scannable increasing the circuit area. 2. Possible performance penalty due to the increased propagation delay in the scan chain bistables. 3. Additional pins necessary for the test signals. 4 . Additional interconnect that can add to area and reduce performance. 
5.

Scan element classifications
Since each flip-flop input equation is a function of flip flop outputs and primary inputs, each flip-flop in a synchronous circuit has some relationship with every other flip-flop in the circuit. This relationship may be trivial, as in the case when the input to a flip-flop is not a function of another flip-flop, or the relationship may be more complex.
The relationship could be as simple as whether or not the input function of one flip-flop includes the output of another flip-flop, or the relationship could be based on more complex characterisdcs, e.g., thle input function is positive (or negative) in a variable.
Some of these relationships allow some, or all, of the test logic to be shared with the functional logic. These are beneficial relationships. Other relationships do not allow the logic to be shared, and a MZJX must be inserted to make a flip-flop scannable.
These are non-beneficial relationships.
For the purposes of beneficial scan, the relationship between flip-flops is based on Shannon's expansion theorem. The Shannon expansion transforms a function, f(xl, xz, ..., xn), based on residues. Thexi-residue, also called the positive-phase cofactor off with respect to xi, is defined as The Shannon expansion is defined as
Based on the Shannon expansion, each flip-flop j may be classified with respect to every other flip-flop i. The classifications are described in Table 1 , and a more detailed example is given below. Some of these classifications are beneficial relationships and allow the test logic to be shared with the functional logic. These beneficial relationships are marked with an "B" in Table 1 . The other classifications are non-beneficial and no logic may be shared. While other relationships exist that may be used to classify the flip-flops, these are the ones used in this technique.
Notation: Dj is the input of flip-flop j . Qi is the output of flip-flop i. T is the scan test select signal-1 for scan mode, 0 for normal system operation. h is the function for normal system operation, i.e., the input equation before scan is inserted. s and a are arbitrary functions. The goal is to find a minimal weight path through the graph that starts at an input node and covers all the flip-flop nodes. For a circuit withp primary inputs and n flip-flops, there are pen! possible scan chain orderings since there are n! ways to order the flip-flops andp different inputs to use to scan in data. An exact solution is computationally expensive, so an heuristic is required.
Case 2 example
Certain characteristics of the relationship graph allow heuristics to perform very well. A greedy approach works well.
This basic greedy algorithm can be modified to make better choices at each s t e p s u c h as giving preference to flip-flops with only one beneficial relationship over flip-flops with multiple beneficial relationships in order to leave more options open for later, or making sure that a chosen flip-flop is not the only beneficial case for another flip-flop. These modifications, along with others, help guide the algorithm to a better result.
The algorithm tries to reduce the number of additional test pins needed by using one of the existing primary inputs as the scan-data-in pin. If there is a need to have an explicit scan-data-in pin, a new primary input may be added to the relationship graph. This new input will have a case 0 arc to every flip-flop node.
Multiple scan chains
The approach just described will give an ordering for a single scan chain. The possibility of finding orderings for multiple scan chains also exists. Multiple scan chain orderings take advantage of an additional characteristic of the relationship graphs:
Beneficial relationships with a primary input are more frequent than beneficial relationships with other flip-flops. Even if a flip-flop has no beneficial relationships with other flip-flops, it is likely that it will have a beneficial relationship with one or more inputs. By allowing multiple scan chains, a lower cost scan chain ordering can be obtained since more beneficial relationships with primary inputs will be included.
The algorithm is the same as that for single scan chains, with a slight modification to step 2:
1. Choose the input with the lowest weight edge to act as the scan data input 2. Add to an existing chain the flip-flop with the lowest weight edge remaining or 2. Add another scan chain if cost is less than adding flip-flop to existing chain(s)
Repeat step 2
As with the single scan chains, the algorithm tries to use existing primary inputs as the scan-data-in pins. Any additional logic sharing achieved by using multiple scan chains comes from this sharing of the inputs.
The algorithm only adds another scan chain if there is some additional logic sharing gain. A specific number of Table 2 . Benchmark characteristics I 11 Number I Number scan chains may be forced with some degradation in the results. The algorithm also tries to balance the chains without degrading the results. If there is more than one equally attractive choice during step 2, the flip-flop is chosen to balance the existing scan chains. Completely balanced chains may be forced, but the resulting order will not be optimal.
Implementation and results
Beneficial scan
This algorithm to order flip-flops for single, or multiple, scan chains has been implemented in SIS [ 191. Existing SIS facilities are used to synthesize and technology map the resulting circuit. The circuit is mapped to the Alliance library, and the Alliance CA.D system is used to do placement and routing and static timing analysis [20] . Table 3 shows the results of the scan chain ordering. Both single and multiple scan chains are shown unless the two results are the same, i.e., there is no benefit in having multiple scan chains for that particular circuit. For each benchmark circuit the number of scan chains is shown along with the breakdown of the flip-flop classifications used in the final scan chain. Table 4 shows the area results of the circuits after placement and routing. Data is shown for five different scan-path implementations: 1) the circuit synthesized and a single scan chain, not ordered based on beneficial relationships, implemented entirely with MUXed flip-flops then added, 2) the circuit with a single beneficially-ordered scan chain, 3) the circuit with multiple beneficially-ordered scan chains, 4) the circuit with a single beneficially-ordered scan chain including cost-free scan and 5) the circuit with multiple beneficially-ordered scan chains including cost-free scan. The last two variations including cost-free scan are discussed in section 4.2. The overhead is calculated as ) x 100 area of scan -area of nonscan area of nonscan %overhead = The beneficially-ordered scan chain circuits achieve a significant area reduction over the MUXed flip-flop scan chains. The average area overhead for beneficially-ordered scan chains, after placement and routing, is about 12%, compared to about 27% for more traditional scan chains. The area savings before routing (table 7 in the appendix) is not quite as great since much of the area savings result from the elimination of interconnect through the logic sharing and the reduction in the length of the scan signal interconnect. Traditional scan chains are often ordered based on preliminary layouts to reduce the length of the scan signal interconnect. Beneficially-ordered scan chains have short interconnect as a result of the ordering; i.e., flip-flops that should be placed near each other in layout to produce short functional interconnect also should be placed near each other to produce short scan interconnect. All of this results in lower interconnect overhead for the beneficially-ordered scan chains. Four of the routed circuits, two single scan chain circuit and two multiple scan chain circuits, actually end up being smaller than the non-scanned circuits. This result is a phenomenon of the place and route tool. The non-scanned circuit could achieve the same (or better) area as the beneficially-ordered circuit by using the same placement and routing layout, but with the same level of effort the place and route tool gives better results for the beneficiallyordered circuit than for the non-scanned circuit. The beneficially-ordered circuit is easier to place and route.
The data is not shown, but the average worst-path delay overhead for the beneficially-ordered scan chains is 16%, and for the MUXed flip-flop scan chains the average delay overhead is 19%. Beneficial scan can reduce the area and delay overhead due to scan.
Beneficial scan with cost-free scan
Cost-free scan I l l ] is a recent technique that allows the reuse of combinational logic for scan by exploiting the controllability of primary inputs. The circuit is analyzed and a primary input vector, called the enabling vector, is selected to enable the maximum number of free-scan flipflops. A free-scan flip-flop does not require any additional logic to implement the scan operation. Cost-free scan is very similar to beneficial scan, but it uses different relationships in the analysis of the circuit.
Free-scan flip-flops have been added to our beneficial scan technique by adding a new class of beneficial relationship to the analysis. The beneficial ordering is now able to use free-scan flip-flops in addition to the cases presented in section 2. Tables Multiple scan chains use additional primary inputs as test inputs to reduce the overhead, but in order to use free-scan flip-flops, certain primary inputs belonging to the enabling vector may not be used as scan inputs. This restriction reduces the number of beneficial cases available for multiple scan chains. In general, the addition of free-scan to beneficial scan can give better results than either technique by itself.
Using free-scan flip-flops has the added disadvantage of reducing the routing benefit observed with beneficial scan. Beneficially-ordered scan chains are easier to route, as discussed in section 4.1, but the addition of the free-scan flip-flops can reduce the layout tools effectiveness, resulting in slightly larger areas for some circuits.
Comparison of beneficial scan with other techniques is complicated by the fact that the most of the savings from beneficially-ordering scan chains comes after the circuit has been placed and routed. Most other techniques do not include this data. Comparisons based on the literal counts are possible, but they can be misleading. Table 6 in the appendix shows the literal counts for the benchmark circuits. The literal counts for the beneficially-ordered scan chain circuits do tend to be smaller than for the MUXed flip-flop scan chain circuits, but the literal count can increase for with beneficial scan since Shannon's expansion theorem is used to factor the logic equations. This expansion does not always reduce the literal count, but it enables the logic to be shared for the beneficial relationships, and can result in smaller circuits overall once the circuits are placed and routed.
Conclusions
This paper has presented a synthesis-for-scan technique, called beneficial scan, that orders the scan chain so as to maximize the sharing of the functional and the test logic and thereby minimize the area and delay overhead due to the scan chain. The results show that circuits synthesized with beneficially-ordered scan chains consistently have smaller area, smaller worst-path delay, and are easier to route than circuits with traditional MUXecE flip-flop scan-paths. The average area overhead for beneficially-ordered scan chains, after placement and routing, is about 12%, compared to about 27% for more traditional scan chains. This area reduction results from two factors, the sharing of the functional and test logic and the reduction in the interconnect.
The class of beneficial relationships can be expanded to included free-scan flip-flops allowing more opportunities to share the functional and the test logic. Other relationships may also be found that can increase the resulting number of beneficial relationships, further reducing the test overhead. 
