We present a new output encoding problem as follows:
. INTRODUCTION
Column compaction plays a major role in the synthesis of digital systems [ 11. In column compaction, the number of output functions for a given specification is reduced by merging outputs which are logically equivalent, or can be made equivalent through assignment of don't cares. Two examples of output column compaction are shown in Fig.  1 . Given a set of output columns, the problem of finding the smallest set that can be obtained by compacting the given set is related to the maximum clique partitioning problem, which is an NP-complete problem [2] . This smallest set of compacted outputs is called the minimum cardinality output column cover. Column compaction can greatly reduce circuit area; this is not only true for the PLA implementation of the circuit [ 3 ] but also for multi-level logic circuits. The different types of encoding problems, studied in the past, are the finite state machine (FSM) state encoding problem and the input and the output encoding problem. Techniques for FSM state encoding have been discussed in [4, 5, 6, 7, 8, 9, 101 . The input encoding problem has been reported in [ l l , 121 . One version of the output encoding problem, with a goal to reduce the number of product terms of a logic function, has been discussed in [13] . Another version of the output encoding problem is discussed in [14] . A heuristic solution to the output encoding problem which uses column compaction to reduce the number of outputs after the symbols are encoded is given in [15] . However, this algorithm does not consider any existing non-symbolic (binary) outputs when determining the encoding for the symbols of the symbolic output column.
In this paper, we present an output encoding problem whose objective is different from those mentioned above. The input to our problem is a specification table of a combinational (or sequential) circuit in terms of a truth table (or state table) such that some of the outputs are specified in terms of O's, 1's and don't cares while the other outputs are symbolically specified. The problem is to encode the symbols in the symbolic output column, using rlog2nl bits for n symbolic outputs, so that after encoding, the cardinality of the output column cover computed using column compaction is minimum. Section 2 explains the motivation behind studying this type of an output encoding problem. Section 3 presents our output encoding algorithm for the case in which the specification table does not contain any don't cares in its output part. In Sec. 4, we extend the algorithm of Sec. 3 to handle don't cares in the outputs of the specification. Experimental results are reported in Sec. 5 followed by conclusion in Sec. 6.
. MOTIVATION
Consider the specification shown in Table 1 . Output columns cl, c2, c3 and cq of Table 1 are specified in terms of l's, 0's and don't caresthey constitute the B-set, the bound set. The last output column of Table 1 is symbolic it is referred to as the S-column, the symbolic column.
In this paper, we will consider specification tables containing a single symbolic output column (S-column) for simplicity. We can handle cases with multiple symbolic output columns by choosing an appropriate ordering of the symbolic outputs and repeatedly applying the algorithm reported in this paper.
For Table 1 , since we have seven distinct symbols in the S-column, we need 3 bits to encode them. Output1
Cl r2 c3 .a -1 0 0 1 0 It is possible to reduce the number of output columns after column compaction if we encode the symbols of Table 1 This output encoding problem has many applications. Digital systems are often specified in terms of a combination of binary valued and symbolic signals. This problem is applicable to reduce the area of scan-based [ 161 designs containing one-out-of-n (one-hot) signals, as described in [17] . This algorithm can serve as a preprocessing step for FSM state encoding.
ENCODING ALGORITHM FOR FULLY SPECIFIED OUTPUTS
In this section, we present our output encoding algorithm with the assumption that the output columns belonging to the B-set are fully specified with 1's and 0's for simplicity. We consider the example of Table 4 for illustration. We describe the three basic steps of our algorithm as follows:
Step 1 : Find Inconsistent Columns: Find all pairs of rows, p and q, in Table 4 for which the S-column has the same value and Cj E El-set has different values. Then ci is said to be inconsistent. For this example, the B-set is {cl, c2, c3, c4, c5, cg}. Here, cl has a 1 in the second row and a 0 in the 4th row; but the S-column has X2 in both of these rows. Hence, cli is inconsistent. Column c3 is also inconsistent for symbolic output X3.
Step 2: Create reduced consistent output table:
Remove inconsistent columns from the specification table to obtain the consistent output table (COT) . Merge equal rows of COT to obtain reduced consistent output table (RCOT) . Table 5 is the RCOT for Table 4 . Since there are 6 symbolic outputs, we use m = 3 bits to encode them.
Step 3: i-column counting check: Any set of i columns of the RCOT (i I m) having any of the 2i possible binary values of length i appearing more than 2m-i times in the rows of the RCOT cannot reduce the size of the output column cover after encoding the symbols.
All sets of i columns that satisfy this i-column counting check form elements of column-count-set-i (CCS-i). For Table 5 , CCS-1 = {{c2}, {cg}, {cg}], CCS-2 = { {c2, c5}, {cg, cg}, {c2, qj}}, CCS-3 = NULL. {c4} is not included in CCS-1 because the number of 1's in the c4 column is 5 (greater than 4). We terminate this step when either a CCS set is NULL or CCS-m has been generated.
Table4 . Specification T k
For Table 4 . we can encode 1 r a b l e 5 . TheRCOT. 0 1 1 1 1 1 0 0 ; symbols using three bits in such a way that two of the three columns, representing the symbolic outputs, can be merged with the existing output columns by column compaction. We choose any member of CCS-2, say, {cz, c5}. The first two bits in the encoding of a particular :symbol will have the same pattern as the one present under the columns c2 and c5 in the row corresponding to that symbol in the RCOT. We determine the third bit such that the codes assigned to the symbols are distinct. Thus, referring to Table 4 , Xi can be encoded as 010, X2 as 000, X3 as 011, Q as 100, X5 as 110 and a as 101. It is straightforward to prove the optimality of our solution [18] . It should be noted that if we allowed 4 bits to encode the symbolic outputs of Table 4 , then all output columns corresponding to the four encoding bits could be merged with c:~, c4, c5 and cg. In that case, we can iterate step 3 incrementing the number of encoding bits until all the encoding bit outputs can be merged with the existing outputs or we reach a point where we have tried to encode n symbolic outputs using n bits.
. ENCODING ALGORITHM FOR INCOMPLETELY SPECIFIED OUTPUTS
The basic steps of the algorithm described in Sec. 3 can be extended to handle don't cares in the B-set. cl c2 c3 c4 c5 cg 0 -1 0 -0
Table7 .TheRCOT.
Consider the specification shown in Table 6 . The RCOT is shown in Table 7 . The CCS-1 set is { { q } , {cg}, {cg}}. For calculating CCS-2 and CCS-3 we must ensure that the count of the fully specified binary strings (binary strings containing no don't cares) only need to satisfy the bounds discussed in step 3 of the algorithm in Sec. 3. Thus, CCS-2 is { {c2, c5}, {c2, cg} and {cg, qj}}. CCS-3 is a null set and we choose { c2, c5 }, a member of CCS-2, for providing the first two bits in the encoding of the symbols using 3 bits. Next, we solve the problem of assigning 0's and 1's to the don'tcures in columns c2 and c5 so that the output columns representing the symbolic outputs can be merged with c2 and c5. We formulate the problem as a bipartite graph matching problem [2] . We form a weighted bipartite graph as follows: Set Vi contains vertices corresponding to each symbolic output with a label equal to the string formed by the entries in the row corresponding to that symbolic output under columns c2 and c5 of Table 7 . Set V2 contains 4 vertices each having a label that is a distinct binary pattern of length 2. For each U E Vi and v E V2 we have an edge (U, v) of weight 1 if and only if the label of v contains l(0) in all positions in which the label of U contains l(0). We add two more vertices to the graphthe source and the sink. There is an edge of weight 1 from the source vertex to each member of V I . There is an edge of weight 23-2 = 2 from each member of V2 to the sink.
The graph along with the source and the sink nodes and the edge weights is shown in Fig. 2 . We solve the maximum network flow problem on the graph using the Ford-Fulkerson method [19] . The solid edges show the final mapping. The first two bits in the encoding of X i , X2, X3, X q , X 5 and X g are 00, 01, 01, 10, 11 and 10, respectively. Hence, one encoding of XI, X 2 , X 3 , X 4 , X 5
andxg is 001, 010, 011, 100, 111 and 101, respectively. Note that there may be multiple solutions to this bipartite graph matching problem. Hence, our algorithm can be further refined by selecting a mapping of members of V1 to the members of V2 based on some heuristic following the observations developed in [ 131 and [ 141.
w w 
EXPERIMENTAL RESULTS
In this section, we present experimental results. We added a symbolic output column to the MCNC FSM benchmarks and varied the number of symbols in the symbolic output column; depending on the symbol count, the number of encoding bits required were either 3 or 4. Table 8 . Area results for our output encoding algorithm (best case)and the worst case which never merges any output column.
*: Theseare the cases where all the output columns could not be merged by our algorithm because the cardinality of the Bset is less than the number of bits needed to encode the symbols.
We entered symbolic values in the S-column to ensure that there exists an output encoding, for which all the columns generated due to the encoding of the symbols could be merged with the existing outputs (members of the B-set), assuming that the number of encoding bits did not exceed the number of existing non-symbolic (binary) output columns. We applied our output encoding algorithm to encode the symbolic output columns, then compacted the outputs using column compaction. We then optimized these specifications using sis [20] . For comparison purposes, we then encoded the symbolic outputs such that none of the outputs corresponding to the encoding bits could be merged by column compaction with the pre-existent outputs (members of the B-set). This is the worst-case encoding. The results are shown in Table 8 . Although our scheme works for both combinational and sequential logic specifications, we used FSM benchmarks because we are investigating FSM synthesis techniques for one-hot signals, as mentioned in Sec. 2 [17] . For both cases, we used NOVA [lo] to encode the FSM states and used the recommended rugged script to perform multi-level logic optimization. Finally, we used the LSI Logic glop library [21] for technology mapping. Table 8 shows the area values (in terms of LSI Logic glop cell units) obtained using our output encoding algorithm and the worst-case output encoding where none of the encoding bits could be merged with the already existent nonsymbolic output columns by compaction.
CONCLUSION
In this paper, we have presented a new output encoding algorithm whose objective is to encode the symbols in the symbolic output column of the specification table using rlogznl bits for n symbolic outputs in such a way that the number of output functions, after performing the encoding and subsequent output column compaction, is minimum. An application of this algorithm is to generate a low-area circuit that always maintains a one-hot encoding on certain signals, even during scan or BIST operations as discussed in [17] . This algorithm can also be used as a p r e processing step for FSM state encoding. Although the algorithm produces a minimum solution and the worst case running time is exponential in the number of bits used to encode the symbolic outputs, our algorithm achieves significant speedup through iterative refinement by removing from consideration inconsistent output columns or sets of output columns. As the experimental results show, the circuits generated using our output encoding algorithm have significantly less area (0.9 % to 49 % for 3 bits, 8 % to 61 % for 4 bits) than worst-case circuits generated without considering output column compaction during the encoding of the symbols. In this paper, we have considered specifications with a single symbolic output column. In case of multiple symbolic output columns, we can integrate our technique with the output encoding algorithm reported in [15] to achieve a minimum number of output columns in the final table. 
