Abstract-An array cell (AC) architecture for the layout design is described, which is dedicated to lowpower design by means of the NMOS 4-phase dynamic logic. An AC is constructed of ( M x N ) + 2 transistors so as to constitute each type of NMOS 4-phase logic gates. A graph theoretic approach is exploited in the layout design to reduce the layout area. A number of experimental results demonstrate the practicability of the proposed approach.
I. INTRODUCTION
With the recent spread of portable, wireless, and handheld computing systems, there arises more and more demands of low-power VLSI innovations [ l] .
The CMOS logic scheme is widely used today, in which a CMOS gate has such a structure that a pair of nMOS and PMOS logic blocks are connected serially between Vdd and Vss. Hence, while input signals are transitive between the "0" and "1" logic levels, the short-circuit current flows from Vdd to Vss, which makes the power dissipation increase suddenly according as the clock frequency grows [2] . In addition, the pass-transistor logic, which may be regarded as another promising solution [3] , tends t o lack the robustness against downsizing and voltage scaling [4] .
The NMOS 4-phase dynamic logic [5] [6] is evaluated t o be still a practical low-power scheme under the current technology [7] , in which four types of logic gates are used as illustrated in Fig. l(a) , together with four different clock signals as depicted in Fig. l(b) . Each type of 4-phase logic gates is composed of an nMOS logic block to realize a logic function and two nMOSFETs for precharge and gate control. Every 4-phase logic gate works in two phases, i.e. precharge and evaluation phases, and hence there exist priority relations among inputs/outputs of gates of four types as specified in Fig. l(c) . The NMOS 4-phase dynamic logic scheme has distinctive features in comparison with the CMOS logic scheme, such as elimination of short-circuit current, small capacitive load, low signal swing range, small number of transistors, and ratio-less transistors for layout. It is reported that not only the low-power consumption but also the short delay of a gate can be attained by this logic scheme more than that by static CMOS, domino CMOS, and pass-transistor logic schemes. Due t o this short delay performance, a logic block of this dynamic logic can contain a fairly large number of transistors, for example, 70 nMOSFETs for 100MHz, 144 nMOSFETs for 50MHz, etc [7] . Motivated by this tendency, in this paper an Array Cell (AC) architecture is devised, which can contain (MxN)+2 nMOSFETs t o constitute an NMOS 4-phase logic gate.
ARRAY CELL ARCHITECTURE

A . Layout Model
For ease of layout synthesis, let a logic block be constructed in an M x N nMOSFET array as shown in Fig. 2(a) , where it should be noticed that nMOSFETs are arrayed without interconnection. Now, consider a trivial logic function which is realized by a regular interconnection as shown in Fig. 2(b) , and implement this function by using 0.35pm technology at 3.3V and 1.8V supply voltages, with the nMOS threshold voltages of 0.55V and 0.4V, respectively. We have attempted SPICE simulations for this trivial function on different values of M and N. Fig. 2(c) shows a part of simulation results [7] . Considering that a real logic function can be constructed in this array usually with fewer nMOSFETs than this trivial function, we can see that given an operation frequency and a size of M and N, a fairly large number of logic functions can be realized in this nMOSFET array. Now, let us define an Array Cell (AC) be a layout cell which contains M x N nMOSFETs of a logic block and two nMOSFETs for precharge and gate control, as illustrated in Fig. 3 , where the number M of columns denotes the width of an AC and the number N+1 of rows the height of an AC. In terms of the layout standardization of a layout macro for the 4-phase dynamic logic, henceforth let us fix the width M of each AC, as depicted in Fig. 3 , just in the same way as the standard-cell approach.
B. Layout Generation
The layout design of an AC is executed in three steps: (i)construction of an n-side graph [8] with the minimum number of odd degree vertices, (ii)decomposition of an n-side graph, and (iii)placement of transistors. In what follows, the layout process is exemplified through a carry generator of 4-bit adder as illustrated in Fig. 4(a) .
In realizing this logic function by means of the nMOS 4-phase dynamic logic, first an n-side graph G is constructed, which represents an interconnection topology of nMOSFETs of a logic block. Specifically, an n-side graph is constructed in such a way that (i) each vertex U corresponds t o a drain/source u of an nMOSFET, and
(ii) each edge (u,v) connecting vertices U and v corresponds to an nMOSFET whose drain/source is u and V.
Usually the interconnection topology of nMOSFETs in a logic block is of the so-called series-parallel structure, and hence, in graph theoretical terms, an n-side graph is planar [8] . Moreover, the following property of this graph is of great use for layout synthesis; if two edges i and j are adjacent in an n-side graph, then corresponding nMOS-FET i and j can share a drain/source. This series-parallel structure of G can be represented by a composition tree T[9], where T is an ordered rooted tree in which each leaf represents an edge of G, and each interior node represents the application of a number of successive compositions of the same series or parallel operation type. Depending on the type of composition, an interior node is a series node (s-node) or a parallel node (p-node). If the ordering of the sons of a p-node is changed in T, then the graph is not changed. On the other hand, if the ordering of the sons of an s-node is changed in T, the number of odd degree vertices is reduced in the graph.
Following the rule in [9], a reordered composition tree T' is constructed to achieve an n-side graph with the minimum number of odd degree vertices.
For example, an n-side graph G of the carry generator of 4-bit adder of Fig. 4(a) can be drawn as shown in Fig. 4(b) . The composition tree T of Fig. 4(b) and reordered composition tree T' are drawn in Fig. 4(c) and Fig. 4(d) , respectively. Fig. 4 (e) shows a reordered n-side graph with the minimum number of odd degree vertices.
Given an n-side graph with the minimum number of odd degree vertices, the 2-D array layout of an AC is built.
An Integer Linear Programming(1LP) method is used in the automatic layout generation of CMOS cells[lO]
, where the 2-D array height minimization problem for CMOS cells is defined as a covering problem -a subset of chains that covers each transistor and has the minimum cell height, needs t o be determined. However, the set of all chains which are generated from an n-side graph is too large t o be solved by ILP. Considering the tradeoff between the minimum size of cell height and the computational complexity, we decompose the n-side graph into Euler paths in the following heuristic way in order to reduce the computational complexity: First, the minimum number of Euler paths are determined by tracing the reordered composition tree in the bottom up manner. Second, each of Euler paths is decomposed into the minimum number of subpaths whose maximum length is [t/M1, where t denotes the number of transistors.
Given a set of chains, the 2-D array layout is applied for the purpose of determining the placement of transistor chains. In the same manner as stated in [lo] , we summarize in Table I the assumptions underlying the 2-D array layout. Fig. 5 illustrates an example of the 2-D array layout for an AC. The width of AC is assumed t o be fixed, TABLE I ASSUMPTIONS FOR LAYOU'r OE' AC.
1.
2.
3.
4.
5.
The logic block of a gate is composed only of seriesparallel connection. AC consists of M diffusion columns. Metal-1 routing is taken as far as possible. Diffusion gaps do not allow a Metal-1 wire to be routed between them. Terminals that are on transistors placed in adjacent diffusion columns are connected using routing channel between the columns, otherwise, using routing channel on the top or bottom of cell. Fixed size of transistors are employed. maz{H, : for each column j = 0 , . . . , c } (1)
where c is the number of columns, and H,, t,, c,, and w, denote the height, the number of transistors, the number of chains, and the number of horizontal wires of the jth column, respectively. Table I1 Chain inclusion: A chain is included exactly once.
J E l , . . . , C Column occupancy: A column must contain a t least one chain. 
: columny > columnj} (8)
CMOS ratio
The above constraints are non-linear, since they involve logic operations and, or, and not, and hence they have to be linearized, in the same manner as stated in [lo] , to be included in this ILP model.
EXPERIMENTAL RESULTS
The ILP model has been solved by using GAMS/OSL for a number of circuit examples. Table I11 shows experimental results for a carry generator of 4-bit adder with variable width. The layout patterns of the carry generator with 6 columns have been implemented with the area of 48.05x9.55 pm2 as depicted in Fig. 6 .
For another example of a complicated logic function realized by a number of ACs, we have implemented an 8-bit round shift circuit by means of the 4-phase dynamic logic and the CMOS logic, and experimental results are summarized in Table IV . It can be readily verified that the use of our AC architecture for the 4-phase logic can reduce effectively not only the number of transistors but also the power dissipation, as compared with that of the CMOS logic.
IV. CONCLUSION
This paper has described an Array Cell (AC) approach and its application t o the layout design. This AC architecture can integrate a given logic function with fewer transistors than the CMOS logic, resulting in low-power consumption. The layout generation of an AC is based on the n-side graph and ILP.
Development is continuing further on CAD algorithms for logic synthesis dedicated t o this NMOS 4-phase dynamic logic.
