Adiabatic circuits offer a promising alternative to conventional circuitry for low energy design. Their operation is nevertheless subject to fundamental energy-speed trade-offs, just like any other physical realization of boolean logic. Thus, adiabatic circuits with very low energy consumption at low frequencies fail to function at high operating frequencies. Conversely, high-speed adiabatic circuits tend to be dissipative at low clock rates.
Introduction
Adiabatic circuit architectures reduce energy dissipation by steering currents across devices with low voltage differences and by recycling the energy stored in their capacitors [2, 31 . Broadly speaking, the efficient operation of these designs is yet another manifestation of energy-speed trade offs. Consequently, adiabatic circuits that operate very efficiently at low operating frequencies stop functioning at high data rates. On the other hand, adiabatic circuits with broad operating ranges tend to be dissipative at low frequencies.
This paper presents a source-coupled adiabatic logic family, called SCAL, that achieves very low dissipation across a wide range of operating frequencies. High speed is ensured by activating a sense-amplifier structure in a nonadiabatic manner. High energy efficiency across a broad frequency range is achieved by providing each gate with a current source that can be tuned by transistor sizing to achieve optimal charging rates for its operating conditions. In addition to low energy consumption and broad operating spectrum, SCAL features include true single-phase operation, balanced clock loading, functional completeness, and straightforward cascading.
A plethora of adiabatic logic families has been proposed in recent years. Several of these families, including ZN-ZP, ZN-ZNZP, PAL, CAL, and TSEL, use a sense-amplifier structure to drive their outputs adiabatically [4, 6, 7, 8, 91. The logic family we present in this paper is an enhancement of TSEL, a true single-phase adiabatic logic. SCAL achieves increased energy efficiency across a Permisrim to make digital or hard copier of all or part of this work for peisonal or cla~sioom use is granted without fee provided chat copies are not made or distrib uted for profit or mmmexial advantage and that copies bear this notice and the full citation on the first page. To copy othewis, to republish, to post on servers or to redistribute to lists, q u i r e s prior specific permission and/or a fee. ISLPED99, San Diego, CA, USA 01999 ACM 1-58113~133~X/W/wO8..$5.W broad frequency range while at the same time maintaining all the positive aspects of TSEL.
The remainder of this paper has four sections. The structure and operation of our logic is described in Section 2. Section 3 describes the operation of cascaded SCAL gates. Section 4 presents simulation results from layouts of pipelined 4-bit CLAs that were designed using a 0 S p n standard CMOS technology. Our contributions and ongoing research are summarized in Section 5. SCAL is a partially-adiabatic logic family that is clocked by a single-phase power-clock. Low energy operation is achieved across a broad operating range by tuning a current source attached to each gate. The basic structure of a SCAL PMOS gate is shown in the PMOS inverter of Figure I (a). This inverter comprises a pair of cross-coupled latches (MP1 and MP2), a pair of current control switches (MP3 and MP4), two function blocks (MP5 and MP6) and a current source (MP7). This current source is the main structural characteristic that differentiates SCAL from TSEL, its closest adiabatic relative. The charge flow rate through the current source is controlled by the W / L ratio of MP7. The constant voltage supply Vdd is required for activating the cross-coupled latches. The port PCLK is used to apply a sinusoidal power-clock 0.
SCAL Structure and Operation
A PMOS SCAL gate operates in two phases-discharge and evaluate. The energy stored in the node out or out is recovered during discharge. In this phase, the power-clock 0 L a r l s from high and ramps down toward low, pulling both out and out down toward the PMOS threshold voltage lVep 1. The new output of a PMOS gate is computed during evaluate. In this phase, 9 starts rising from low, turning MP1 and MP2 on along the way. When the gateto-source voltage lVgB,I of the current source MP7 exceeds Ivtpl.
MP7 tnms on and raises the voltage Vxp of the internal node XP.
While 0 remains below Kr -Ivt,l, M P3 and MP4 are conducting.
Therefore, assuming2at in is high and E igow, a pull-up path is created from V,, to out, and the voltage at out starts rising toward VXr. The cross-coupled latches function as a sense-amplifier and boost the voltage difference of the two output nodes. As soonas this difference exceeds IVt,l, MP1 turns off and the output out is charged adiabatically from that point on. When 0 exceeds KPIvt,l, MP3 and MP4 turn off and disconnect the functional blocks from the outputs out and a. Hence, any further changes in the inputs dAnot propagate to the outputs. By the end of the evaluation phase, out is charged up to the peak power-clock voltage.
The impact of the various circuit parameters on SCAL operation can be best understood by examining the equation describing the current of a MOS device in its saturation region:
where I d J is the drain-to-source current, V,, is the gate-to-source voltage, Vt is the device threshold voltage, p is the effective surface mobility of the camer in the channel, and CO, is the gate-oxide capacitance. From this equation, it follows that the current through the current source is controlled by the WIL ratio and V,, [I] . For low-energy operation, the duration of evaluate should be made as short as possible by appropriately selecting the biasing voltage of the current sources. For high-speed operation, the source current should be made large enough by appropriate transistor sizing.
The main difference in the operation of SCAL and TSEL is in the control of the evaluation phase. In TSEL, a reference voltage is used to control both the duration of the evaluation phase and the value of the charging current. To ensure correct function, however, the reference voltages must be two times larger than the thresholds of the PMOS and NMOS devlces. Thus, at low frequencies, the duration of evaluate cannot be shortened below a certain limit, and TSEL dissipation increases even when multiple reference voltages are used. On the other hand, the current flow in SCAL can be controlled by adjusting the WIL ratio of MP7, whereas the duration of evaluate is determined independently by selecting the biasing voltage of the current source. Thus, SCAL gates can be individually tuned to achieve minimum dissipation for low and high operating frequencies.
The inverter in Figure l Figure 4 shows the internal voltages and currents through the current source for each stage of the cascade in Figure 3 when simulated at I20MHz with a 3V peak-to-peak supply voltage and a 0.5pm standard CMOS technology. In these simulations, the WIL ratio of the cross-coupled latch in each gate was 512. For the function blocks, minimum size transistors were used with WIL ratio equal to 312. The WIL ratios for the current sources (devices MP17, MN27, MP37 and MN47) were 112, 114, 211 and 111, respectively. The length of the evaluate phase is the same for every stage. The internal voltages V,,L Vh. K,3, and V x . 4 and the internal currents &, Ixn2, I,,,, and I..r are proportional to the W / L ratios of the current sources. These voltages and currents affect the energy dissipation of the SCAL structures. As the WIL ratio decreases, the current is reduced. Symmetrically, as the WIL ratio increases, the current associated with the non-adiabatic event in the gates increases. For each gate, the optimal ratio can be achieved independently of the duration of the non-adiabatic evaluation. Therefore, energy dissipation can be reduced by using several current source sizes as opposed to just one. peak power-clock voltage was 3V, and each primary output was connected to a 60fF load. As expected, the energy consumption of the static CMOS implementation does not vary with the operating frequency. SCAL is less dissipative than 2N-2P and TSEL for the entire operating regime. Moreover, it is more efficient than PAL for operating frequencies above 100MHz. Below 100MHz, SCAL is still competitive with PAL and is no more than 50% more dissipative at 10MHz. SCAL operates at full logic swing for frequencies up to 280MHz. At 280MHz, it is about 3 times more energyefficient than static CMOS. 2N-2P breaks down at 200MHz. At that point, TSEL and SCAL are 1.5 times more efficient than 2N-2P.
Summary and Ongoing Work
We have described an adiabatic logic that functions efficiently across a wide operating range. Our logic requires a single power-clock phase for its operation and avoids numerous problems associated with multiple phases, including increased energy dissipation and layout complexity in the clock distribution network, clock skew, and multiple AC power supplies. Its efficient operation is based on a current source that can be tuned to control the rate of charge flow into or out of the gate during evaluation.
We In this section, we present HSPICE simulation results for pipelined 4-bit CLAs that we developed in SCAL, TSEL, 2N-ZP, PAL, and static CMOS. Our circuits were designed using a standard 0.5pm CMOS technology and were simulated with distributed RC parameters that were extracted from layout. Our simulations accounted for the dissipation of the gates and internal clock lines but did not include the energy consumed on the external clock distribution network or the power-clock generator. The layout of our SCAL CLA is shown in Figure 5 . The WfL ratio of each current source was selected among the values 118, 114, 112 and 111 according to the output capacitive loads of its gate and the operating frequency. Figure 6 gives the per-cycle energy consumption of our designs as a function of their operating frequency for a uniformly distributed random input sequence. The
