Abstract-The simultaneous reduction of power supply and threshold voltages for low-power design without suffering performance losses will eventually reach the limit of diminishing returns as static leakage power dissipation becomes a significant portion of the total power consumption. This is especially acute in systems that are idling most of the time. In order to meet the opposing requirements of high performance at reduced power supply voltage and low-static leakage power during idle periods, a dynamic threshold voltage control scheme is proposed. A novel Silicon-On-Insulator (SOI)-based technology called Silicon-OnInsulator-with-Active-Substrate (SOIAS) was developed whereby a back-gate is used to control the threshold voltage of the frontgate; this concept was demonstrated on a selectively scaled CMOS process implementing discrete devices and ring oscillators. For a 250 mV switch in threshold voltage, a reduction of 3-4 decades in subthreshold leakage current was measured.
Back-Gated CMOS on SOIAS For Dynamic Threshold Voltage Control
Isabel Y. Yang, Carlin Vieri, Anantha Chandrakasan, and Dimitri A. Antoniadis, Fellow, IEEE Abstract-The simultaneous reduction of power supply and threshold voltages for low-power design without suffering performance losses will eventually reach the limit of diminishing returns as static leakage power dissipation becomes a significant portion of the total power consumption. This is especially acute in systems that are idling most of the time. In order to meet the opposing requirements of high performance at reduced power supply voltage and low-static leakage power during idle periods, a dynamic threshold voltage control scheme is proposed. A novel Silicon-On-Insulator (SOI)-based technology called Silicon-OnInsulator-with-Active-Substrate (SOIAS) was developed whereby a back-gate is used to control the threshold voltage of the frontgate; this concept was demonstrated on a selectively scaled CMOS process implementing discrete devices and ring oscillators. For a 250 mV switch in threshold voltage, a reduction of 3-4 decades in subthreshold leakage current was measured.
I. THE SOIAS CONCEPT
T HE CONCEPT of the Silicon-On-Insulator-with-ActiveSubstrate (SOIAS) technology can be taken to many levels of complexity. The fundamental idea behind this technology is to add one or more conductive under layers beneath the buried oxide of a Silicon-On-Insulator (SOI) structure. Such layers can serve as buried interconnects, gates or both. To take this idea even further, one can imagine stacked SOI structures with embedded interconnects and gates in between them. The fabrication of SOIAS structures leverages off from many of the technologies developed for bulk and SOI CMOS processes (e.g., CMP and wafer bonding). There are several options and various degrees in which the buried layer or layers can be rendered conductive. On one extreme, the buried layer can be a refractory metal such as tungsten, or silicides of such metals which can withstand subsequent hightemperature processing. In this case, the buried conductive layer must be pre-patterned prior to bonding which can make the bonding process more challenging. On the other extreme, a blanket insulating/semi-insulating layer (e.g., intrinsic amorphous/polycrystalline silicon) can be used, and selective areas of the buried layer can be made conductive by ion implantation with dopants. This work focuses on the development of the latter approach with one buried layer of intrinsic polysilicon for the purpose of dynamic threshold voltage control in lowpower applications.
Manuscript received June 3, 1996; revised January 6, 1997. The review of this paper was arranged by Editor G. W. Neudeck. This work was supported by the MIT Lincoln Laboratory. The work of I. Yang was supported by AT&T through a graduate fellowship.
The authors are with the Department Electrical Engineering and Computer Science, Massachusetts Institute of Technology, Cambridge, MA 02139 USA.
Publisher Item Identifier S 0018-9383(97)03008-6.
II. DYNAMIC THRESHOLD VOLTAGE CONTROL CONCEPT
Many system computations are either temporally or spatially localized. Systems that are frequently idle, i.e., doing computation only for a small fraction of the time, operate in burstmode, and hence exhibit temporal locality. On the other side of the spectrum are systems that operate in continuous mode (e.g., active all of the time), and hence do not exhibit temporal locality. At the same time, a system may only have a fraction of its functional modules active all of the time; such systems exhibit spatial locality. This idea can be applied to lower levels of the hierarchy such as at the logic gate level or the transistor level. A global strategy for achieving high performance and low power in continuously computing systems (e.g., modules of a video compression system) has been the simultaneous reduction of supply voltage and threshold voltage where the optimal and are found for minimum total system energy by trading off dynamic energy for static leakage energy [1] - [3] . CMOS-based high-performance burst-mode computation systems (e.g., a microprocessor running an Xserver or cellular phone which is idling more than 90% of the time) will suffer high-static leakage energy dissipation operating at low with constant low even with clocks stopped. For example, even when a user is continuously entering data at the keyboard, the X-server is active, (i.e., doing computation), only 2-3% of the time [4] . In order to simultaneously achieve high performance during active periods and low leakage power during idle periods for burstmode computational systems, several schemes of reducing the leakage current have been proposed. The multiple CMOS design involves using high transistors to gate the low blocks [5] , [6] . Both NMOS and PMOS transistors are needed in order to preserve state. These devices must be made large due to the finite resistance of these transistors. This will incur additional switching energy to switch these devices. Therefore, appropriate sizing of the high transistors is crucial. Another approach is the dynamic control of by biasing the bulk-CMOS wells [7] . A triple well technology is required for this scheme. Furthermore, well biasing is complicated by the Nwell to P-well junction leakage current as well as source/drain to well junction leakage currents. Both of the above schemes are implemented at the functional module level; for example, in the well biasing scheme, all the transistors in the functional module have the same variable which is dependent on the well bias.
The aforementioned technologies have been mainly proposed for implementation in bulk silicon CMOS. However, the maturity of the SOI technology in the past few years cannot be 0018-9383/97$10.00 © 1997 IEEE ignored, especially with the dramatic improvements in material quality. There are two modes of operation for SOI MOSFET's: 1) fully depleted (FD) and 2) partially depleted (PD) channel region (body). In the conventional strongly FD SOI device, the silicon film thickness is usually less than or equal to half of the depletion width of the bulk device. The surface potentials at the front and back interfaces are strongly coupled to each other and capacitively coupled to the front-gate and the substrate through the front-gate oxide and buried oxide, respectively. Therefore, the potential throughout the silicon film, and hence the charge, is determined by the bias conditions on both the front-gate and the substrate. By replacing the substrate with a back-gate, the device becomes a dual-gated device. The FD design is unique to SOI because the front-gate and back-gate both have control of the charge in the silicon film. In the strongly PD SOI, the back-gate or substrate has no influence on the front surface potential. In the middle regime, the device is nominally PD and can become FD by applying a back-gate bias, thus, coupling of the front and back surface potentials still occurs. There have been numerous studies on the merits of fully depleted SOI CMOS and its implications for low-power electronics. Various researchers have exploited the use of FD SOI in dualgated devices in which the top and bottom gates are tied and switched together, resulting in enhanced transconductance [8] - [11] . The SOIAS technology was developed to fabricate back-gated FD CMOS devices by capitalizing on existing SIMOX, wafer bonding, and thinning technologies [12] . The back-gate controls the of the front-gate device, and the NMOS and PMOS back-gates are switched independently from each other and the front-gates. For burst-mode highperformance and low-power applications, the threshold voltage would be raised during idle periods to reduce the static leakage current, and lowered during active periods to achieve high performance. Similar to the well biasing scheme, the SOIAS technology is proposed to be implemented at the functional module level. This paper describes the development of the SOIAS technology with implementation in a selectively scaled CMOS SOI baseline process, and a theoretical evaluation for low-power logic applications.
III. SOIAS PREPARATION AND MATERIAL CHARACTERIZATION
The SOIAS substrate is a multilayered blanket film stack consisting of the silicon wafer, insulating oxide, intrinsic polysilicon, back-gate oxide, and silicon film. Two different approaches have been taken for preparing the SOIAS wafers. The first is the more traditional route of the BESOI process. In this case, the device wafer includes the back-gate oxide (to be) which is obtained by dry thermal oxidation, and the back-gate material to be which is amorphous silicon (as deposited). This device wafer was then bonded to the handle wafer which was also oxidized to form approximately 1 m of silicon dioxide. Therefore, the bonding interface is between the amorphous silicon and the thick insulating oxide. After bonding, the wafers were annealed in N at 1000 C for 1 h. The device wafer was then thinned back by chemical and mechanical polishing. Finally, localized plasma thinning (Accu-Thin) 1 was used to improve silicon film uniformity. The second approach involves the bonding of a SIMOX wafer. The buried oxide, in this case, served as an etch-stop using wet chemistry wafer etching. The same layers as described above were grown on the SIMOX and the handle wafers. The bonding interface was still between the amorphous silicon and the insulating oxide. Fig. 1 depicts the SOIAS preparation for the bonded SIMOX process. The bonded SIMOX wafers were etched in 25 wt% Tetramethyl Ammonia Hydroxide (TMAH), commonly known as photo resist developer, to remove the bulk of the SIMOX wafer, stopping on the buried oxide. The selectivity of silicon to oxide in TMAH is about 5000 to 1 at 80 C [13] . Therefore, the resulting silicon film thickness is as uniform as that of the original SIMOX wafer. Final thinning of the silicon film was accomplished with thermal oxidation and wet oxide strip. Amorphous silicon was used as the back-gate material (to be) because as-deposited amorphous silicon is very smooth which facilitates direct bonding to the oxidized handle wafer. Fig. 2 shows the SOIAS wafers prepared by the bonded SIMOX and BESOI processes. The integrity of the SOIAS substrates is compared with the conventional SIMOX wafers through measurements of the effective electron mobility and Time Zero Dielectric Breakdown (TZDB) tests for intrinsic oxide quality. Fig. 3 shows the effective electron mobility versus effective transverse electric field of the front-gate device for conventional SIMOX and SOIAS. The universality of the curves indicates no apparent difference between the SOIAS and SIMOX substrates from a device operation point of view. Fig. 4 shows the cumulative percentage failure comparison of SOIAS with bulk and SIMOX substrates in the TZDB test. The bonded SIMOX SOIAS is slightly worse than the bond and etch-back SOIAS as well as the bulk and SIMOX. Overall, the intrinsic oxide breakdown of the SOIAS is comparable to those of bulk and SIMOX.
IV. DEVICE FABRICATION
The device fabrication on SOIAS follows the conventional CMOS SOI process with two additional steps. The back-gates were formed first by ion implantation through the silicon film in two masking steps, resulting in islands of p and n polysilicon insulated by intrinsic polysilicon after thermal anneal. Using the same type of doping in the back-gate polysilicon and silicon film resulted in near-zero flatband voltage at the backgate. By properly tailoring the energy and dose of the implant, the back-gate and the -adjust implants for setting quiescent value (i.e., at zero back-gate bias) can be done in one step. The peak of the back-gate implant is placed deep in the back-gate polysilicon, and the leading edge of the implant is used to dope the silicon film. Fig. 5 shows examples of the as-implanted and final boron and phosphorus concentrations in the silicon and back-gate from Suprem3 simulations. Typical sheet resistance of the back-gate poly is in the 1-5 K square range for the shown dopant concentrations in the back-gate. The front-gate device is then built as in a conventional SOI CMOS process using LOCOS isolation with an additional step of cutting the back-gate contacts. The back-gates were contacted through the top by cutting through the field oxide. Fig. 6(a) illustrates the final device schematic, and Fig. 6(b) is a SEM micrograph of the SOIAS device. The coupling between the front and back-gates depends on the ratio of the critical film thicknesses: front-gate oxide thickness , silicon film thickness , and back-gate oxide thickness We have demonstrated SOIAS with 9-nm , 40-nm , and 100-nm nominal design parameters in a selectively scaled 1-m baseline CMOS technology.
V. DEVICE RESULTS
Figs. 7 and 8 show the and subthreshold device characteristics for NMOS and PMOS at two different threshold voltages tuned by biasing the back-gate; is 0.44 m and 0.35 m, respectively. A 250 mV change in threshold voltage results in a 3.5-4 decade reduction in off current and a 50-80% increase in on current at 1 V operation for PMOS and NMOS respectively. Fig. 9 shows the maximum and minimum tunable limits for the above nominal design parameters. The -axis is the designed quiescent at V). The quiescent was obtained either by varying the doping or the silicon film thickness. Therefore, for low quiescent devices, the film is strongly fully depleted, i.e., either the doping level is low or the silicon film is thin. Similarly, for the high quiescent devices, either the doping level is high or the silicon film is thick. The -axis, tunable was obtained by applying various back-gate biases. The tunable range is quite large (approximately 1 V) for fully depleted back interface as can be seen for the lowest quiescent case V). The limits of the upper and lower tunable range are determined by the back interface becoming either accumulated or inverted, in which case the back-gate becomes decoupled from the frontgate. Even for the partially depleted highest quiescent case V), there is still a reasonable tuning range (approximately 0.5 V). This has implications for making FD SOI a viable technology since the threshold voltage and the device operating mode can be controlled precisely by the backgate. Fig. 9 demonstrates that can be fine tuned over a wide range despite variations in (average thickness 48.4 nm, maximum thickness 69.9 nm and minimum thickness 37.6 nm) and For example, a nominal of 500 mV can be reached even for a 400 mV deviation by using a 5-6 V back-gate bias. Typically, only a 200 mV switch in the is sufficient to achieve approximately three decades reduction in the subthreshold leakage current. This design range fits wellwithin the limits of the tunable band for the given films thicknesses and doping levels as shown in Fig. 9 . Fig. 10 shows the frequency of a 101-stage ring oscillator as a function of varying the back-gate-controlled for either the NMOS or PMOS only; hence, complete independent control of the NMOS and PMOS device threshold voltages. Fig. 11 shows the actual output of the ring oscillator. For a 200 mV change in for both the NMOS and PMOS, the result is a 36% change in the speed at of 1 V. In order for this scheme of dynamic threshold control to work properly, the must change quasi-statically with back-gate switching. This is of concern because of the relatively high back-gate sheet resistance. The verification of this quasi-static control of the was carried out with an experiment depicted in Fig. 12 . An NMOS transistor's gate was tied to of 0.5 V which is close to the threshold voltage of this device at zero backgate bias, and a 50 ohm resistor was placed between and the drain. The back-gate was pulsed at various frequencies and pulse heights while the output at the drain, was monitored. The device under test has an annular gate with a large 23 25 m back-gate with one contact off to the side. When the back-gate input pulse is high, is low because more current is pulled through the resistor due to a lowered Fig. 12 shows the at 5, 10, and 20 MHz pulses on the back-gate. Even at 20 MHz, the is still following the input pulse for this fairly large back-gate. Knowing the value, the dynamic current due to lowering of the i.e., switching of the back-gate, can be overlayed onto the dc measured currents for various back-gate biases, i.e., pulse heights. The dynamic current was simply calculated as ( where is the resistor value. Fig. 13 shows the composite of these two measurements. The -points lying precisely on the dc measured current curves is indicative of the quasi-static control of the device through dynamic back-gate biasing.
VI. DYNAMIC OPERATION

VII. APPLICATION TO LOW-POWER SYSTEMS
Having demonstrated the technology, a theoretical model was developed to evaluate and compare the total energy total physical capacitance (gate capacitance front-gate overlap capacitance fringing capacitance back-gate overlap capacitance) low low off current high high off current back-gate oxide capacitance back-gate bias The total energy equation for the SOIAS is composed of three components, the dynamic switching energy, the static leakage energy, and the overhead energy required to switch the back-gate. For the SOI technology, the total energy is composed of the dynamic switching energy and the static leakage energy. The applicability of SOIAS technology is a strong function of system, functional block and transistor usage, i.e., the parameters , and is the module activity factor which is the fraction of time a module (e.g., an adder) is "on," i.e., doing computation.
is the back-gate activity factor which is dependent on how frequently the module is "on". One important point to note here is that the back-gate activity factor is always less than or equal to the module activity factor For example, an adder can be active several cycles in a row, and hence the for this module is then left in the low state during those cycles. Therefore the back-gate only needs to be switched once for all the consecutive cycles that the module is active. The dynamic switching energies for the SOIAS and SOI technologies are approximately the same for burst-mode as well as continuously computing systems. Therefore, the sum of the static leakage energy and the back-gate switching energy for the SOIAS technology must be less than the static leakage energy of the constant low SOI technology for low-power applications. For burst-mode computational systems ( 1 2%), the and , are numbers much smaller than one. Therefore, the back-gate switching overhead energy is small due to being a small number. The static energy for the SOIAS technology would also be much less than that of the SOI technology because the low leakage energy is weighted by a very small number and the high leakage energy is low due to low subthreshold leakage. In order to determine functional block usage patterns and , a series of program profiling experiments were performed using the ATOM code instrumentation interface [14] for a particular microprocessor implementation, compiler technology, and various algorithms. The ratio of the total energy dissipation for SOIAS and SOI was analyzed as a function of algorithm and architecture dependent parameters and , see Fig. 14) . The simulation parameters are: nm for the practical limit of thinning the silicon film, nm for a 0.25-m technology; , the probability of a gate switching in one active cycle, is assumed to be 40% and is obtained by estimating a ripple carry adder under random input pattern in general is a strong function of bit transition probabilities), 100 MHz clock frequency, low mV, high mV, and V. minimize the cost of switching the back-gate, the source/drain overlap must be minimized and t box must be optimized. For the low-activity modules, the shifter and multiplier, SOIAS with any t box and S/D overlap value in this space will provide energy savings. The higher activity adder module design space is shrunken due to the higher cost of switching the back-gate. The filled triangle and circle symbols correspond to the same points in Fig. 14. age which does not exhibit strong temporal locality (e.g., adder and shifter in a continuously computing system, 100%), the back-gates would not be switched, and hence would be equal to at constant low However, if the back-gate were to be switched in such systems, the cost in energy for switching the back-gate is high, and therefore, the would be greater than This is indicated in Fig. 14 by the dotted open symbols. In a system which is frequently idle while awaiting I/O, such as an X-server with , the SOIAS technology dissipates much less energy than conventional SOI: 43% less for the adder , 80% less for the shifter , and 97% less for the multiplier This is indicated by the filled symbols in Fig. 14 . In this case, the savings in static leakage energy due to changing of the is much greater than the overhead energy due to switching of the back-gate. As the system and module activities become lower, the energy savings is even greater.
For the given burst-mode algorithmic parameters, the latitude in the technology design can be shown in technology parameter space, in this case, and source/drain overlap with the back-gate are the parameters of choice because they reflect the energy cost of switching the back-gate. Fig. 15 shows the ratio of total energy dissipation in the technology design space for the three modules. The plane outlined in dark is the break-even plane for SOIAS and conventional SOI technology. The S/D overlap with the back-gate is normalized to gate length. For low-activity modules (the multiplier and shifter), the design space in favor of the SOIAS technology spans the entire parameter range under study for both and S/D overlap, Fig. 15(b) . This implies that for any design value of and S/D overlap in this space, the back-gate switching energy is significantly less than the static leakage energy saved by dynamically controlling the For highactivity modules (the adder), the design space in favor of SOIAS technology is smaller because higher module activity implies higher switching frequency of the back-gate, and hence higher energy cost, see Fig. 15(a) . There is, however, an optimal range of which allows the most S/D overlap, i.e., the most process latitude, where the energy is minimized for the adder module.
VIII. CONCLUSIONS
Successful preparation of SOIAS substrates is a crucial part of this technology, and we have shown the preparation of substrates using two different approaches. The dynamic control of threshold voltage has been successfully demonstrated in a baseline CMOS process. Furthermore, the dynamic operation of these devices was shown to be robust and the quasi-static control of the was verified. The flexibility in threshold voltage control through back-gate biasing from partially depleted to fully depleted devices provides a viable option for FD SOI. Finally, the theoretical energy evaluation of the SOIAS technology for low-power design of burst-mode computational systems showed significant energy savings. Currently, she is involved with CMOS device design and process integration at the Advanced Products Research and Development Laboratory, Motorola, Austin, TX.
Carlin Vieri received the B.S. degree in electrical engineering and computer science from the University of California, Berkeley, and the M.S. degree from the Massachussetts Institute of Technology, Cambridge. He is currently pursuing the Ph.D. degree at the MIT Artificial Intelligence Laboratory. He has worked on high-performance/low-power parallel computing and the development of specialized integrated circuits employing power management techniques. His main interest is in reversible computation structures.
