MOSFET gate oxide scaling limits are examined with respect to time-dependent breakdown, defects, plasma process damage, mobility degradation, poly-gate depletion, inversion layer thickness, tunneling leakage, charge trapping, and gate delay. It is projected that the operating field will stay around 5MV/cm for reliability and optimum speed. Tunneling leakage prevents scaling below 2nm, which is sufficient for MOSFET scaling to 0.05µm.
Introduction
In order for a MOSFET to behave as a transistor, the gate must exert greater control over the channel than the drain dose, i.e., the gate to channel capacitance must be larger than the drain to channel capacitance. A simple model suggests (1) L min ∝ T ox • X j 1/3 [1] A survey of the literature would reveal that the most thorough device physics studies of the late 1970's pegged the scaling limit of MOSFET at 0.5µm. In the mid 1980's, it was 0.25µm. Today, no device physics barrier is foreseen for scaling to at least 0.1µm. What fogged the crystal balls of the yesteryears was the uncertainty over the minimum acceptable T ox in Eq. [1] . The scaling limit of T ox is therefore of paramount importance. Besides suppressing the short channel effect, reducing T ox improves I d and generally but not always raises circuit speed. Thinner tunnel oxide would also be desirable for lowering the program voltage of nonvolatile memory. Clearly, there are many strong incentives to reduce T ox at each technology generation. What, then, are the limits to oxide scaling? This paper attempts to answer this critical question.
Oxide Breakdown
Oxide breakdown has historically been the limiting factor in choosing T ox . The past pessimistic predictions of L min can be directly attributed to a lack of understanding of the oxide breakdown limit.
•Intrinsic Breakdown Field. If gross "defects" are not present, i.e., if one studies oxide samples which are very much smaller than 1mm 2 in size, the lifetime of the oxide is surprisingly predictable (2) . Temperature effect (3), a physical interpretation and refinement for very thin oxide(4) have been presented. The model predicts the oxide lifetime as a function of T ox and V ox very well ( Fig. 1 ) (4). Both data extrapolation and the model predict that oxide can have 20 years lifetime at 125°C up to oxide field of 7MV/cm, 8MV/cm for below 5V operation. In Fig. 2 , for example, 5.5V operation can use 8nm, 3.6V operation requires only 4.5nm, and 2.75V requires 3.3nm. •Derating of Breakdown Field due to Defects. A manufacturable process must provide a soft margin above the intrinsic breakdown limit. Recent production experiences suggest that ~30% derating to 5 to 5.5MV/cm seems to be reasonable, i.e., 11nm for 5.5V, 6.5nm for 3.6V, and 4.5nm for 2.75V. For a given manufacturing line, one can use an "effective thickness" defect model to predict the product oxide yield, reliability failure rate, optimal burn-in condition, etc. from the statistical distribution of the breakdown voltage of oxide test samples (5).
•Process Induced Damage. Fig. 3 shows that 18nm oxide with large "antenna" structure has low interface trap density after plasma etch indicating little plasma-induced damage. 11.6nm oxide has an order of magnitude more interface traps, reflecting significant damage. Surprisingly, the very thin 6.4nm oxide exhibits much less damage than the 11.6nm oxide. This exciting trend had first been predicted by a model illustrated in Fig. 4 (6) . Langmuir equation describes the plasma current (I p ) collected by the antenna (plasma probe). Fowler-Nordheim equation describes the oxide current, I FN . The intersections determine the stress currents and voltages in this plasma process equipment. Clearly, the oxide stress current hardly increases as T ox is reduced from 10 nm to 0 nm. To understand Fig. 3 , one needs to also realize that thinner oxide (6.4nm) is more tolerant of the same current stress than thicker (11.6nm) oxide.
Transistor Current and Speed All else being equal, MOSFET current always increases when T ox is reduced, although at a lower rate than a simple model might suggest because of mobility reduction, polysilicon gate depletion, and finite inversion layer thickness. Gate speed, on the other hand, may slow down due to excessive T ox reduction.
•Mobility. Electron and hole mobilities have been shown to be highly predictable function of (V g +V t )/T ox as shown in Fig. 5 (7) . Once V g (=V dd ), V t , and T ox are specified, mobilities are known. See (7) for modification for buried channel MOSFET. •Polysilicon Gate Depletion and Inversion Layer Quantization. Gate depletion can be simply modeled with the usual MOS (polysilicon gate being S) depletion equation (8) . The big surprise, still unexplained, is that the polysilicon doping concentration at the oxide interface seems to be only around 3x10 19 cm -3 . At E ox =5MV/cm, the depletion layer thickness is about 0.6nm equivalent oxide thickness or 0.3V reduction in Vg. This is a significant "excess oxide thickness" as we reduce Tox to 6nm or below. Even with the polysilicon gate biased into accumulation, the CV data would indicate an "electrical T ox " that is about 0.5nm thicker than the physical T ox as determined by optical techniques or from tunneling IV characteristics. This is because the inversion layer carriers are located at about 1.5nm (0.5nm effective oxide thickness) below the Si/SiO 2 interface. The middle curve in Fig. 6 (9) is the I dsat predicted by today's standard 2D device simulators for some future technologies. The bottom curve is obtained when polygate depletion and inversion layer quantization are included in the simulation. As an important beside, the top curve includes polygate depletion, inversion layer quantization, and velocity overshoot (energy balance) effect (9). •Speed. Fig. 7 shows the projected inverter delay time versus the T ox (7) . For low V dd operation, maximum speed may dictate the use of an oxide that is thicker than what the 5MV/cm breakdown limit would allow. While the optimal thickness should be thinner than what Fig. 7 suggests if the circuit is more heavily loaded by the interconnect capacitance, anecdotal data of large CPUs actually agree with Fig. 7 well.
Oxide Leakage and Device Drift
•Direct Tunneling. Whenever oxide voltage is lower than 3.2V (the Si/SiO 2 barrier voltage), the electron tunneling barrier changes from being triangular to trapezoidal and the oxide current, known as the direct tunneling current, remains high at even 1V and is very sensitive to T ox as shown in Fig. 8 (4) . Static logic circuits can tolerate large gate leakage, e.g. 1A/cm 2 (even though the junction leakage is typically 1µA/cm 2 ). For 1V operation, Fig. 8 would suggest 2 or 2.5nm as the scaling limit. DRAM can tolerate less oxide leakage and typically bootstraps above V dd . 3nm may be the T ox limit for DRAM, making scaling below 0.1µm difficult. This discrepancy needs a resolution. •Stress Induced Leakage. High field stress of thin oxide creates low-field leakage, apparently through the generation of neutral oxide traps that facilitate electron tunneling. Fig. 9 highlights the different behaviors of thick and thin oxides (10) . This leakage makes nonvolatile memory tunneling oxide scaling much below 8nm difficult and perhaps impossible unless the 10 year charge retention requirement is relaxed (11) . fluence in 20 year's time? Only preliminary studies have been reported (12) and early indication is that 3nm oxide can tolerate at least 1000 coul/cm 2 , i.e. 20 years at 1V, of charge passage without significant drift. Fig. 10 and Fig. 7 summarize the consideration for choosing T ox . 20 year intrinsic lifetime can be achieved at 7MV/cm in 8nm oxide, rising gradually to 9MV/cm in 3nm oxide. 30% derating for defects puts the field limit at 5MV/cm rising to 6 MV/cm. Below 2.5V, oxide leakage ( Fig. 10) and circuit speed (Fig. 7) , rather than oxide breakdown, will dictate the choice of the oxide thickness.
Summary and Projection
(Oxide defects will still need to be vigilantly controlled.) The optimal T ox are: 12nm if 5V/0.5µm, limited by defect breakdown; 6.5nm if 3.3V/0.35µm, limited by defect breakdown and circuit speed; 4.5nm if 2.5V/0.25µm, limited by defect breakdown and circuit speed ; 4nm if 1.5V/0.18µm, limited by circuit speed; 3nm if 1.5V/0.1µm, limited by circuit speed and defect breakdown; 2nm if 1V/0.05µm, limited by leakage. The above T ox values are the final physical thickness. Electrical thickness would be 0.5nm thicker . Tox can be up to 30% thinner than those listed above if only small area of oxide is used ( so that the margin allowance for defects can be small), e.g. 3.3V I/O devices in 2.5V products, etc. (13). Excess oxide leakage will limit scaling just above 2nm, which makes 0.05µm MOSFET possible. DRAM gate oxide thickness will reach limit earlier at 3nm. Plasma process damage will not become much more serious with future oxide scaling.
