I. INTRODUCTION
Plasmas have been used to etch fine features in Si integrated circuits for nearly 40 years. Without this technology, we would be stuck in the 1970s listening through tinny headphones to disco music on our "small" portable cassette tape player. Carrying laptops around would be more for fitness than for convenience and mobile "smart" phones would require wheels. Today, instead we take these marvelous devices for granted. Among the many important breakthroughs that were required to make this all possible, plasma etching plays a major role in allowing complex circuit patterns printed in a photolithgraphically defined polymer to be transferred to the silicon, silicon dioxide, and metals that make up the integrated circuits at the heart of these devices.
The first commercially available microprocessor, the Intel 4004, was launched in 1971. It was a 4 bit processor, contained 2300 transistors, operated at 1.08 MHz clockfrequency, and a minimum feature size of 10 lm. 1 Intel's third generation multicore processors, launched in late 2012, are 64 bit processors, containing 1.4 Â 10 9 transistors, operating at roughly 3 GHz clock-frequency and a minimum feature size of 22 nm. 2 Although many factors contributed to the advances in microprocessors' performance, a key element has been the ability to fabricate smaller transistors. This is attributed to advancements in lithography and pattern-transfer methods. The purpose of this review is to cover the advancements in the latter. In the early days of integrated circuit fabrication, pattern-transfer was accomplished by wet etching. However, with time, plasma etching became the preferred method.
Here we attempt to provide a modern review of this field in a comprehensive as possible manner. Given the scope of this undertaking, this is a nearly impossible task. Many important studies will be left out. We also note that there are several earlier books on plasma etching 3, 4 as well as more detailed treatments of important aspects such as plasma physics and electrical engineering. [5] [6] [7] Instead, the attempt here is to cover in some detail the applications of plasma etching in integrated circuits and to a lesser extent, in microelectromechanical systems (MEMS) devices. The subject is placed in historical perspective and is accompanied by a discussion of mechanisms of plasma etching and selected diagnostics that provide both fundamental insights into plasma etching processes and are in widespread use in manufacturing. An attempt is also made to predict the future needs for plasma etching, looming problems, and possible solutions.
II. BRIEF HISTORY
The use of glow discharges dates back to the late 19th century where sputtering, first discovered by Grove 8 and also observed near the electrodes in vacuum tubes, was used for the production of mirror surfaces. 9 The term "plasma" to designate partially ionized gas is attributed to Irving Langmuir who studied glow-discharges, and according to his colleague and collaborator, Lewi Tonks, 10 coined the term during a discussion between them. The first known use of the term in the literature is dated 1928. 11 In the early days of integrated-circuit processing, wetetching was used for pattern transfer. With time, however, plasma-based pattern transfer replaced wet chemistry for most if not all the steps. The development of modern plasma-etching equipment for pattern-transfer evolved along a) Electronic mail: vmdonnelly@uh.edu b) Electronic mail: avtek0@gmail.com two paths that eventually merged into the current configuration, where the substrate to be etched is placed on a radio frequency (RF)-powered electrode, with or without an additional plasma-generation source. The two approaches, physical (i.e., sputtering) and chemical, had different goals. The former was needed for pattern transfer that could not be accomplished by wet-chemical means, while the latter was intended to replace wet chemicals in IC fabrication.
The first path, sputtering, was used in the early 1960s for the fabrication of beam-lead devices (beam-leads were an alternate technology to wire-bonding, where gold semirigid cantilevered leads were used to connect the device to the outside world). In the process, developed by Martin Lepselter from Bell Laboratories, [12] [13] [14] gold leads, 12.5 lm in thickness, were electroplated onto a metal stack consisting of an adhesion-promoting layer or a "glue-layer" (titanium or zirconium) and platinum (needed to prevent a chemical reaction between the gold and the glue-layer). The etching of the metal stack below the gold could not be easily accomplished by wet etching. For instance, aqua regia (one part concentrated HCl plus three parts concentrated HNO 3 ), one of the few chemicals that will etch platinum, 15 will etch gold about 190 times faster, 16 leading to severe undercutting of the gold. Back-sputtering of the platinum solved the problem, sometimes with the downstream addition of oxygen 17 to increase the selectivity to the glue-layer. In a further development, an RF sputtering method (using an argon plasma) was implemented by Davidse 18, 19 from IBM, utilizing a "blocking" capacitor 20 to couple the RF generator to the electrode on which the substrate had been placed. The resulting negative bias led to ion acceleration toward the biased electrode and was used to pattern cermet-film resistors.
The chemical approach for plasma etching in the semiconductor industry started in the late-1960s when Stephen Irving from Signetics demonstrated the ability to strip photoresist in oxygen plasma. [21] [22] [23] The reactor, that had been used to remove organic residues from various substrates by "burning" or "ashing" them, was manufactured by Tracer Labs, a division of LFE Corporation. 24 It consisted of a reaction vessel, where an external coil was used to generate the plasma by an electrodeless discharge [the concept was not new and was used first by Thomson in 1891 (Ref. 25) ]. However other issues, such as residues and device damage 23 [associated with ultraviolet (UV) radiation] had to be addressed before plasma-ashing became a viable alternative to wet stripping. Irving recognized the wider implication for etching other materials as well by using either fluorine or chlorine-based compounds to etch SiO 2 or aluminum, respectively. 26 The motivation for use of plasma to clean or pattern semiconductor devices was driven primarily by the need to reduce chemical waste associated with the use of wet etchants, 23, 27 but other advantages became apparent as well. As silicon-nitride became the material of choice for the encapsulating layer, there was no wet etchant that could be used to pattern it to form the contact to the aluminum metal layer below. 27 Fluorine based plasmas became the obvious choice to pattern the nitride without eroding the aluminum metal in the contact-pads.
While etching SiO 2 with fluorine-compound based plasma looked attractive, it would not be practical if the selectivity (i.e., the etch-rate ratio of SiO 2 to the underlying substrate) were low. For instance, CF 4 plasmas can etch silicon faster than SiO 2 . The issue of additives (e.g., H 2 ) or alternative gases (C 2 F 6 , C 3 F 8 , CHF 3 ) to increase selectivity with respect to silicon was discussed extensively by Heinecke. 28, 29 The early plasma reactors were barrel type, where wafers were placed in a quartz chamber with external electrodes 30 (or a coil- Fig. 1 ). The appropriate gas was introduced into the chamber while RF was applied across the electrodes to generate the plasma. These etchers, while adequate for resist stripping, lacked wafer temperature control and, at least initially, suffered from poor etch uniformity, required for device processing. It was not until the radial-flow reactor was introduced that plasma etching became a viable production alternative to wet etching for patterning. In this reactor (known also as the Reinberg reactor 31 ), wafers were placed on the grounded electrode, while RF is applied to the opposite electrode (Fig. 2) .
The convergence of the two approaches for plasma etching took place when Hosokawa et al. 32 introduced fluorine and chlorine-containing gases (e.g., CF 4 , CCl 3 F, CCl 2 F 2 , etc.) instead of Ar to a RF sputtering apparatus. The motivation was to increase the etch rate of various materials, such as silicon, glass, aluminum, molybdenum, stainless steel, and photoresist. However, there was no discussion of the applicability of the technique to pattern transfer. The technique, now called reactive ion etching (RIE), [33] [34] [35] reactive sputter etching (RSE), 36, 37 or ion-assisted plasma etching, 38 became the method of choice (with some enhancements that will be discussed in the next section) for patterning devices.
The first all dry-etched device was processed in 1975 by Texas Instruments. 27 The motivation was to reduce the amount of solvents in the processing line as well as the ability to pattern the silicon nitride passivation-layer to access the bond-pads. However, as critical dimensions (CD) became smaller and smaller, vertical dimensions approached and exceeded horizontal dimensions, mask undercutting became intolerable, and anisotropic etching became the primary motivation for using plasmas for pattern-transfer.
FIG. 1. Barrel reactor.
Wafers are mounted on a quartz boat inserted through a door (on left) into a quartz tube. After pumpdown, gas flow is initiated followed by RF power applied to a coil wrapping around the quartz tube.
III. EQUIPMENT EVOLUTION
The etching tools used in the early days for pattern transfer were diode reactors with the wafers placed either on the grounded or RF-powered electrode that was capacitively coupled to the plasma with an excitation frequency of 13.56 MHz. Tools with the former configuration were referred to "plasma reactors" while those with the latter configuration were dubbed "reactive-ion-etchers" (RIE) or "reactivesputter etchers" (RSE). The plasma reactors operated at relatively high pressures (hundreds of mTorr) with the two electrodes roughly equal in area [ Fig. 3(a) ], while reactiveion-etchers operated at lower pressures, with the powered electrode smaller than the grounded electrode [ Fig. 3(b) ]. The asymmetry of the two electrodes, coupled with the use of a blocking capacitor, yielded a negative dc self-bias voltage on the smaller (usually powered) electrode. The ratio of voltages across the sheaths near the powered and grounded electrodes was originally thought to vary as the ratio of areas of the grounded and powered electrodes to the forth power. 39 While the area ratio dependence was later shown to be much less severe, 40 the smaller electrode was nonetheless found to obtain a much larger sheath potential than the larger electrode. This causes more energetic ion bombardment of substrates placed on the smaller electrode, enabling anisotropic etching.
As the industry migrated toward plasma etching processes and replaced wet etching as the preferred method for pattern-transfer, throughput became an important issue. Reactive ion etchers that could process large batches of wafers were the early solution for the problem. 38, 41, 42 An example is shown in Fig. 4 . Although these etchers could handle up to eighteen 150 mm diameter wafers, the challenges were the control of etching uniformity (within a wafer as well as wafer-to-wafer) and wafer handling (automatic loading and unloading). As the industry migrated to 200 mm wafers, with the need to achieve better etching uniformity (both within-wafer as well as wafer-to-wafer), batch reactors became less attractive, and enhanced etching-rate single-wafer-etchers took their place.
The key to high-throughput single-wafer etchers is enhanced etching-rate and the ability to integrate multiple etching chambers on a single platform. Depending on the application, the higher etching rate can be achieved by increased pressure (>100 mTorr), the use of magnetic fields to confine the electrons, referred to as magnetically enhanced RIE (MERIE), or by using inductive coupling or microwave frequencies to achieve high-density plasmas (>10 11 positive ions/cm 2 ). In this case, the source is decoupled from power delivered to the stage to allow ion-energy control independent of plasma density. 43 
A. Magnetic enhanced reactive ion etching
In this method, electrons spiral around imposed magnetic field lines, increasing their trajectory toward the chamber walls. The end result is higher number of collisions per electron, leading to a higher ionization rate. Early MERIE tools utilized either permanent magnets 44 (e.g., MRC MIE-710) or moving magnets behind the wafers 45 (e.g., Tylan/Tokuda HiRRIE 500). The more common designs, however, involved magnetic fields generated outside the chamber walls either by rotating magnets (TEL DRM) or rotating magnetic-field generated by varying currents in two sets of perpendicular coils 45 (Applied Materials-a number of models). In the latter case (Fig. 5) , the magnetic field could be turned on and off as needed with the additional control of rotation frequency. At low pressures, the magnetic field enhances etching rates-the electrons' path to the chamber wall is longer, leading to higher ionization due to increased collision rate with neutrals. At higher pressures (!$100 mTorr), its effects are smaller, as the inelastic mean free path becomes much smaller than the reactor dimensions. Above 200 mTorr, contributions of the magnetic-field to etch rate are negligible.
B. Multiple-frequency capacitively coupled plasma etchers
The need to decouple plasma generation from ion-energy control was realized as early as 1979. 46 Etchers with dual RF powered electrodes were introduced in the mid-1980s (e.g., Drytek 384T and Tegal 1500). The former had two opposing electrodes powered with the same frequency (13.56 MHz), with a grounded chamber wall and a grid, 47 while in the latter, the two electrodes were powered with different frequencies, with a grounded third electrode. 48 Later generations of the Tegal triode etchers included magnetic confinement 49, 50 as well.
In current dual-frequency capacitively coupled plasma (CCP) reactors, marketed by TEL and Lam Research, highfrequency (!13.56 MHz) is applied to the upper electrode, and the lower electrode that holds the substrate is powered by the lower frequency. In newer configurations, there is the option to couple both frequencies to the lower electrode, with additional hardware to confine the plasma. 51, 52 This minimizes interaction with the chamber wall and facilitates more efficient chamber cleans that can be carried out after every wafer. The exact frequencies will vary by application, manufacturer, and equipment-generation. A schematic of a dual-frequency reactor is shown in Fig. 6(a) . These reactors normally operate at higher pressures than the high-density reactors discussed below and are usually used for dielectricetch applications.
More recently, Lam Research has introduced a CCP etcher where three frequencies, 2, 27, and 60 MHz, are utilized. 53, 54 In this configuration, the upper electrode is either RF powered with one frequency or grounded. In the former, two frequencies are applied to the lower electrode, while in the latter all frequencies are coupled to the lower electrode. 55 In another twist, introduced by TEL, the top electrode, in addition or instead of being RF-powered, may be DC-biased, while the lower electrode is RF biased [DC/RF hybrid reactor, [56] [57] [58] [59] [60] Fig. 6(b) ]. The reason for DC biasing is to generate high-energy ballistic electrons 59 that will accelerate toward the opposite RF powered electrode, with some reported benefits of reduced electron-shading and improved resist integrity. 61 
C. High density etchers
Numerous plasma etchers have been introduced where the plasma is generated by a source that is not capacitively coupled to the plasma. The ion-density in these etchers is generally about an order of magnitude or more higher than in the CCP etchers described above. The wafer is placed on an RFbiased lower electrode, and plasma is generated by a source placed a short distance above the wafer. Common highdensity sources are inductive, electron-cyclotron-resonance (ECR), surface-wave-plasma (SWP), and helicon. These and other sources are discussed extensively elsewhere. 43 
Inductively coupled source
In these systems, a coil (or multiple coils to control uniformity) outside the chamber is used for plasma generation. The coil could be planar 62 [ Fig potential, but they are usually coated with an insulating, ceramic layer and are often behind an insulating liner. The substrate is placed on the stage that is capacitively coupled to a RF source. As in CCP systems, the small electrode holding the wafer develops a negative dc self bias when RF power is imposed; hence, it can be thought of as a cathode. In some etchers, the plasma is confined by a magnetic field generated by fixed magnets.
Inductively coupled etchers generally operate at pressures of a few mTorr to tens of mTorr and are used for the etching of trenches, gate-electrodes, high-j dielectrics, aluminum and its alloys. There are also special configurations to etch materials that do not form volatile compounds (Hitachi offers such a system), including alloys of nickel and iron, noble metals, lead zirconate titanate (PZT), barium strontium titanate (BST), and others. In these cases, where sputtering is the mechanism for pattern transfer, the by-products accumulate on the chamber walls, which then have to be physically cleaned from time to time. If the etch by-products are conductive, however, they may coat the dielectric window; a capacitively coupled plasma reactor is preferred in these instances. 63 
Electron-cyclotron-resonance source
In an ECR plasma, 64 radiation is launched through a dielectric window into a low-pressure volume containing the gas to be ionized [ Fig. 7(b) ]. The frequency of the radiation in commercial etchers is typically in the microwave regime (2.45 GHz), but UHF (450 GHz) was used in some etchers as well mainly for dielectric etch applications. 65 The electrons generated are confined by a magnetic field generated by a few magnets and are forced to move in a circular motion. The frequency of rotation (cyclotron frequency) is given by ¼ ð1=2pÞðeB=mÞ, where e and m are the electron charge and mass, respectively, and B is the magnetic flux density. At resonance, the right-hand side equals the excitation frequency, and therefore B ¼ 0.0875 T (875 G) and 0.0161 T (161 G) for ¼ 2.45 GHz and 450 MHz, respectively. The chamber below the resonance cavity is typically surrounded by magnets for improved uniformity. Currently, the main applications of ECR etchers are silicon or aluminum etching.
Helicon source
Helicon sources 66 are not currently used in mainstream commercial etchers. The source utilizes a specially designed antenna and operates at frequencies in the megahertz range in the presence of an axial magnetic field, producing bounded whistler waves. 67 Depending on the antenna design, different modes of excitation are possible. Early helicon sources for plasma etching operated in the m ¼ 1 mode, [68] [69] [70] but the one that eventually gained presence in the commercial market operated at the m ¼ 0 mode 71 (M0RI source, first marketed by PMT and subsequently by Trikon). The former is characterized by electrical field lines that do not change their pattern with position but the pattern rotates along the direction of the B field, while the latter exhibits field lines that change direction from radial to azimuthal, depending on position. The etch chamber is often magnetically confined. The M0RI source has been used for etching silicon, metal, as well as dielectrics.
Surface wave plasma source
This source uses microwaves to generate a high density discharge without the presence of a DC magnetic field. 72 These type of etchers were first commercialized in the early 1990s (e.g., Sumitomo SW4010), but never became widely used. However, recently TEL has introduced a SWP etch chamber with a radial-line slot antenna, utilizing a 2.45 GHz microwave source. It has been reported that the electron energy distribution function (EEDF) in the plasma-generation zone is non-Maxwellian, 73, 74 and therefore no unique electron temperature, T e , can be assigned. 74 Away from the plasma generation zone the bulk of the EEDF approaches a Maxwellian distribution with T e around 1 eV. The intensity of vacuum ultraviolet (VUV) radiation near the wafer, linked to device damage, 75, 76 is reported to be considerably lower than the radiation associated with inductive sources. 74 
D. Downstream etchers
In these systems, plasma is generated in a remote chamber (either by a microwave or an inductive source), in a manner that does not expose the wafer to UV radiation, which can lead to device damage (e.g., threshold shift). 23 In addition, due to the long length of tubing between the source and the substrate, no charged particles reach the etching chamber and the substrate is exposed only to neutrals. These etchers are used for isotropic etching processes. The main application is for resist-stripping (ashing) with O atoms at elevated temperature (200-300 C) . The formation of nonreactive, ground state O 2 is suppressed by the addition of other gases such as N 2 .
77 N 2 /H 2 mixtures can also be used for resist stripping, albeit at a lower rate. 78 This mixture is useful when oxidation of the exposed substrate (e.g., TiN film, or low-j dielectrics) is to be avoided. 78, 79 Another application is the soft etch of silicon by atomic fluorine resulting from the dissociation of gases such as NF 3 or CF 4 at room temperature. Substrate heating, when necessary, is accomplished either by a heated chuck or lamp. Depending on the tool, wafers can be etched either by resting on the chuck, or on the lift-pins. In the latter configuration, both front and back-sides of the wafer can be etched simultaneously. The chuck can often be RF powered to initiate the etch by removing a hard-to-etch film on top of the layer to be removed, such as the crust formed on top of photoresist during ion implantation.
E. Temperature control
In all applications where photoresist is used as the mask, there is a need to keep the wafer temperature low enough to prevent the resist from flowing or reticulating. Both temperature and UV radiation generated by the plasma are the causes of the latter, and in fact act in synergy. 80 In other cases, temperature is a critical parameter in determining the rate of the etch process (e.g., aluminum etching) and it must be controlled. In earlier generations of etchers, this was accomplished by placing the wafer on the chuck, sometimes with force applied at the periphery of the wafer placed on a domed pedestal. Later generation etchers have utilized helium heatexchange gas, pressurizing the gap between the wafer and the chuck. This pressure is typically between 4 and 30 Torr, and the gap is within or below the mean-free-path of helium at the operating pressure (roughly 0.01 mm at 10 Torr and 40 C). Under these conditions, the heat transfer coefficient does not vary much with variations in the gap dimensions. 81 Although in principle other gases may be used for heat exchange, helium is preferred because of its high heattransfer coefficient and high ionization potential (i.e., the He leaking into the plasma does not perturb it). With the introduction of electrostatic chucks (ESC), the He leak-up rate into the chamber is used to monitor wafer clamping and the health of the ESC. Warped wafers (especially ones that are bowed upwards as a result of a highly tensile film on the front of the wafer) can lead to high helium leak-up rate and inadequate clamping or "chucking."
Both heated (T > 150 C) and cryogenically cooled chucks are used in some applications. The former is used in to increase the vapor pressure of the etch by-products (e.g., high-j etching), while the latter is used in limited applications to minimize lateral etching (e.g., silicon etching with SF 6 in a non-Bosch process).
F. Endpoint detection
At most etching levels, some endpoint detection is needed, first to ensure that etching is complete, and second to avoid excessive etching time that may erode the underlying layer. There are numerous methods that have been used over the years in research laboratories and production environment, such as interferometry, 82 ellipsometry, [83] [84] [85] and optical emission spectroscopy (OES). [86] [87] [88] [89] [90] While ellipsometry is a useful research tool, it has not found much use in production. Interferometry was widely used in batch etchers, but with the transition to single wafer etchers, optical-emission monitoring became the preferred method for endpoint detection, due to its relative simplicity in terms of hardware and software implementation. Interferometry is still useful in some applications, where etching steps are to be changed based on depth of material etched (see discussion below and example in Fig. 8) .
The interferometric method relies on interference between light beams reflected from the top and the bottom surface being etched. If the imaginary component, k of the refractive index, n þ ik, is small, a periodic reflected signal will be observed, with periodicity corresponding to Dd ¼ k=ð2 ffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi ffi n 2 À sin 2 h i p Þ, where Dd is the thickness associated with a period, h i is the angle of incidence with respect to the surface normal, and k is the wavelength of the light-source being used. The light source could be external [laser or UV (Ref. 91 )] or even internal, i.e., the plasma glow itself. 92, 93 In the latter case, it could be used to monitor the etching process over the entire wafer. From the periodicity of the reflected light, etch rates can be determined, and endpoint is realized when the reflected signal becomes flat (Fig. 8, end of trace I) .
The monitoring of reflectance can also be used to detect endpoint of an absorbing layer (high imaginary refractive index, k) on top of another by monitoring the change in reflectance. For example, this method was used to monitor the etching of aluminum in a batch reactor, using a He-Ne laser as the light source. 94 A large change in reflectance occurs when the metal is cleared. 94 A similar approach was used in the fabrication of x-ray masks to monitor the etching of tungsten sandwiched between two layers of Cr on top of a polysilicon membrane. 95 In a slight modification of the method, it can be used for endpoint prediction. 78, [96] [97] [98] For example, reflectivity changes have been used in etching of polysilicon gates to stop on less than 1 nm of thermal oxide, by initiating a selective overetch step before the polysilicon cleared. In this case, the wavelength of the external light source was chosen to yield a change in reflectance 10-20 nm before endpoint, which triggered the overetch step. 96 The technique most commonly used for endpoint determination is OES (Sec. IV A), where a particular wavelength in the plasma is monitored throughout the etch process for any change associated with the removal of the film being etched. It can be associated with an etch by-product, like CO emission in oxide etching (which leads to the signal decline at endpoint), or reactant, such as Cl or Cl 2 in polysilicon etching (which leads to a signal rise at endpoint). The appropriate monitoring wavelength is either selected by a filter, photomultiplier tube, or an optical multichannel array. In the latter case, it is possible to monitor multiple wavelengths simultaneously. In cases where a rotating magnetic-field is used, some averaging of the signal over one or more rotation-periods is required to produce a meaningful signal. An example of an emission trace used to monitor WSi x /polysilicon gate etching is shown in Fig. 9 . While OES is a powerful technique for endpoint detection, it is not sensitive enough in cases where the exposed area is very small, such as in contact and via etching.
Both reflectivity and OES techniques require an access windows on the chamber that must remain clean to obtain stable signals of sufficient intensity. Typically, a heated quartz window is used to prevent polymer buildup.
Other endpoint detection methods, such as pressure change, bias change, and mass spectrometry are discussed elsewhere. 94 These are not used in production due to their complexity and/or lack of sensitivity.
IV. DIAGNOSTICS AND MECHANISMS
A. Some basic considerations in ion-assisted etching Anisotropic plasma etching is made possible by the perpendicular bombardment of the surface by positive ions that are accelerated by a sheath potential that develops on surfaces exposed to the plasma. Within this simple classification of ion-assisted etching, many types of reactions can occur. Anisotropic etching requires a combination of energetic ions and reactive neutral species. The flux of neutral species should preferably be much larger than the positive ion flux, so that the etching rate is mainly limited by the ion flux. This allows the fastest etching rates to be obtained with minimum dependence on the feature aspect ratio and the area of exposed material.
Vaporization of products
It is advantageous if the etching gas is able to form a volatile compound with the film or silicon substrate that is being etched. Although it is possible to obtain useable etching rates with some sputtering processes, including with reactive gases that aid in breaking substrate bonds but generate nonvolatile products, these products will redeposit on the wafer and coat the reactor surfaces, causing many feature-scale and reactor scale problems. Also such processes are not very selective.
For a compound to be sufficiently volatile, its evaporation rate should be much higher than the desired etching rate. The maximum evaporation rate is computed by the principle of detailed balance: at equilibrium, the forward and reverse rates of every elementary process are equal. Consequently, for a gaseous species at a number density, n, in equilibrium with its liquid or solid state in a closed system, its evaporation rate equals its impingement rate on the solid or liquid. The impingement rate or flux (molecules-cm 2 s
À1
) of species onto a surface is
The thermal speed, v (in cm/s), is given by 
The equilibrium vapor pressure, p V , is described by the Clausius-Clapeyron equation:
where DH is the heat of vaporization and p 0 is a constant of integration. Consequently, at equilibrium, the evaporation rate (equals the impingement rate) is given by
The evaporation rate is given by the right side of Eq. (5) The intensity fluctuations at the beginning of the breakthrough and the silicide etch reflect instability of the RF power associated with the matching network of the source. The 390 nm trace rises at endpoint of the WSi x etch step while the 742 nm trace falls. The 390 nm emission line is used to determine endpoint of the main polysilicon etch (the fall of the 742 nm trace is associated with RF power being turned off). O 2 , COF 2 , metal halides, and metal oxy-halides. These are not necessarily the primary etching products leaving the surface, but are nonetheless the predominant stable products that are pumped away. With the exception of some refractory metal compounds and Si-dihalides, the evaporation rates for these products are many orders of magnitude larger than typical etching rates, hence these stable products will never reach an appreciable coverage on surfaces and etching of these materials is never limited by the evaporation rate, even in the case of Si etching in an HBr plasma, where SiBr 4 is the least volatile product possible. Hence, even though it is true that SiBr 4 is less volatile than SiCl 4 , Si etching in a HBr plasma is not slower than that in a Cl 2 plasma because the product is less volatile.
For a species to be present on the surface during etching, it must be strongly adsorbed. The rate of thermal desorption is given by
where 0 is the pre-exponential or so-called attempt frequency and E a is the activation energy for desorption, or binding energy for the adsorbate. The pre-exponential factor is often simply assumed to be equal to a typical vibrational frequency of $10 13 s
, although it is in fact equal to kT/h times the ratio of partition functions of the transition state for desorption to that of the reactant state and can vary from typically 10 8 to 10 15 s
. Like 0 , the binding energy can also span a wide range of values, reflecting the complex nature of the surface layer and the multitude of bonding configurations. Using a value of 10 13 s À1 for 0 , it can be seen from Eq. (6) that a species must have a binding energy of 16 kcal/mol (0.69 eV) at room temperature to have a $0.1 s lifetime on the surface, comparable to the time required to etch 1 monolayer. This binding energy exceeds physisorption energies for most adsorbate-substrate combinations, but is less than most chemical bonds; consequently, any chemisorbed species will likely live indefinitely on the surface, while products like SiBr 4 will desorb nearly instantaneously after being formed.
Therefore, surfaces are covered with chemisorbed species during and after etching. It is mostly plasma radicals that will adsorb and form this chemisorbed layer, but feed gases can sometimes also react (e.g., Cl 2 with Si and Al). Once this chemisorbed layer forms, containing atoms from the etching gas and (usually) the substrate, it must be removed (after perhaps being further "activated") for etching to proceed. This is usually accomplished by ion bombardment. In one notable exception, Al etching at higher pressures, ion bombardment is not necessary, and the chemisorbed layer formed by reactions of Cl 2 and Cl spontaneously converts to a physisorbed AlCl 3 layer that rapidly desorbs. In most cases, however, it is the ion-stimulated removal of the chemisorbed layer that makes anisotropic etching possible.
Adsorption and etching by neutrals
A few reactions of neutrals with materials used in microelectronics devices have been studied. These experiments measure sticking coefficients, the reaction coefficients, and recombination coefficients. In an oversimplified treatment, the sticking coefficient can be defined by the LangmuirHinshelwood adsorption model, which, using Cl adsorption on Si as an example, can be described as
where SiÁ is an adsorption site. The relative density of such adsorption sites is defined as h, which can range from 0 to 1. The probability for adsorption is given by
where S is the sticking coefficient (or probability) at SiÁ. When all sites are occupied, the probability for adsorption is zero. Most etching processes operate near this limit. Even on clean, perfect crystalline surfaces, adsorption is more complicated and often occurs by a precursor-mediated mechanism in which an adsorbate has a high sticking coefficient, even on an adsorbate-covered surface. The weakly bound adsorbate diffuses along the surface until it either finds a vacant site for adsorption (SiÁ in the example above), reacts, or desorbs. On rough surfaces that are present during etching, the adsorption and diffusion processes are more complicated, with a range of differing adsorption sites and rates.
Although sticking coefficients are often treated as adjustable parameters in models, some measured values have been reported. These parameters usually do not correspond to the Langmuir-Hinshelwood sticking coefficient at a vacant site, as defined above, and instead are either reaction coefficients that lead to generation of products that desorb or incorporate into a growing film.
A reaction coefficient for etching, e X(S) , can be defined as the probability that an impinging neutral will react with an atomic or molecular material, S, in the absence of ion bombardment or other sources of energetic particles, to generate a volatile product that promptly desorbs. 99 It is given by
where R X(S) is the etching rate, N A is Avogadro's number, q S and M S are the density and mass of the substrate, and x/y is the average stoichiometry of the SX x etching products that desorb, divided by the X y stoichiometry of the etchant (e.g., y ¼ 2 for Cl 2 ). Reaction coefficients have been measured for some relevant etchants and materials and are important for determining isotropic etching rates and so degree of undercutting. The etching rate of substrate S by species X is often given in the form of an Arrhenius expression
where A and E a are the Arrhenius pre-exponential factor and activation energy, respectively, n X is the number density of X at gas temperature T g , and T S is substrate temperature, and n is either 0.5 or 0, depending on whether the dependence of the impingement rate on T g is separated from or included in the activation energy. For n-type Si etching, the dopant level affects the pre-exponential. In these cases, A can be expressed as
where N e is the n-type carrier concentration and c is an empirical factor.
Ion induced etching reactions
The etching rate for many materials during simultaneous exposure to ion and neutral fluxes is much faster than the sum of the sputtering and chemical etching rates (plasma etching's version of the "whole is greater than the sum of its parts" 101 ). The details of this process are extremely complicated. Several distinct mechanisms have been identified. Although ion bombardment sometimes aids in removing a passivating species from horizontal surfaces that would otherwise slow or stop etching, in virtually all cases ion bombardment enhances the reaction between the neutrals and substrate that leads to the formation of a volatile product. This is true even in the case of etching of Si in fluorineatom-generating plasmas such as SF 6 /O 2 , where the etching rate by F atoms is quite high.
The details of ion-assisted etching reactions have been reviewed previously. 5, 102, 103 Coburn, Winters, and coworkers have show that for Ar þ -assisted etching of a fluorinated Si surface, the process occurs by a "chemical sputtering" process in which ion bombardment causes chemical reactions to occur that lead to the formation of products that then desorb. These reactions are generally assumed to happen on a very short time scale ($1 ps) 104 in the collision cascade created by the transfer of momentum from the impacting, neutralized ion to the substrate and reactant atoms in the near-surface region. Product desorption can take much longer, but is usually not rate-limiting.
Other plasma-surface reactions
Plasma interactions with surfaces other than those being etched also affect the plasma etching process, though indirectly. Species in the plasma stick on the walls and masked portions of the substrate and can lead to the growth of a film, or the formation of a product that desorbs. These processes modify the plasma species concentrations and can lead to changes in etching rates, profile shapes, selectivities, and other figures of merit. Therefore, surface reaction coefficients are of interest for conditions as close as possible to real etching plasmas.
Fisher and co-workers have measured sticking coefficients and reaction coefficients of selected radicals under conditions close to those in the plasma. [105] [106] [107] Beams of SiH, OH, NH, NH 2 , CF, and CF 2 radicals generated in various plasmas were directed at substrates such as SiO 2 and fluorocarbon films in a differentially pumped chamber, and detected by laser-induced fluorescence (LIF) as they impinge on a surface and reflect from the surface. An image of the path of the incoming and outgoing radicals is captured, hence the technique is called "imaging of radicals interacting with surfaces." Sticking and/ or reaction coefficients can be obtained with this method. For CF 2 scattering off SiO 2 , Si 3 N 4 , Si, stainless steel, and photoresist substrates, the flux of scattered CF 2 exceeds its incident flux, indicating that this product is formed by a surface reaction of other impinging species (e.g., CF and/or CF 3 ) that are also present in the plasma beam source. 108 
B. Selected diagnostic techniques for etching plasmas
It is beyond the scope of this article to review the many diagnostic methods that are available to characterize processing plasmas, including electrical probe methods to determine ion and electron number densities and electron energy distributions. Instead we will highlight a few selected techniques that are useful for understanding and monitoring etching processes. These can be divided into gas-phase and surface probes.
The gas phase plasma contains mostly neutrals. These are stable feed gas species, radicals formed by the decomposition of the feed gas, stable etching products, and radicals that can be primary products or product fragments that are formed by electron impact. In some cases, a complete determination of the plasma neutral composition is desired for a deeper understanding of plasma chemistry. In other cases, one or several species are monitored to sense endpoints when thin films have been etched away.
There is also an important need to monitor the chemical and physical nature of surfaces immersed in plasmas. This includes not only the surfaces of wafers being etched, but also the plasma chamber wall surfaces. Controlling the latter is essential for maintaining stable processing conditions, since the composition of radicals in the plasma is greatly affected by heterogeneous reactions that are in turn dependent on the nature of the chamber wall surface. Ellipsometry and infrared absorption spectroscopy are in situ methods that have been used to monitor species on surfaces in real time. The chemical composition of surfaces immersed in plasmas can also be investigated by electron spectroscopy methods [mostly x-ray photoelectron spectroscopy (XPS)] that require the sample to be transferred from the plasma chamber to the analysis chamber without exposure to air. Recently advances have been made in a "spinning wall" technique in which a small portion of the chamber wall is rapidly rotated such that the surface can be analyzed by Auger electron spectroscopy and desorption mass spectrometry.
109,110
Mass spectrometry
Mass spectrometry is the broadest gas-phase diagnostic method, capable of detecting any neutral or charged species in the plasma. In practice, however, this is anything but straightforward. In a "residual gas analysis" mode, mass spectrometry can be used to routinely monitor stable species that leave the plasma, and/or form in downstream regions on the way to the mass spectrometer. This can be used to infer which radicals may be present in the plasma (e.g., C 2 F 6 formation downstream from a CF 4 plasma is likely a result of CF 3 recombination). Etching products can also be detected by mass spectrometry, and can be used in end-point detection, although such an approach is almost never used because optical emission spectroscopy is simpler and more sensitive.
To detect reactive neutrals, a small aperture needs to be installed in the wall of the plasma chamber. Line-of-sight detection of species passing through the aperture and entering the input of the mass spectrometer ionizer is of course essential but not sufficient. Two stages of differential pumping are required and the product beam must be chopped to distinguish the beam signal from the background gas in the mass spectrometer chamber. Only when the neutral mean free path, k n , is comparable to the reactor radius, R (almost never the case), does mass spectrometry monitor species with similar efficiency at the center and edge of the plasma. Under typical conditions of R > 15 cm diameter chambers, and k n < 1 cm (for gas pressures of > 5 mTorr), the method is highly biased toward species near the walls.
Another shortcoming with mass spectrometry is that it is difficult to assign peaks to a particular species. Occasionally, this is due to a coincidence in mass (e.g., Si and CO), but more commonly, the issue is distinguishing parents from daughter ions that are formed in the ionizer of the mass spectrometer. Cracking patterns for stable species such as SiCl 4 can be measured to help unravel the connection between the detection of daughter fragments and the number densities of the parent and daughter neutral species. Signal intensities can also be recorded as a function of the mass spectrometer electron impact ionizer energy. Such appearance potential measurements can help to separate the signals originating in daughter fragments from those corresponding to the parent ion of radicals. [111] [112] [113] Ions can also be directly detected with a mass spectrometer, with the ionizer switched off. 113 An energy analyzer between the ion input region and the detector can energyselect the ions and by sweeping the pass energy, an ion energy distribution (IED) can be obtained with respect to ground potential (assuming the mass spectrometer is grounded). In CCPs and inductively coupled plasmas (ICPs) with no Faraday shield, the IED is governed by the electron temperature and the product of the ion transit time across the sheath and the frequency of oscillations in the sheath potential (i.e., the applied RF). 5 When it takes many RF cycles for the ion to cross the sheath, and no collisions occur, ions will impinge on grounded surfaces with a nearly monoenergetic IED and an energy of the mean sheath potential. Knowing the composition of impinging ions and taking into account the influence of substrate bias, mass spectrometer measurements of ions impacting grounded surfaces at the edge of the plasma can be used to infer what happens at the RF-biased substrate.
Optical, gas phase techniques
Practical diagnostic methods for plasma etching must be nonintrusive. Optical techniques satisfy this requirement.
Plasma-phase (as opposed to surface) optical diagnostics techniques include (in roughly decreasing order of usage) optical emission spectroscopy, LIF, UV absorption, infrared (IR) absorption, and laser-Raman scattering. Application of these spectroscopic techniques for thin film materials processing has been reviewed. 114 a. Optical emission spectroscopy. OES is the most widely used diagnostic technique in plasma etching. It was first used in an etching application by Harshbarger et al. in 1977 to study a CF 4 /O 2 plasma during Si etching in parallel plate plasma. 87 They identified F, O, Si, and CO emissions and showed that F and Si emission exhibited a maximum as a function of O 2 addition to CF 4 .
The vast majority of optical emission in etching plasmas is a result of electron-impact excitation. Most atomic and diatomic species can be monitored by OES. Some triatomic molecules such as CF 2 , SiCl 2 , NH 2 , and CO 2 þ can also give rise to optical emission, but emission from larger molecules is either lacking because of low-lying, nonradiative bound and dissociative states or is broad and featureless because of the large density of vibrational states. Because of the complexity of the excitation mechanism, OES is usually a qualitative technique. This does not hamper the main application for OES: endpoint detection. It does, however, make it difficult (but not impossible) to determine quantitative, relative, and absolute species number densities by this method (see below).
Typical optical emission spectra of a chlorine plasma during fast etching of Si and slow etching of SiO 2 are shown in Fig. 10 attributed to electron impact excitation of an ion pair state of Cl 2 at 8.4 eV, or possibly to one at 9.2 eV.
116
where the superscript "*" indicates the electronically excited state(s) and h is a photon of frequency ( ¼ 9.84 Â 10 14 s
À1
for a wavelength of 305 nm). It has been shown that the Cl emissions are mostly a result of electron impact excitation of Cl atoms [116] [117] [118] Cl þ e ! Cl Ã þ e; (14)
This makes detection of Cl emission useful for measuring endpoints as well; as a material that consumes Cl is etched away, Cl emission increases. For more quantitative determination of even relative Cl number densities, it is necessary to consider that Cl* is also excited to a smaller extent from dissociation of Cl 2 .
118 119 If the plasma contains large fractions of SiCl x etching products and not much Cl, then production of Cl emission from dissociative excitation of SiCl x may also need to be included, but the strong SiCl x bond should make this a minor process, except for extreme conditions.
Emission from SiCl x indicates that these species are present in the plasma. Of course, some of these emissions could be the result of dissociative excitation of higher Si-chlorides, such as SiCl 3 þ e ! SiCl 2 * þ Cl þ e. This does not matter for endpoint detection but does prevent OES from providing anything more than a qualitative indicator of ground state number densities of these species. It is much better to detect these species directly by UV absorption spectroscopy 120 or mass spectrometry. 121, 122 Weak emission from Cl þ in the UV and visible regions is also observed in high-density Cl 2 plasma emission spectra, as shown in the expanded spectrum in Fig. 11 (blue, upper trace). Emission from Cl 2 þ between 400 and 550 nm can also be found in the spectrum at high power and low pressure in Fig. 11 (blue, upper trace), but it is relatively much more prominent in lower density plasmas, as shown in the example in Fig. 11 (black, bottom trace). Electronically excited ions such as Cl 2 þ can be produced by a one-step electron impact excitation from the ground state of neutral Cl 2
If little emission is detected from Cl 2 þ and strong emission is observed from Cl þ , then Cl þ is likely the dominant ion, as it was at 0.5 mTorr and 850 W. When the converse is true, then Cl 2 þ is the dominant positive ion, as it is in the example of 10 mTorr and 24 W. This is also consistent with direct measurements of Cl 2 þ by LIF in a Cl 2 plasma: when the reactor was operated in a low-power CCP mode, Cl 2 þ emission was relatively strong, Cl þ emission was weak and LIF measurements taken together with Langmuir probe measurements showed that Cl 2 þ was the dominant ion, while in the high-power ICP mode, Cl þ emission was strong and Cl 2 þ was barely detectable in either emission or by LIF, hence the dominant ion was Cl þ .
123,124
A sample emission spectrum of a fluorocarbon plasma (C 2 F 6 ) during etching of SiO 2 and Si is shown in Fig. 12 . The spectrum contains features that can be assigned to C 2 , Si, SiF, and C. In addition, emission from F was found in the red/near-infrared region (not shown). Because of the presence of a heated "silicon roof" in the reactor used to obtain this spectrum, the plasma was depleted of F atoms, hence fluorine is contained in SiF x species and not as much is available to form CF 2 and CF 3 . Emission from these species tends to dominate lower density plasmas. In high density fluorocarbon plasmas depleted of fluorine, the radical C 2 appears to be a dominant species. The polymer layer that deposits under these conditions is depleted in fluorine.
b. Actinometry. If an excited state (k) of species X is populated solely by electron impact excitation from its ground state (i) then its absolute ground state number density (n X ) can, in principle, be obtained from the intensity (I X,i,j,k ) of emission at wavelength k X,j,k accompanying the transition X k ! X j , and the relationship 5 I X;i; j;k ¼ 4paðk X; j;k Þ n X Q X;k b X; j;k
where a(k X,j,k ) is the spectrometer sensitivity at k X,j,k , r X,i,k (v) is the cross section at electron speed v for electron impact excitation of X k from X i , f e (v) is the electron speed distribution function 4pv 2 f e (v)dv (the number of electrons with speeds between v and v þ dv), Q X;k ¼ s À1 =ðs À1 þ k q PÞ is the quantum yield for emission by X k , where s and k q are the radiative lifetime and quenching rate constant for X k by all species at total pressure P, and b X,j,k is the branching ratio for the transition X k ! X j .
The electron speed distribution and the proportionality constant are difficult to determine. Consequently rare gas actinometry is often used to convert emission intensities into quantitative, relative number densities. This technique was first applied in plasmas by Coburn and Chen. 125 In this approach, a small amount of a rare gas, A, with an excited state A k that has an energy close to that of X k is added to the discharge. The energy levels of the rare gases span the range from 9.7 eV for Xe to 23 eV for He. Assuming that rare gas emissions are caused solely by electron impact excitation of the ground state, an expression analogous to Eq. (19) relates emission from the rare gas to its known number density
It is usually assumed that the relative energy dependence of the cross section for electron impact excitation of the species of interest is the same as that of the rare gas, i.e., r A,i,k (v) / r X,i,k (v) at any v. Consequently, the n X can be simply expressed as
where a X,A is a proportionality constant. Relative densities of atoms (F, Cl, H, and O), and small molecules (Cl 2 , CF, CF 2 , BCl) have been determined in a wide variety of plasmas by this method. In a few cases, absolute number densities have also been measured through several calibration methods. Actinometry is a simple method that, when carefully applied, can provide valuable quantitative measurements of species concentrations in commercial plasmas with limited optical access. Perhaps the most common and most reliable use of actinometry is for measurement of F atom densities in fluorine-containing plasmas. Fluorine atom actinometry, using the 750. 4 119 This is because O atom densities are usually less than those of O 2 (if oxygen is not being largely consumed by reactions with materials or feed gas components), a result of the large O 2 bond strength (5.11 eV).
Actinometry has also been widely used for measurement of relative number densities of Cl-atoms. Often the Ar 750.4 nm line is used, even though it is not such a good energy match for the Cl emitting levels. Using the Xe 828.0 or 834.7 nm line provides a much better energy match to the Cl emitting levels, and more consistent tracking of Cl number density. 118 In addition, dissociative excitation of Cl 2 [reaction (16) ] is a source of Cl emission at low n Cl . Direct evidence for this was observed in Cl 2 plasmas. 118 
Surface techniques
Analysis of plasma-exposed surfaces can be carried out in a number of ways. It is far easier to perform the analysis by moving the sample (usually under vacuum) to a chamber equipped with a standard analysis method such as XPS. This method is discussed in some detail in the next section. In some cases, it is of interest to analyze surfaces while they are immersed in the plasma. Several approaches have been demonstrated. Aydil and co-workers have used total internal reflection, Fourier-transform infrared absorption to monitor adsorbates on a GaAs sample mounted near the reactor wall. 133, 134 This technique provides quantitative analysis for many species with monolayer detection, but requires IRtransparent substrates and relatively long times. In the laser desorption-laser induced fluorescence (LD-LIF) technique, a pulsed laser heats the surface, causing desorption of adsorbates that are detected by LIF, excited by the tail of the same laser pulse. [135] [136] [137] [138] [139] This method has very high sensitivity for some species (e.g., <1% of a monolayer of SiCl and SiBr) and fast (ns) time response, but the interpretation is more difficult and it is limited to a relatively few species, and thermally robust surfaces. Another approach, called the "spinning wall" method inserts a cylindrical substrate into the reactor wall. 109, 121, 122, [140] [141] [142] [143] [144] [145] [146] Part of the cylinder is in the plasma while another section is in a differentially pumped chamber with a mass spectrometer and Auger electron spectrometer (AES) facing the surface. By rapidly rotating the substrate, portions of the surface that were in the plasma as little as 1 ms ago can be diagnosed. Weakly bound species and heterogeneous reaction products can be observed with the mass spectrometer and strongly bound species can be detected by AES.
a. Vacuum-transfer XPS. Electron spectroscopy techniques such as XPS and Auger electron spectroscopy are extremely useful for quantitative identification of species on surfaces, and especially for XPS, chemical bonding, and structural information. The plasma environment does not allow these techniques to be used in real time; therefore, analysis of plasma-etched materials is carried out after etching by transferring the sample to a separate analysis chamber. If this is done by bringing the sample out into the air, then the surface layer will be oxidized in most cases. Of course, characterization of air-exposed wafers is often of interest for long-term reliability reasons, but if studying the plasma-surface interaction is the prime motivation, then air exposure must be avoided.
Several research groups have constructed integrated plasma etching/surface analysis machines that allow samples to be moved under vacuum from the etching chamber to the analysis chamber. [147] [148] [149] [150] [151] [152] [153] [154] This is usually done by moving the sample through a loadlock chamber with linear transfer devices. One such system is shown in Fig. 13 . One obvious question is how does the surface change between the instant when the plasma is extinguished and when analysis begins, a delay of at least several minutes after etching? Chemisorbed species will not desorb, but physisorbed species present at low coverages during etching will react and/or desorb before analysis can be carried out. Weakly adsorbed species, though often important for etching reactions, are not expected to be present at high concentrations. The chemisorbed layer is also very important, since ion bombardment causes reactions in this layer that lead to etching. The vacuum transfer surface analysis method provides valuable insights into the nature of this layer, which is stable and long lived in the absence of air or ion bombardment.
Si etching in F and Cl-containing plasmas has been studied in some detail. Low resolution spectra of unpatterned Si after etching in a chlorine ICP are shown in Fig. 14 . Si(2p), Si(2s), Cl(2p), and Cl(2s) peaks are readily identified, along with O(1s) and C(1s) contamination in some cases. Loss features at multiples of the bulk Si plasmon resonance are also observed. The plasmon features to the high binding energy side of the Cl(2p) and Cl(2s) peaks indicate that some Cl has penetrated rather deep into the Si. When the "take-off" angle is small (the angle between the electron collection direction and the surface), XPS is more surface sensitive. The Cl peaks become more intense relative to Si, indicating that Cl is near the surface. From further analysis of the take-off angle dependence of the intensities in the spectra in Fig. 14, the thickness of the chlorinated layer can be derived ($2 nm). This subject is expanded upon below in the section on the nature of the Si surface layer.
C. Mechanistic studies of etching of selected materials
All anisotropic etching processes involve one (or both) of the above mechanisms. In most cases, no reaction takes place between the neutrals and the material to be etched, despite the fact that a volatile product can form and the reaction between the atomic etchants (e.g., Cl atoms) and the substrate (e.g., Si) is exothermic to produce the gaseous product (SiCl 4 ). In this case, energetic ion bombardment speeds up the rate of reactions that generate gaseous products and anisotropic etching occurs. In a few cases, such as in etching of Al in a chlorine-containing plasma, spontaneous fast etching occurs in the absence of ion bombardment and this process is stopped by depositing a very thin layer on the sidewalls. This protective layer is sputtered away faster than it can deposit on horizontal surfaces, allowing anisotropic etching to be obtained. Selected examples are now presented in some detail.
Si etching in halogen-containing plasmas
Coburn and Winters were the first to show conclusively that the etching rate of a material (Si), normally slow when exposed to either neutral etchants (XeF 2 ) or ion bombardment (Ar þ ), was greatly accelerated in the simultaneous exposure to both. 155 This classic experiment was carried out with beams of XeF 2 and Ar þ in a high-vacuum chamber, thus avoiding the complex plasma environment. Many variations on this experiment have contributed to our understanding of ion-assisted etching processes. A summary of some of this work is presented in Table I of Vitale et al.
156
Early work by the IBM group focused on ion-enhanced etching of Si by XeF 2 , partly because etching of Si in CF 4 -containing plasmas was one of the first plasma etching processes developed. F-atoms are the active etchant for Si in this plasma, as well as in SF 6 plasmas that are used to etch Si at faster rates. Most anisotropic etching of silicon is carried out in Cl and/or Br-containing plasmas, however, because isotropic chemical etching by Cl and Br is much slower than etching by F-atoms. Most, but not all of the technologically relevant combinations of ions (Cl 2 þ , Cl þ , and Ar þ ) and neutrals (Cl 2 and Cl) have been investigated.
Isotropic etching of Si can also occur as a result of chemical reactions with F, Cl, or Br atoms to form volatile products. These processes are described by the reaction probabilities, e X(S) , defined above. Etching rates and reaction probabilities for Si by F atoms have been reported by Flamm et al. 99 Values for Cl atoms have been given by Ogrzylo et al. 100 and Walker and Ogrzylo. 157 These researchers also measured reaction coefficients for Br atoms with Si. 158 Using a density of 2.33 for Si, and assuming that the etching products are SiF 4 for F, and an equal mixture of the di-halide and tetra-halide (so x ¼ 3) for Cl and Br, reaction probabilities from these studies are reproduced in Fig. 15 as a function of substrate temperature.
Several general conclusions can be drawn from the data in Fig. 15 . At room temperature, chemical etching of Si by halogen atoms follows the trend F > Cl > Br, expected from the Si-halogen bond strengths (140, 90 , and 80 kcal/mol, respectively). The dopant type and level strongly affects the Cl and Br reactivity. Highly doped n-type Si (n þ -Si) etches much faster than lightly doped n-type or p-type Si. This dopant dependence has been attributed to the shift in the Fermi level, making formation of Cl À favored. 100 Cl À is drawn through the SiCl x surface layer by the resulting electric field. The formation of F À has also been invoked to explain the smaller enhancement in the etching of n þ -Si by F-atoms. 159 Isotropic etching of Si by F is fast enough to be a concern for anisotropic etching in high density plasmas that generate large percentages of F atoms (e.g., CF 4 /O 2 , SF 6 /O 2 , or NF 3 ). To slow isotropic etching by F atoms, it is often necessary to add a species that coats the sidewall and slows lateral etching. Thermally activated isotropic etching can also be slowed, relative to ion assisted etching, by cooling the substrate. At low partial pressures of Cl atoms (<10 mTorr), isotropic etching by Cl and Br atoms is very slow for all levels of p-type, intrinsic, and lightly to moderately doped n-type Si. Highly doped n-type Si will exhibit a large degree of undercutting at Cl partial pressures of only a few mTorr. Under these cases, some sidewall protection scheme is necessary. For anisotropic etching, the ion-assisted etching rate must greatly exceed the etching rate by neutral species. The fundamental parameter of interest is the ion-assisted etching yield, defined as the number of substrate atoms or molecules removed per incident ion. Figure 16 presents measurements by Levinson et al. 160 of Si yields as a function of the square root of ion energy for Cl 2 and either Ar þ or Cl 2 þ , carried out in the limit of a high neutral-to-ion flux ratio. A simple linear square root relationship was found, with a threshold energy below which ion assisted etching ceases, as has been observed in many etching investigations. 161 At higher energies, the nature of the ion (reactive Cl 2 þ versus unreactive Ar þ ) plays only a small role in determining the yield. For example, near 500 eV, the yields are only $5% higher for Cl 2 þ /Cl 2 than for Ar þ /Cl 2 , while at 60 eV, the yields for Cl 2 þ /Cl 2 are $50% higher than for Ar þ /Cl 2 . 160 Yields extrapolate to 0 at about 25 and 35 eV for Cl 2 þ /Cl 2 and Ar þ /Cl 2 , respectively; hence, near threshold, the nature of the ion becomes important. The Ar þ /Cl 2 yield at 100 eV is in good agreement with the value of 0.7 reported by Chang et al. 162 for the same conditions. Yields have been measured as a function of the Ar þ -toCl 2 flux ratio (Fig. 17) and have been found to saturate at a low ratio, indicating ion-flux-limited etching and a small sticking coefficient by Cl 2 on the ion-bombarded/chlorinated surface. At high ion-to-neutral flux ratios (not usually obtainable in plasma etching processes unless the halogen fraction in the feed gas is very low), the surface is mostly free of Argon ion-assisted etching by Cl-atoms has also been reported. Two to five times higher ion yields are found when the surface is chlorinated with Cl instead of Cl 2 . 162 The enhancement for Cl vs Cl 2 is smaller (about two-fold) with Cl þ , compared to Ar þ . 164 These results indicate that when ion-bombarded Si is exposed to Cl, it forms a more heavily chlorinated surface layer than when it is instead exposed to Cl 2 , but that some of the Cl comes from Cl þ (and presumably Cl 2 þ ) in a chlorine plasma. This results agrees with LD-LIF studies (see Fig. 18 ), where the Cl areal density on Si exposed to a chlorine ICP was about twice that on Si exposed to Cl 2 gas with the plasma off.
Vitale et al. 156 and Jin et al. 165 have also carried out measurements in a plasma beam system. This system does not produce single reactive neutral or ion species, but instead provides a mix of species that would be similar to that in a plasma. Their etching yields as a function of the square root of ion energy for F 2 , Cl 2 , Br 2 , and HBr plasmas are reproduced in Fig. 19 . They find that the number of Si atoms removed per ion has a similar dependence on ion energy above a threshold energy, E th . F atoms from the F 2 plasma will rapidly etch Si in the absence of ion bombardment; 99 hence, E th for the F 2 plasma beam is near zero. They find threshold energies for Cl 2 and HBr plasma beams are $5-10 eV, while E th for Br 2 appears to be much higher (44 eV). E th for the Cl 2 plasma beam is lower than that reported for dual Cl/Cl þ beams ($16 eV), 164 as well as for Cl 2 /Cl 2 þ beams. 166 The apparently low E th values for the Cl 2 beam in the experiments by Vitale et al., 156 derived by extrapolating from ion energies at which appreciable etching is observed to zero etching rate, could be a result of etching at ion energies below E th , caused by low energy ions, electrons, photons, and/or Cl atoms. Recently, Shin et al. 167 reported a similar ion energy dependence for p-type single crystal Si(100) etching in a Cl 2 plasma at ion energies above E th . These measurements are reproduced in Fig. 20 . The substantial etching rate below the energy threshold for ion-assisted etching was unexpected, since it has been reported that ptype Si(100) is not etched by Cl atoms. 100, 157 Indeed, the lack of undercutting of masked samples also indicates that Cl atoms are not responsible for etching below E th . Instead, it was concluded in that work that this subthreshold etching was induced by vacuum ultraviolet illumination of the surface in the presence of gaseous Cl and Cl 2 . 167 Above the photo-assisted etching component (the dashed line in Fig. 20) , an ion-assisted etching threshold of 18 eV is found, with a second threshold at about 26 eV. The lower value is very close to E th ¼ 16 eV for etching with coimpinging Cl and Cl þ beams, 164 while the upper value matches E th ¼ 16 eV reported for Cl 2 and Cl 2 þ beams. 160 Since the plasma is a mixture of Cl, Cl 2 , Cl 2 , and Cl 2 þ , the dual thresholds seem reasonable. If one were to extrapolate to zero etching rate, then apparent thresholds of 2.8 and 18 eV are obtained. Depending on the amount of VUV light reaching the sample in the experiments by Vitale et al. 156 and Jin et al., 165 the low E th values could easily be explained by this effect.
As discussed above, Si etching processes are rarely carried out in pure Cl 2 or Cl 2 /Ar plasmas because of the formation of microtrenches. 168 It has been found that adding HBr to Cl 2 plasmas converts sharp microtrenches into very broad, shallow ones. 168 This is believed to be due to a change from specular reflection of positive ions from a chlorinated sidewall in a chlorine plasma to a broad-angle reflection of ions from sidewalls exposed to an HBr-containing plasma. 169 It was further proposed that the sidewalls may be rougher in HBr plasmas than in Cl 2 plasmas. The enhanced roughness of HBr plasma-exposed surfaces could be due to etching by H atoms. It could also be that surfaces become contaminated with small amounts of carbon in HBr-containing plasmas, while surfaces exposed to chlorine-containing plasmas are carbon-free. This carbon could mask small regions, causing uneven etching. While sidewall etching is hardly detectable in anisotropic etching processes, the roughness need not be substantial to cause glancing angle of incidence ions to be scattered over a wide angle, as is found in HBr-containing plasmas. Simulations by Helmer and Graves 169, 170 indicate that 2 nm roughness is sufficient.
a. Nature of the Si surface layer. The surface layer that forms during Si etching in halogen-containing plasmas has been studied by the many experimental techniques discussed above and below, as well as in beam experiments designed to simulate the plasma environment, and by simulations, including molecular dynamics methods. 171 Of the experimental methods, the vacuum-transfer XPS method described above provides many important details. XPS with vacuum sample transfer has been used to determine Cl coverages and SiCl x stoichiometry on blanket, as well as patterned surfaces. From low resolution spectra such as those in Fig. 14, Si and Cl 2s and 2p core level features and plasmon losses associated with both the Si and Cl can be identified, and it can be concluded that the surface layer contains the equivalent of a couple of monolayers of chlorine.
The stoichiometry of the halogenated layer can be determined from high resolution Si(2p) spectra. An example of a Si(2p) spectrum, with background and the J ¼ 1/2 spin-orbit component removed, is shown in Fig. 21 . 115 SiCl, SiCl 2 , and SiCl 3 were identified at 100.2, 101.2, and 102.3 eV, respectively, in good agreement with binding energies reported by Durban et al. 172 in synchrotron photoemission studies of the chlorination of Si (111) broader than the 0.6 eV resolution of the XPS instrument (especially for SiCl 2 ), and hence cannot be attributed to resolution but are instead an indication of the high degree of disorder in the SiCl x layer formed during plasma etching.
An additional, weak peak appearing as a shoulder on the low binding energy side of the at SiÁ (Si bound to 3 Si and one dangling bond) peak was attributed to Si. Its binding energy was found to be the same (98.8 eV) as that measured for Si after Ar þ sputtering, providing supports for the assignment. From an analysis of spectra at various take-off angles for different plasma conditions, the relative coverages Si, SiCl, SiCl 2 , and SiCl 3 were deduced, as well as the overall thickness of the chlorinated silicon layer. 115, 173 Measurements of Cl coverage as a function of ion energy (Cl 2 þ and/or Cl þ or Ar þ ) from both plasma and beam experiments by several groups are summarized in Fig. 22 , where the areal density of Cl determined by XPS (Ref. 173) and LD-LIF are plotted as a function of the square root of ion energy. Increasing ion energy increases the Cl coverage during Si etching, for ion energies less than 300 eV. Coburn 174 also found that ion bombardment increases chlorine coverage on Si (two "þ" points in Fig. 22 ). The chlorine uptake on Si exposed to simultaneous Ar þ (1 keV) and Cl 2 beams is $3 times larger than with just the Cl 2 beam alone, whether the Si surface was an ordered, annealed surface, or one amorphized by ion bombardment prior to exposure to the Cl 2 beam. The factor of 3 is less than expected from an extrapolation of the XPS and LD-LIF data in Fig. 22 Fig. 22 ). Since the beam experiments by Coburn showed that simultaneous Ar þ and Cl 2 exposure creates a more highly chlorinated layer than sequential Ar þ followed by Cl 2 exposures, the enhanced chlorination was attributed to a knockon process, rather than an increase in binding sights by ion bombardment. In a knock-on process, short range ion impact on adsorbates causes the adsorbates to be implanted into the subsurface region. This mechanism was also proposed by Barrish et al. 176 for Si etching with Cl 2 and Ar þ beams. While knock-on is a well known effect that no doubt plays some role in subsurface chlorination, ion-induced creation of added adsorption sites is not necessarily ruled out by the Cl 2 / Ar þ beam experiments. Consider the differences in the types of amorphous layers created by Ar þ versus Cl 2 þ bombardment. Ar þ bombardment breaks bonds during the collision cascade, but most of these broken bonds reform. The result is a fairly dense lattice with few voids. Cl þ and Cl 2 þ bombardment also breaks bonds, but in addition prevents some of the Si-Si bonds from reforming through formation of mainly SiCl. This favors formation of a much more open surface structure than Ar þ amorphized Si. The strongest evidence for this comes from molecular dynamics simulations by Graves and co-workers. 104, 177 Some of these calculations are reproduced in Fig. 23 . 104 Starting with a Si(100) surface, they found that bombardment with 50 eV Ar þ created a disordered but dense damaged layer, while bombardment with Cl þ at the same energy resulted in a highly disordered, roughened layer with small subsurface voids. Such a layer contains more binding sites for adsorption by neutral Cl and Cl 2 .
Si etching mechanisms in HBr-containing and HBr/Cl 2 -containing plasmas have also been investigated. Horizontal surfaces take up Br and Cl roughly in proportion to the HBr:Cl 2 feed gas ratio, as shown by the XPS and LD-LIF measurements in Fig. 24 . 136 Neither of these methods were capable of detecting hydrogen, so part of the falloff in total halogen coverage as a function of HBr addition could be due to adsorbed H that blocks sites for adsorption by Cl or Br. The etching rate has been found to fall as HBr is added to Cl 2 feed gas. This was attributed to a combination of reduced halogen content in the layer and to a lesser degree, lower ion flux in HBr plasmas compared to Cl 2 plasmas. 136 Sawin and co-workers have also studied etching of Si in HBr/Cl 2 plasmas. 156, 165 Their XPS measurements 165 of halogen coverage measured as a function of HBr fraction are virtually identical to those of Cheng et al. 136 in Fig. 24 . They also found that the etching rate decreases with HBr addition to Cl 2 , but ascribe it mainly to the lower ion flux and not to the reduced halogen coverage on the Si surface. This was based on the observation that if Si was etched in either a Cl 2 or HBr plasma and then subjected to Ar þ bombardment, the initial enhanced sputtering rate was the same. They invoke adsorbed H in HBr plasmas as playing a role similar to Br or Cl in enhancing the rate.
Sawin and co-workers also studied the angle-of-incidence dependence of the etching rate for beams extracted from HBr and Cl 2 plasmas. 156, 165 They found that both yields peaked at normal incidence (¼0 ), but the yield in Cl 2 plasmas was nearly constant between 0 and 45 while HBr plasma yields fell significantly between 0 and 45 . They carried out simulations of feature profile evolutions and attribute the straighter sidewalls in HBr plasmas versus Cl 2 plasmas to this difference in dependence of the ion-assisted etching yield on angle of incidence.
As can be seen from Fig. 15 , the chemical (i.e., isotropic) etching rates by Cl and Br atoms is very slow for all but heavily doped n-type Si. Consequently, no undercutting is expected and none is found. 178 No sidewall protection is needed, whether intentional (e.g., oxygen or fluorocarbon addition) or serendipitous (as a benefit of mask erosion). In fact, vacuum-transfer XPS analysis of photoresist-masked p-type Si in a HBr plasma showed that there is no carbon or oxygen on the sidewall. 178 When some undercutting does occur, either from wide-angle ion bombardment, from possible attack by H atoms, from photo-assisted etching, or in the case of etching of heavily n-doped Si, the deposition of a thin film on the Si etched feature sidewalls can be beneficial in obtaining near-ideal profile shapes. Sidewall films can be formed by adding oxygen to HBr-containing plasmas. 179 An SiO 2 layer has been shown to form under these conditions, with little carbon, even when photoresist masks were used. 154 The presence of this thin layer suppresses bowing and notching. 154 Oxygen addition to HBr-containing plasmas also improves selectivity to underlying oxygencontaining layers such as SiO 2 .
SiO 2 etching in fluorocarbon plasmas
F atoms react slowly with SiO 2 in the absence of ion bombardment (1/40th the rate of F þ Si at room temperature). 99 This rate is too slow to cause much undercutting in most SiO 2 etching processes. Cl, Br, and H do not react with SiO 2 at room temperature. Ion bombardment greatly accelerates the etching of SiO 2 by F atoms 128 (and also of Cl and Br), but because Si also etches rapidly under these conditions, this approach is not commonly used for silicon microelectronics applications. Instead, fluorocarbon plasmas are used for selective etching of SiO 2 over Si. Although the bulk of dielectric etching in IC fabrication is of lower dielectric constant insulators, etching of SiO 2 is still important. In 2 and CF 3 radicals, generated by electron impact dissociation and detachment of the feed gas, do not spontaneously etch SiO 2 in the absence of ion bombardment. 181 Neutral CF x radicals and ions bombard surfaces and lead to the formation of a fluorocarbon film. The deposition of this thin film on horizontal surfaces is the most important aspect of the etching of SiO 2 in fluorocarbon-containing plasmas. Coburn used in-situ Auger electron spectroscopy to investigate the formation of fluorocarbon films on Si and SiO 2 in CF 4 /H 2 plasmas. 182 He found that a thicker film formed on Si than on SiO 2 . The high selectivity of etching SiO 2 over Si was attributed to this difference in film thickness.
Unlike processes that rely on a sidewall film to suppress etching, the fluorocarbon film supplies reactants that are activated by ion bombardment. The film also improves selectivity toward etching of Si. The composition of the film and its deposition rate on SiO 2 depend on which fluorocarbon feed gas is used, the addition of other gases, the reactor materials, and other processing conditions. If the deposition rate is too fast, then the film continues to thicken and no etching occurs. Under useful condition, a constant steady-state film thickness and composition is maintained while the underlying SiO 2 layer is etched at a constant rate.
Butterbaugh et al. 181 studied the etching of SiO 2 in the presence of beams of Ar þ , F, and CF 2 . They found that either F or CF 2 enhanced the yield of Ar þ etching and that the enhancement was larger for F. They also found that supplying a flux of CF 2 had no additional enhancement of the etching with simultaneous Ar þ and F beams impinging. As pointed out by Butterbaugh et al., 181 this is consistent with the prior observation that the etching rate of SiO 2 in CF 4 /O 2 plasmas and NF 3 /Ar plasmas are the same at the same fluorine atom number density. 128 The yields for 500 eV Ar þ can approach unity for high F/Ar þ flux ratios. 183 Oehrlein and co-workers have extensively studied etching mechanisms for SiO 2 in fluorocarbon plasmas. [184] [185] [186] They found that the deposition rate of the fluorocarbon film depends on gas composition and self bias voltage (Fig. 25) . Lower ion energies favored faster deposition rates, displayed as negative values in Fig. 25 . If a fluorocarbon film was deposited at low ion energy and then, under the same plasma conditions, subjected to high ion energies it etched. These net etching rates are given by the positive values in Fig. 25 . Under conditions when the etching rate for the fluorocarbon film is much slower than the deposition rate, a thick film continuously grows and no etching of the underlying SiO 2 or Si occurs. If conditions are such that the film etching rate is much greater than the deposition rate, then little if any film will form on Si or SiO 2 . When the film etching and deposition rates are nearly equal, then the nature of the underlying material determines the steady-state film thickness. If it etches fast, as it does for SiO 2 , then the film is relatively thin, while if it etches slower as for Si, then a thicker film is present.
Some of the processes that occur during etching of SiO 2 in fluorocarbon-containing plasmas are summarized in the model put forth by Sankaran and Kushner; their schematic mechanism is reproduced in Fig. 26 . 187 The film affects etching in two ways. First, it attenuates the energy of impacting ions that penetrate the film and reach the underlying layer. Ion-assisted etching rates typically decrease with the square root of ion energy above a threshold energy, hence, the thicker the film, the slower the etching rate. This inverse dependence of etching rate as a function of film thickness is illustrated by the measurements by Schaepkens and Oehrlein, reproduced in Fig. 27 . 186 Somewhat surprisingly, SiO 2 , Si, and Si-nitride all fall on the same curve.
More recently, it was shown that the etching rate of Si, SiO 2 , and Si-nitride scales inversely with the amount of fluorine in the film. 184 The explanation for this is that this fluorine is liberated by ion bombardment. Some of it diffuses to the underlying surface and participates in etching. Fluorinecontaining products are liberated from the film; hence, fluorine content in the fluorocarbon film is reduced.
Charging in high aspect ratio features is believed to be responsible for "etch stop" that is often observed for dielectric materials. [188] [189] [190] [191] [192] [193] [194] [195] [196] [197] [198] [199] [200] [201] Etch stop can also be caused by deposition of fluorocarbon polymer at the bottoms of high aspect holes in insulating films. 153 The combination of charging and polymer deposition can cause other interesting phenomena. When high aspect ratio holes are etched into an insulating film, a twisting is sometimes observed, as shown in Fig.  28 . 202 It appears to be random; the origin is believed to be due to statistical variation in the amount of polymer and the amount of charge deposited on one side, causing the ions to be deflected off normal incidence, producing asymmetric profiles. Wang and Kushner simulated this twisting effect during etching of SiO 2 . 202 An example from that study is reproduced in Fig. 29 . The different panels represent different random seeds in the Monte-Carlo simulations. They find that once twisting begins in a feature, in propagates. This has to do with the fact that ions are neutralized on glancing collisions with the sidewalls. These neutrals then perpetuate the twisted trench or hole. They also found that injecting high energy electrons into the trench helped to neutralize charge and reduce the severity of the twisted features.
Low-k dielectrics
Low dielectric constant materials usually contain carbon and silica, as in for example methylsilsesquioxane (MSQ), or SiOC(H). Fluorocarbon containing plasmas are used to etch these materials, and the fluorocarbon layer that forms on surfaces during etching plays a similar role in controlling etching rates of the film, as well as allowing etching to be selective to other materials such as SiC(H) etch stop layers. 203, 204 The thickness and C/F ratio of the steady-state fluorocarbon layer is greater on MSQ than on SiO 2 [see, for example, Fig. 30 (Ref. 205) ], suggesting that carbon in the etching film is the main source of C in the layer. For both MSQ and hydrogen silsesquioxane (HSQ), the hydrogen in the film leads to a reduction of fluorine in the film through the formation of HF. 203, 205 The fluorocarbon layer is also thicker on SiC(H) films than on SiOC(H) films 204 [ Fig. 30  (Ref. 205) ], indicating that the oxygen in the film helps to remove the fluorocarbon layer. 203 To further reduce the dielectric constant of insulating layers in interconnects, pores are introduced into the film. The etching of porous silica and other porous low-j films is usually faster than of the same materials without pores. The factor by which the etching rate is enhanced might be expected to scale simply with the reduction of mass of the film. In fact, the rate can be enhanced either more or less than this amount. 206 The introduction of pores can even lead to a suppression of etching rate under certain conditions. 206 The pore size and degree of fluorocarbon film deposition play key roles in determining the etching rate behavior of 20 porous material relative to the nonporous material of the same chemical composition. As proposed by Standaert et al. 206 and simulated by Sankaran and Kushner, 187 filling of the pores with fluorocarbon deposits will slow the etching rate. When the deposition is severe and the pores are large, this can lead to a reduction in etching rate to below that for nonporous material. For small pores, the rate can be enhanced more than the amount expected by the reduction in density. Detailed simulations have been performed by Sankaran and Kushner 207 for etching of porous silica in C 4 F 8 -containing plasmas. Blanket etching rates and profile evolutions were generated, and in cases where experimental data are available, compared favorably.
V. APPLICATIONS A. General concepts
Many issues must be considered in evaluating an etching process. These issues are often coupled, and sometimes trade-offs must be considered in optimizing a process.
Etch-rate uniformity is critical in most applications. In a batch reactor, both wafer-to-wafer and within-wafer uniformities must be considered. Two definitions for etching nonuniformity are commonly used. If E is the average of etch rate E i of N measurements across a wafer (or a batch of wafers), then three standard-deviations, r, from E will be
hence, the etching nonuniformity is 300r= E, expressed in %. This three-sigma nonuniformity means that 99.7% of the etch-rate measurements will fall within E63r. This is rarely the case, however, since etching nonuniformity is usually not random (center to edge, top to bottom, etc.). Another widely used expression for percent nonuniformity is defined as
where E max and E min are the highest and lowest measured etch rates. Typically, 3% nonuniformity within a wafer is considered good. However, there are applications that are more forgiving, while others are more critical. If the process has good selectivity (to be defined below) to an underlying layer, uniformity is less critical, while in applications when etching creates a particular depth feature (e.g., trenches in silicon), uniformity of etching is extremely critical. Uniformity can be optimized by reactor design (electrode design, gas delivery system, magnetic confinement, and use of multiple coils for inductive sources), choice of chemistry and process parameters (e.g., gas flow, pressure, magnetic field and power, as well as the use of additives). Some etching processes are more chemical than ion-assisted (e.g., Al etching in Cl 2 -containing plasmas, or silicon etching with SF 6 or NF 3 ) and therefore are more prone to nonuniformity when large open areas are etched and the feed gas is largely consumed (see "loading effect" below). In these cases, additional hardware (such as focus and shadow rings) can often improve uniformity.
A high selectivity, defined as the ratio between the rate of the layer being etched, relative to that of an underlying layer or masking material, is of critical importance. For example, when polysilicon gate electrodes are formed in field-effect transistors (FETs), the polysilicon (poly-Si) is etched until the underlying gate dielectric (e.g., SiO 2 ) begins to be exposed in those regions where the film was slightly thinner and/or the etching rate was slightly faster. As the process is carried to completion and the remaining poly-Si is etched away or "clears" (the so-called "overetch" step), the exposed gate dielectric layer must remain, since its removal will result in attack of the Si substrate below. Consequently, a high selectivity of poly-Si etching over the gate dielectric is required. Similarly, the selectivity with respect to the mask must be high enough that the integrity of the mask is preserved during the main etch and overetch period.
The selectivity, however, is not always a well-defined quantity. In many instances, the etching rate changes during the process, and as a result, the selectivity varies as well. The etching rates and selectivity can also vary with the aspect ratio of the features being etched. (Aspect-ratio-dependent etching rates are discussed below.) In addition, while selectivity associated with a process may appear high on a macroscale, local effects such as feature "faceting" and "trenching" (discussed below) may make bulk highselectivities meaningless.
The determination of selectivity is not always straightforward. For gate etching, post-etch measurements of the remaining gate-dielectric are often erroneously high (sometimes higher than the initial thickness) due to deposits that form during etching. In addition, when SiO 2 is exposed to the plasma after poly-Si etching in an oxygen-containing plasma such as HBr/O 2 , oxygen can penetrate the exposed SiO 2 layer and oxidation of the underlying Si substrate can occur, also making the remaining dielectric layer thicker and the selectivity appear artificially larger than it is. Loading is the decrease of etching rate with an increase in the amount of exposed material being etched. It can be global, where the spatially-averaged etch rate varies, or local, where the etching rates of regions of the chip depend on the density of the features. The loading effect is observed when the active species in the plasma are depleted rapidly by reaction with the material being etched. 209, 210 It is affected by the lifetime of the active species as well as chamber volume (larger volume, which may be impractical, reduces the loading effect [209] [210] [211] [212] ) and feed gas flow rate (etching rates decrease with decreasing flow rate 115 ). The loading effect is strong when the process is dominated by fast chemical etching by neutrals, where the etching rate is often observed to be first order in neutral etchant number density, 99 and weaker when etching is driven by ionstimulated removal of a halogenated surface layer. 212 The former is observed, for instance, in SF 6 or NF 3 -containing plasmas when a large amount of Si is etched, and F-atoms become depleted. On the other hand, etching Si in Cl 2 plasmas varies much less with the area of exposed Si because the composition of the halogenated surface layer is largely independent of loading, 115 and no chemical etching by Cl atoms occurs (except for n þ -Si). Etching is limited by ion stimulated desorption, and the ion flux is fairly insensitive to loading. In some applications, such as fabrication of silicon microlenses (where a pattern of spherically shaped polymer mask is transferred into the substrate below by polymer-erosion 213, 214 ), the loading effect needs to be addressed, since the area of exposed substrate changes as the polymer erodes, and maintaining constant etch rates of both polymer and substrate material is critical for obtaining the desired lens shape.
Aspect Ratio Dependent Etch Rate (ARDE) or "RIE Lag" refers to a phenomenon where etch rate slows down as the etching proceeds down a hole or a trench. A number of mechanisms have been proposed [215] [216] [217] and involve transport of neutrals and ions as well as charging and shadowing effects. Locally, this effect can take place when the aspect ratio (i.e., the ratio between the depth and width of the space between features) changes, either as a function of depth or as a function of position across the wafer. During contact etching, the aspect-ratio increases and the etching rate generally decreases or, in extreme cases, etching stops. This occurs sooner in smaller features, meaning that when contacts with different diameters are patterned on the same wafer, the smaller contacts etch slower. By adjusting the ion/neutrals ratio (e.g., by increasing the pressure), however, and moving toward a regime dominated by polymer deposition, the effect is reduced and even reversed. 217 Faceting of the mask is the result of ion bombardment. Since the sputtering yield is a function of angle, 218-220 the sharp (or nearly sharp) corners may erode and attain an angle commensurate with the angle associated with the maximum sputtering rate. 221 Since the ion flux per unit area decreases with the angle of incidence, h, the maximum sputtering rate will take place at an angle larger than 0 if the sputteringyield as a function of h rises faster than 1=cos h. 221 If the mask erodes significantly, then the bottom edge will move inward, away from the etched step and the feature will develop a tapered profile. 222 In addition, deflection of ions from the sloped surface can cause a microtrench to develop at the feature base (see below) as well as affect the profile of a neighboring line [see Fig. 31(a) ]. 222 Impinging ions are nearly perpendicular to the wafer surface, but there is a small lateral component that can affect the profile of the feature being defined. In some cases this can lead to a near-specular reflection off vertical sidewalls, increasing the ion flux near the base of the feature and causing microtrenching [ Fig. 31(b) ], an undesired effect. 221 In other instances, the off-normal-incidence ions cause a slope in the profile and may round the bottoms of trenches (depending on the application, this could be a desired or undesired effect). A good example of these two effects is the etching of trenches in silicon with Cl 2 or HBr-containing plasmas (see also discussion above). The former causes the formation of sharp microtrenches while the latter leads to much wider trenches that have rounded corners. [223] [224] [225] [226] Undercutting [ Fig. 31(c) ] is the result of isotropic etching that will lead to lateral as well as vertical etching. When the isotropic component is strong (such as etching silicon with low bias SF 6 or NF 3 plasmas), a profile similar to the one obtained by wet etching is observed. This is caused by a chemical attack by neutral species [F for all forms of Si and Cl for n þ -Si (see Fig. 15 ), Cl and Cl 2 for Al]. It is possible, however, to increase the vertical component (by increasing the bias power, for instance), to reduce the relative undercut. To avoid undercutting completely in these situations, a sidewall protection scheme is necessary (See discussion on aluminum etching, Sec. V C 6 below) Often the application of bias leads to formation of by-products that aid in this sidewall protection, such as resist erosion during Al etching. can be either polymer formed during the etch process combined with etch by-product, or material being sputtered during etching. An example of the former is the excessive amount of sidewall passivation films that deposited during aluminum etching, 227 while the latter could be sidewall deposits of nonvolatile by-products from sputtered copper or gold. 228 Notching at bottoms of etched features [ Fig. 31(e) ], at the interface with the layer below, can occur for a number of reasons. One can be positive charging of the underlying layer that can deflect positive ions toward the sidewall. 229 This is usually observed at low bias power in, for instance, gate overetch, where the low power is needed to preserve the thin gate oxide. Another mechanism occurs in etch processes that rely on sidewall passivation to achieve anisotropy. The sidewall protective layer near the interface may be too thin to provide protection, leading to isotropic etching at the interface. This can occur, for instance, in Al etching with a hard mask, using a Cl 2 and HBr mixture to generate by-products that passivate the sidewalls. 230 The passivation layer at the bottom of the feature is too thin to protect the aluminum from the highly isotropic etch by Cl 2 . 230 Notches at the bottoms of etched features could sometimes be desirable, such as in the formation of a notched gate (T-gate), to reduce the overlap capacitance between the gate and source and drain regions. 231 Reentrant profiles, [ Fig. 31(f) ] could be the result of either inadequate passivation as the etch progresses deeper, 232 the result of ion-scattering by the etch-stop layer, 233, 234 or a divergent ion flux. 235 
B. Etch chemistries
The etch chemistry chosen for a particular application is based on the material being etched, the etchers being used and other considerations such as uniformity, selectivity, etch rate, etc. discussed in the previous sections.
Dielectrics
Fluorocarbon and hydrofluorocarbon gases, usually with some additives, are typically used for dielectric etching, such as SiO 2 , Si 3 N 4 , SiN, and various low-j dielectrics. CF 4 [FreonV R 14 (Ref. 236)], C 2 F 6 (Freon 116), and C 3 F 8 (Freon 218) were used in the early days to etch SiO 2 , 28,29 usually with addition of O 2 when selectivities to resist and/or silicon were not critical, and polymer control was important. Hydrogen addition to the above fluorocarbons was used to increase selectivity to silicon, 28, 29, 237 with the drawbacks of polymer formation, decreased etch rate, 237, 238 and deep penetration of hydrogen into the silicon substrate [239] [240] [241] [242] [243] [244] [245] [246] [247] that can lead to device degradation if not annealed properly. In applications where this is not an issue (such as patterning of waveguides), hydrogen may be used as an additive to control selectivity to silicon and/or photoresists. CHF 3 (Freon 23), 28, 29, 37 with some additives that vary with tool and application, has been used for many generations of device for SiO 2 and SiN etching. The additives are needed to control polymer build-up on the wafer as well as the chamber wall. O 2 and CO 2 were common additives in batch reactors; however, as the industry migrated to single wafer etchers, mixtures of CHF 3 with CF 4 became more common, usually with Ar as a diluent. The selectivity with respect to silicon could be controlled by the amount CF 4 added to the mixture; increased CF 4 concentration yields lower selectivity but less polymer formation. 217 Over the years, etching chemistries for dielectrics evolved too, with the introduction of gases with lower fluorine-to-carbon ratios (F:C) to increase selectivity both to silicon and photoresist masks.
Gases, such as c-C 4 F 8 (Refs. 28 and 29) (octafluorocyclobutane-F:C ¼ 2), c-C 5 252 However, the performance of these gases in a commercial reactor may be different and will depend on the particular etcher used.
Common additives include O 2 , Ar, CO 2 , and CO. The latter is used to control selectivity, since it tends to minimize the amount of fluorine-rich species. 253 Special care is needed with CO since it reacts spontaneously with nickel. Therefore all gas delivery hardware must be void of any exposed nickel surfaces (such as nickel gaskets). Type 316 stainless steel is an acceptable choice as gasket material (and tubing) for CO.
Although most chemistries that will etch SiO 2 will etch silicon nitride films (stoichiometric or nonstoichiometric LPCVD or plasma-enhanced CVD, collectively referred to as SiN), there are applications when one has to be etched selectively with respect to the other. Etching SiO 2 selectively over SiN can be accomplished by a polymer-producing plasma that will sustain etching as long as oxygen byproducts are released to the plasma from the etched film, but will stop on the nitride film, producing polymer deposits. 253 This process is extremely useful in etching self-aligned contacts (SAC) (to be discussed later), but difficult to implement in etching a bulk SiO 2 film on top of SiN (such as silicon trench isolation). The reverse, etching SiN selectively to SiO 2 , is needed in cases where a thin SiO 2 film serves as an etch stop layer, to avoid exposure of the Si substrate to the plasma, such as in a local-oxidation-of-silicon (LOCOS) structure, where subsequent oxidation can create oxidationinduced stacking faults. 254 This can be accomplished in a number of ways. SiN can be etched with selectivities of 3:1 or higher by SF 6 or SF 6 /O 2 /N 2 mixtures under low-bias conditions. 255 Best results are obtained in the so-called "plasma" configuration, where the wafer is placed on the grounded electrode. Higher selectivities, however, can be obtained with SF 6 257 is the suppression of CF 3 þ that are critical for etching SiO 2 , but the process may be much more complex, as discussed above. With the proper additives, and by using the right process parameters (which are tool-dependent), extremely high selectivities to SiO 2 or Si can be achieved. Once the layer below the SiN is exposed, a high rate of polymer deposition often ensues.
Low-j dielectrics in use consist of organosilicate glass, which was nonporous in older applications, but highly porous in current and future technologies (the requirement for the 22 nm node and beyond is j $ 2.2). Fluorocarbon chemistry is used to etch the low-k dielectric, stopping on the capping layer below. C 4 F 8 /CO based chemistry was common at early stages of implementation of low-j materials; 258 however, with the migration to porous materials, the F:C ratio is adjusted to minimize residues at the end of the etch. The key to successful integration is the postetch resist strip, which can affect the final j-value of the dielectric. This will be discussed below in Sec. V C 7, where damascene structures are covered in detail.
Silicon
Silicon (crystalline, polycrystalline, and amorphous) is etched in chlorine, bromine, and fluorine based plasmas; the choice depends on the application. Generally, plasmas that generate large amounts of F atoms will etch Si faster than plasmas that contain large amounts of Cl or Br, but profile control and loading are harder to achieve, due to the fast isotropic etching by F atoms (see above and Fig. 15 ). Cl 2 , HBr, or their mixture are better choices when vertical or near vertical walls are desired. When large heights or depths must be created in Si, for applications such as micromachining for MEMS or through-Si-vias (TSV), high etching rates and large selectivity to the mask are required. Here, SF 6 plasmas, usually with added O 2 , are the chemistry of choice. Since etching is substantially isotropic, the process alternates between an etching plasma containing SF 6 , and a polymer depositing plasma containing most commonly C 4 F 8 to protect the sidewalls from lateral etch. More detailed discussion will follow in the sections discussing the various structures.
Silicides
Various silicides have been used in semiconductor applications, mostly as the top layer in a gate stack. 259 It can either be etched together with the silicon layer below to form the gate-stack, or formed after polysilicon etch and spacer formation (see below) in a silicidation process. 259 The most common silicide in the former is tungsten-silicides, but tantalum and titanium silicides have been used as well. A silicon-rich silicide is usually sputtered on top of a poly-Si or amorphous Si layer, and the best etching results are obtained before the film is sintered. In conventional RIE reactors various chlorofluorocarbons, such as CFCl 3 (Refs. 260 and 261) and CF 2 Cl 2 , 260 were popular etching gases, but due to environmental concerns, as well as issues with polymer residues, they were replaced in later generation etchers by Cl 2 (Ref. 262) or Cl 2 and HBr mixtures, sometimes with the addition of oxygen, especially when the etched material is tungsten silicide. The ability to etch tungsten silicide with Cl 2 is not obvious, since tungsten does not form volatile chlorides at room temperature. However, tungsten silicide is etched at room temperature, at etch rates comparable to or higher than the etch rate of the polysilicon layer below. It is possible that residual oxygen in the chamber or buried in the film helps in the creation of somewhat volatile tungsten oxychloride (WOCl 4 ). 263 
Aluminum
Aluminum and aluminum alloys (with silicon or copper below 0.5%) are generally etched in Cl 2 based chemistries. Detailed discussion will follow in the discussion of metallization structures below.
Titanium and titanium nitride
Both Ti and TiN are used in aluminum metallization stacks. Cl 2 or BCl 3 /Cl 2 chemistries are the most commonly used. Detailed discussion will follow in the discussion of metallization structures below.
Chromium
The main use of Cr is in creating patterns on lithographic masks. Cl 2 and O 2 mixtures 264 are used at moderate pressures (tens of mTorr). The process relies on formation of chromium oxychloride (CrO 2 Cl 2 ) as a volatile by-product. When a hard mask or a silicon-containing resist is used (both erode slowly), there is a slight undercut. 265 The addition of nitrogen to the plasma helps in reducing the undercutting. 265 
Nickel and nickel alloys
Nickel does not form volatile alloys easily. However, it is possible to etch nickel with a CO/NH 3 plasma. See detailed discussion below on magnetoresistive random-access-memory (MRAM).
Other materials
Cu, Au, Pt, Ir, PZT (Pb[Zr x Ti 1Àx ]O 3 with 0 < x < 1), BST (Ba 1Àx Sr x TiO 3 with 0 < x < 1), and others that do not form volatile compounds at room temperature can be etched in specially designed etchers with chucks that can be heated to a few hundred degree Celsius, 63 using a hard mask or by reactive sputtering. 266 The etch chemistry will be material dependent.
There are number of reports of gold etching in HBr/Ar, 125 C and above, [268] [269] [270] Au etching rates in Cl 2 as high as 980 nm/min with 10:1 selectivity to the SiO 2 hardmask was realized. 270 There was no evidence of deposition on the sidewalls and 0.5 lm wide features, 1 lm high, appear to be vertical.
C. Structures
Various structures that are created with the aid of plasma etching are listed below. It is not possible to cover all applications, so the attempt here is to address structures in use in modern ICs. Therefore, older application for structures and techniques such as LOCOS, poly-Si buffered LOCOS, plasma planarization, "champagne glass" contact, etc., have been left out.
Shallow trench isolation
To achieve wafer planarity and enable higher device density, shallow trench isolation (STI) has replaced LOCOS as the isolation scheme for modern devices. It consists of etching trenches into silicon, filling them with SiO 2 , followed by planarization, using chemical-mechanical-polishing (CMP) to achieve a nearly planar structure on which the device is to be built (Fig. 32) .
The process involves deposition of a silicon-nitride film on a thin layer of oxide, followed by patterning with photoresist and then etching the nitride and oxide layer with fluorocarbon etch chemistry (e.g., CHF 3 /CF 4 /Ar-Sec. V B, above). The photoresist can be either left in place, or removed prior to the silicon etch. In the latter case, the silicon-nitride serves as a hard mask during etching, so the etch chemistry must be selective to nitride. The requirements are to have a slightly tapered trench wall (to facilitate a good trench-fill), flat bottom and rounded corner at the bottom of the trench. Rounded corners at the top of the trench are necessary as well, but this can be accomplished in postetch processing. 271 Cl 2 plasmas can etch Si with high selectivity to siliconnitride, but the profile is inadequate and leads to microtrenching. 169, [223] [224] [225] HBr or HCl (Ref. 226 ) containing plasmas lead to highly tapered profiles, but a mixture of Cl 2 and HBr gives the desired profile, usually with the addition of oxygen. The latter is added mainly to increase selectivity to the mask, but it leads to the deposition of etch by-products on the trench wall. 272 The process is highly sensitive to oxygen concentration and the addition of too much oxygen can lead to heavy deposits on the entire wafer. Therefore, to achieve better control, the oxygen is often diluted in helium (usually 20% O 2 in He).
When silicon-on-insulator (SOI) wafers are used, the buried-oxide (BOX) layer is a natural etch-stop for the process. With Si wafers, or when the SOI trench does not reach the BOX layer, uniformity is extremely crucial for consistent trench depths and device performance across the wafer.
The SiN mask remains on the wafer during the trench fill process and the subsequent CMP step. The CMP step is selective to SiN and the planarization is stopped once the SiN layer is exposed. The SiN layer is then removed, and after subsequent wet cleans (which include dipping in HF solution) the surface of the wafer is nearly planar.
Gate
Polysilicon and amorphous silicon with or without a top conducting layer (silicide or TiN) have been used as the gate materials for many device generations. The gates are formed by depositing a poly-Si (or amorphous Si) layer on a thin layer of oxide (either oxidized silicon, or high-j dielectric, usually HfO 2 -based). The poly-Si is doped with proper impurities, usually by ion-implantation, and in symmetric complementary metal oxide semiconductor (CMOS) devices, both p-and n-type gate materials are present on the same wafer (in some devices only n-type poly-Si is used and that can be achieved either in-situ or by postdeposition doping). In some instances, other conductive layers such as a silicide (most commonly WSi x , but historically TaSi x and TiSi x ) have been used. 259 In other schemes, the silicidation takes place after the gate formation, where the silicide is formed on the gate source and drain simultaneously.
The mask can be photoresist or a hard mask (e.g., SiO 2 or SiN). Though adding complexity, the hard mask offers advantages of higher selectivity to oxide, and better CD control, especially when differences between isolated and nested lines have to be minimized. If photoresist is used as a mask, the eroded resist becomes part of the etch chemistry; it can lead to deposits on the sidewalls and lower selectivity toward SiO 2 , hence leading to erosion of the underlying gate oxide layer. 273 When a hard mask (e.g., SiO 2 or SiN) is to be used, it is deposited on top of the gate stack, followed by an antireflective coating (ARC), resist application and lithography. The ARC can be either organic or inorganic. Organic ARCs can be etched either in O 2 or CO 2 plasmas while the inorganic ARC can be etched with the same chemistry used to etch the hard mask below (using CHF 3 /CF 4 /Ar chemistry, for instance). The resist is then stripped followed by the proper wet cleans (Fig. 33) .
In some cases, the resist is trimmed prior to hard mask etching to obtain dimensions not attainable by the lithographic process. This is accomplished in oxygen plasmas under conditions yielding an isotropic etch process. Etch uniformity is extremely critical, and this process will inevitably increase the relative variability in CD control. For example, if the desired gate width is 0.1 lm and the gate is printed at 0.15 lm 6 10%, the best that can be achieved after the trim will be 0.1 lm 6 15%, assuming no additional variability added by the trimming process.
The gate etch process usually consists of multiple steps. First, there is a breakthrough step to remove the native oxide and initiate etching. It usually consists of Cl 2 or Cl 2 /Ar, at relatively high bias power. This is followed by an etch process optimized for good CD control but not necessarily selective to oxide. The etch chemistry usually consists of a Cl 2 /HBr mixture, sometimes with the addition of small amounts of O 2 diluted in He. The Cl 2 /HBr ratio will depend on the application. In pure Cl 2 , highly n-doped polysilicon tends to etch isotropically (and faster than p or undoped polysilicon), and therefore, higher concentrations of HBr will be required, since the isotropic etching rate of n þ -Si by Br atoms is much slower than by Cl atoms (see Fig. 15 ). Once endpoint is detected, usually by optical emission (see Sec. III F), an overetch step follows. This step is intended to clear residual poly-Si remaining on the wafer due to nonuniformity, and in some cases, due to topography. The overetch step usually involves adding (more) O 2 to a Cl 2 /HBr (or just HBr) plasma and reducing the bias voltage. In critical cases, where extremely thin gate oxide is used, the overetch step is initiated before the polysilicon is cleared. An endpoint prediction scheme, relying on interferometry with an external light source, can be used to terminate the main etch step as little as 10 nm before endpoint. [96] [97] [98] The remaining gate oxide after etch cannot be measured reliably. From wet etchrate measurements, 96 ,274 it appears to be a porous material. When silicide or TiN is used as a top layer, the etch process may include another step optimized to etch that particular layer. This step is usually Cl 2 or Cl 2 /HBr based, and in the case of silicide, it may be advantageous to add small amounts of oxygen to increase the silicide:poly-Si etch-rate ratio. When TiN is used as the top layer, large amounts of oxygen added to the plasma will hinder the etch process. Cl 2 , possibly with the addition of inert gases, is the most suitable etch gas.
The above approach has been used for gate formation and is known as the "gate first" approach where the gate is formed before the source and drain are constructed, by doping, epitaxially raised and/or silicided. In some cases, after spacer formation (see below), the hard mask is removed, and source, drain, and gate are silicided simultaneously.
A further evolution of the gate first approach is the introduction of high-j dielectrics (currently HfO 2 based, but earlier investigations included ZrO 2 based dielectrics as well 275 ) and metal gates for added performance. The details are proprietary and will vary between manufacturers, but the gate metal under poly-Si is different for the p-and n-channel gates. 276 If TiN based metal is used for the gate, a Cl 2 /HBr based 277 chemistry is used for etching the gate and BCl 3 based chemistry (possibly with some additives) is used to remove the high-j dielectric after the gate is etched, [277] [278] [279] [280] sometimes at elevated temperature. Another more recent approach, known as "gate last" (Fig.  34) , relies on forming a poly-Si dummy gate (using a hard mask) that stays in place while the source and drain are being formed. There are two flavors to this approach: "highj first" 276, 281 and "high-j last." 281 In the former, the polysilicon gate is formed over a high-j dielectric, in an identical fashion described in the gate first approach. After the formation of source and drain, a SiO 2 layer is deposited over the entire wafer, which is then planarized using CMP, until the top of the polysilicon gate is exposed. The poly-Si is then removed selectively, and the cavity is filled with the appropriate metal layers. A metal CMP step then follows to remove the metal everywhere but the gate cavity. The highj last approach is similar, except that the sacrificial poly-Si gate is constructed on a sacrificial SiO 2 layer, to be removed after removal of the poly-Si. A high-j dielectric layer is then deposited, followed by metal deposition and CMP to form the gate. 281 In either "gate last" approaches, it is possible to etch back the metal gate to create a recess to be filled with a dielectric (e.g., SiN). This will be useful in the formation of selfaligned contact, to be discussed below.
Gate spacer
This sidewall, intentionally deposited on the sides of the gate (Fig. 35) , serves multiple purposes. For simultaneous silicidation of gate, source, and drain, it prevents the shorting of these areas. It is also used as an implant mask in the creation of lightly doped-drain. Either SiO 2 or SiN x can be used as the spacer material. The latter is useful in the fabrication of self-aligned contacts, to be discussed below.
After fabricating the gate, a layer of oxide (or nitride) is deposited over the entire wafer, followed by a blanket etchback. The process is terminated based on optical emission endpoint with some overetch. A typical etch chemistry for this step is CHF 3 /CF 4 /Ar, which will etch SiO 2 as well as SiN, and depending on the CF 4 concentration, will be selective to silicon. In some integration schemes, when nitride is used and it may be desired not to expose the Si substrate to the plasma, a thin layer of oxide is formed before the nitride deposition, and the etchback may consist of one or two steps, depending on the thickness of the nitride. The last step, selective to SiO 2 , is carried out in a CH 3 F (or CH 2 F 2 ) containing plasma that can be adjusted to obtain high nitride:oxide selectivity. The oxide layer can then be removed, exposing the silicon substrate below.
Contacts
In this step, contacts are opened both to the gate and silicon substrate. If the first dielectric layer is planar, the depths of these contacts can be significantly different, depending on whether the contact is to be made to the gate or source and drain. Therefore, the selectivity to the gate material has to be high. The chemistries used for this step are usually C 4 F 8 , C 5 F 8 , or C 4 F 6 based, and typically, a medium density plasma such as in a MERIE or multiplefrequency capacitively coupled etcher is used. A slightly tapered profile (88 -89 ) is often desired to facilitate a good contact metal-fill. ARDE control is extremely important in this step. Although this step calls for equal size openings, there could be slight variability across the wafer. In addition, since the final aspect ratio of contacts to the source and drain regions can be considerably higher than the one to the gate, any etch rate reduction due to ARDE will inevitably lead to a long overetch of contacts to the gate (this is one motivation for using elevated source and drain regions). Selectivity to the underlying layer (Si, SiN, or silicide) is controlled by the thickness of the fluorocarbon film. 282 When the contact is terminated on silicide, excessive exposure of the silicided source and drain to ion bombardment can lead to junction degradation. This is generally addressed by a highly selective etch process that relies on a thick fluorocarbon layer for protection. In some integration schemes, an etch stop layer such as SiN x covers the silicided gate, source, and drain, and a "soft" etch is used to clear all contacts simultaneously with minimal overetch. This approach is also used to form a borderless-contact, 283 where the contact to source or drain may overlap the isolation region and excessive oxide overetch is undesirable. When SiN x is used for the etch-stop layer, a number of fluorocarbon gases can be used for etching; when high selectivity to SiO 2 is desired, either CH 3 F or CH 2 F 2 can be used for the nitride etch.
Self-aligned contacts
In many instances, such as memory devices, it is desirable to have the contact to source or drain extremely close to the gate. Unfortunately, due to misalignment, the printed contact may overlap the gate, and eventually, once the contact is filled with metal, the gate and source (or drain) will be shorted. This can be overcome by encapsulating the gate with a dielectric resistant to the SiO 2 etch chemistry used for opening the contact. Using SiN as the hard mask and spacer will achieve this goal. 283 The etch chemistry used for this step is typically C 4 F 8 based which can be tuned to achieve high selectivity with respect to the nitride. 282, 284 Once the SiO 2 is cleared, a short step to remove the SiN and expose the source or drain is then performed (Fig. 36) . 
Aluminum
Aluminum based alloys were the dominant conductors for interconnect in VLSI circuits before the introduction of copper, although other metals, such as tungsten, 263, 285 have been used occasionally. The aluminum alloys usually contain small amount of silicon and/or copper-the former to improve contact resistance when contact is made to the silicon substrate and the latter to control electromigration. 286 Since copper does not form volatile compounds with the chemistries used for aluminum etching (at the standard operating temperatures), its concentration is limited to less than 5%. When the Al layer is "sandwiched" between TiN layers, the same chemistry is used to etch the TiN layers (and the Ti that is used as an adhesion layer below the lower TiN layer) with possible variation of process conditions tailored for each layer.
Al will etch spontaneously in Cl 2 gas, but the presence of surface oxide inhibits etching. 290 scavenge any moisture in the chamber that may inhibit the etch process, and possibly to inhibit sidewall etching. 291 The interaction of the plasma with photoresist leads to formation of a layer on the sidewall that will prevent lateral etch of the aluminum, 292 but may lead to subsequent corrosion (discussed below). Sometimes an additive to the feed gas helps in the formation of sidewall passivation and can also be used to taper the metal lines. Typical additives are N 2 and CHF 3 , 227 but other polymer-forming additives have been used as well [e.g., CHCl 3 (Ref. 293) ]. Although the additive may have a positive effect on the etched profile, it can cause particulate formation on the wafer due to the flaking off of deposits from the chamber walls. Therefore, the amount and type of the additives are important in establishing the optimum tradeoff between a desired profile and minimum wall deposits.
A hard mask can also be used for the pattern transfer. It has the advantage of minimized variation between isolated and nested aluminum lines, but since a major component of the etching process, the eroding photoresist, is absent, the etching process has to be modified. Etching in a low pressure ($2 mTorr) Cl 2 /HCl/N 2 plasma has been used successfully to pattern an aluminum stack consisting of TiN/Al/TiN/Ti. 230 Etching uniformity is one of the challenges in aluminum etching. 288 Generally, the etch rate at the edge of the wafer is higher than the center and the metal is cleared in "bullseye" pattern. However, by process optimization and the use of focus-rings, the effect can be minimized.
Postetch corrosion is a major concern and it can be either purely chemical or galvanic. The chemical corrosion is associated with residual chlorides present on the wafer, especially on the sidewalls. Although during etching the wafer is heated to C to help volatilize the etch by-products, some AlCl 3 is embedded in the sidewall deposits, leading to chemical reactions with moisture in the air
Al is also consumed by aqueous HCl
The process continues to corrode the aluminum, creating "worm"-like residues that are easily observed in an optical microscope (Fig. 37) . The standard procedure to avoid chemical corrosion is to minimize sidewall deposits, heat the wafer during etching to the maximum temperature that will not reticulate the photoresist, and use a passivation step, combined with a partial or complete stripping of the photoresist. The passivation/strip is carried out by transferring the substrates in a load-lock, under vacuum, to a separate chamber designed specifically for that purpose (in earlier batch metal etchers, resist strip was carried out in-situ using an oxygen plasma with small amount of fluorine containing gas, 227 with the intention to convert the chlorides to noncorrosive fluorides). The passivation step is intended to convert the residual chlorides to volatile HCl and is accomplished by hydrogen-containing plasmas, typically water vapor. 295 Additional steps to reduce chemical corrosion are to heat the wafer after the passivation step on a hot stage in the etch tool, keeping the wafer under vacuum until the entire lot is etched, water rinse immediately after venting to dissolve any residues that are left, and chemical sidewall removal, followed by another resist ashing step to insure complete resist removal. Wafers should be kept in a dry-box before a capping oxide layer is deposited. The time interval between completion of the etch and the deposition step should be short, preferably less than 24 h.
Galvanic corrosion takes place when a galvanic cell is formed between two dissimilar metals in the presence of an electrolyte. The two metals in this case are the aluminum metal and the TiN layer(s) if used, and the electrolyte is the residual chlorides dissolved in water. This galvanic corrosion looks different than the chemical corrosion described above and is characterized by voids in the aluminum, while the TiN layer(s) is (are) intact. Usually, the TiN is oxidized, preventing the formation of the galvanic cell; however, if there are discontinuities in the film the cell can form, and often, galvanic corrosion is observed in a small number of sites. The galvanic corrosion is prevented by minimizing sidewall deposits, and a postetch rinse in an adequate amount of water: quick-dump-rinse or overflow-rinse are preferable to spin-rinse.
The major component in prevention of the two type of corrosion is the minimization of sidewall deposits. However, these deposits are necessary for anisotropic etch. The optimized etch process has to take into account these diametrically opposite requirements: just enough sidewall deposits to prevent undercut, but not enough to trap large amounts of residual chlorides.
Damascene structure
Modern metallization schemes utilize copper wiring inlaid in a dielectric, 296, 297 producing structures similar to damascene ornaments, where precious metal is inlaid in another metal. 298 Since metallization schemes involve a large number of metal layers, the entire structure is repeatedly planarized at every level. The first dielectric level, encapsulating the active area, is doped SiO 2 glass with metal (e.g., tungsten) filled contacts to the source, drain, and gates. All subsequent dielectric layers consist of low dielectric-constant material, such as organosilicate glass, which was solid in early implementations, and porous in subsequent technologies. The etching is accomplished with fluorocarbon-based plasmas and the F:C ratio is adjusted to minimize residues. Trenches (for intralevel interconnect) and vias (for interlevel interconnect) are formed within these layers, to be filled in subsequent steps with copper (in early versions with aluminum) and planarized by CMP. The resulting structure is a planar surface on which the next level is to be built in a similar fashion.
In a single-damascene scheme, contacts are made first to the level below through vias formed within the dielectric, to be filled with metal and polished to achieve planarity. The intralevel connection is achieved by another deposition of dielectric in which trenches are formed and filled with metal in a similar fashion described above. The more common process, however, is the dual-damascene scheme, where vias and trenches are etched before the metal fill step.
There are a number of approaches to create a dualdamascene structure with and without an intermediate etch layer that provides an etch-stop for the trench etching. The "via first" approach shown in Fig. 38(a) has evolved with time and has been the preferred implementation of dualdamascene structures. The version shown in this figure shows more recent implementation of the structure where no trench etch-stop layer is present within the low-j dielectric. In this approach, the via is etched to completion, resist is stripped (discussed below), followed by lithographic and etch steps to form the trench. A planarizing layer, capping layer (both absent in early implementations) and ARC are also shown. The role of the planarizing layer is to smooth the topography and, combined with the capping layer, it enables the use of a thin imaging layer. The pattern is first transferred to the capping layer (which could be either SiO 2 or SiN x , serving as a pattern-transfer layer) followed by a pattern transfer to the planarizing layer. The etch chemistry for the latter step is either O 2 or CO 2 based; the latter gives a more anisotropic profile than the former. Sometimes the ARC layer can serve in dual role as an antireflective layer as well as a pattern-transfer layer. In the "trench first" approach [see Fig. 38(b), shown with  a trench etch-stop layer] , after the deposition of the low-j dielectric, trenches are patterned first by standard lithographic technique and etched to a finite depth into the dielectric. After resist removal, another lithographic step defines the via, and the etch process continues all the way to form the contact to the metal level below. A capping layer on top of the metal below is then etched in a low-bias process to reduce sidewall formation.
Since oxygen plasmas can degrade the low-j dielectric, 299 resist ashing is not performed in an oxygen plasma asher, but rather in a medium-density plasma (e.g., capacitively coupled plasma etcher), using plasmas with lowoxygen concentration or other chemistries, such as N 2 /H 2 based chemistry. Regardless of the ashing process used, some degradation of the dielectric is observed, 299,300 which is substantial for ultra low j (j < 2.3) dielectrics. Therefore, newer dual-damascene schemes such as the trench first metal hard mask (TFMHM) approach attempt to minimize exposure of the low-j dielectric to resist-stripping plasmas.
TFMHM is the latest evolution in dual-damascene implementation, where the low-j dielectric is etched with a metal hard mask. In this approach, metal (usually TiN) is patterned first (Fig. 39 ) with a trench pattern and etched. After resist strip, the vias are patterned and partially etched into the dielectric, followed by resist strip, and simultaneous etching of trenches and vias. There are a number of advantages to this method, such as self-aligned vias, 301, 302 and reduced degradation of the low-j dielectric associated with resist stripping. 303 However, other issues have to be dealt with, such as hard mask etching, and a clean etch of the dielectric without residues associated with sputtered hard mask material. 301, 302, 304 
FinFETs
To achieve higher performance at lower voltages, three dimensional transistors are finding their way into advanced devices. The fabrication involves creating "fins" in the substrate on which field effect transistors are built. [305] [306] [307] The gate wraps around the fins (Fig. 40) and depending on the number of the sides of the fin used to form the device, bigate or trigate FETs are created.
308 Figure 40 depicts a trigate device, while a bigate FET will have an insulator on top of the fin. There are two approaches for fabricating these devices, depending on the substrate-SOI or bulk silicon. These two approaches are shown in Figs. 40(a) and 40(b) for SOI and bulk silicon, respectively. The approach taken will depend on a number of factors, one of which is the cost of the substrate. 309 The gate width for a single-fin tri-gate device will be the sum of twice the height plus the width. Multiple fins can be used to achieve wider gates.
The height of the fins depends on the technology-node. For instance, at the 22 nm node, the fin is about 8 nm wide at its midheight and 34 nm high. 310 With SOI substrates, where the device-layer thickness equals the targeted fin-height, trench isolation is performed first, and then the fins are etched into the device layer, stopping on the buried-oxide layer. If bulk silicon is used, the fins are etched at the same time as the isolation trenches, using a hard mask. Ideally, vertical fins are desired, and the etch is carried out in two step-the first to create the vertical profile needed for the fins, followed by an etch step that creates a tapered profile needed for the trench. 311 After oxide deposition and planarization, the oxide in the fin areas is etched back to the appropriate depth to expose the fins. Since the sidewalls of the fins are used for the active devices, surface roughness associated with the etch can lead to creation of interface-traps. This can be addressed by the etch process 311 and/or postetch surface treatment.
Once the fins are created, gate formation follows with either "gate first" or "gate last" approaches. The only difference between formation of planar and FinFET gates is the long overetch-steps associated with the topography. In the gate-first approach, both metal-gate and hard mask have to be cleared, 312 while in the gate-last approach, the sacrificial polysilicon has to be etched in its entirety without penetrating the SiO 2 layer below. In both cases, the spacer formed before the source and drain construction must be cleared from the walls of the fins, while leaving the gate (or sacrificial gate) protected.
Evolution to more complex 3D devices is expected. The "Pi-Gate," "Omega-FET," and "Gate-All Around" as well as the use of new materials 313, 314 will, no doubt, pose some integration and etching challenges in the future.
D. Micromachining
Silicon-based MEMS can be fabricated by techniques similar to those used in silicon IC fabrication. Although ceramics, polymers, and metals are also MEMS materials, the focus of this review is on silicon-based systems only. As will be discussed below, etching techniques used for bulk micromachining have been adopted for etching TSV to facilitate electrical connections to the backside of the wafer.
Some MEMS involves fabrication of devices only on the surface of a silicon wafer, and the techniques used to fabricate these devices fall into the class of surface micromachining. Bulk micromachining involves utilizing a greater portion of the silicon wafer, typically more than 10 lm of silicon.
Surface micromachining
The etch processes used for surface micromachining are similar, with some modifications to etch processes used in IC fabrication. Cl 2 or HBr based processes are used to etch silicon, but in most cases, high selectivity to oxide is not required. CD control is important in some cases, and controlling CD when high aspect-ratio structures are desired could be quite challenging, similar to that encountered in the fabrication of trench capacitors in DRAMs.
In many cases, especially when high-aspect ratio structures are to be etched, a hard mask (SiO 2 or SiN x ) is used. For these applications, SF 6 -based chemistry can be used. Profile control is a challenge, since SF 6 will tend to etch Si isotropically, but with the proper additive for sidewall passivation (e.g., C 4 F 8 , Cl 2 , or HBr), anisotropic profiles can be achieved (Fig. 41) . If the process is terminated on an etchstop layer, such as the BOX layer used in SOI wafers, the overetch step usually differs from the main etch step to avoid undercut during the overetch. This is accomplished either by modifying the gas flows and/or bias conditions, or switching to another etch chemistry, such as Cl 2 /HBr.
Bulk micromachining
Silicon bulk micromachining requires etching to depths greater than 10 lm. It is commonly referred to as "deep reactive-ion-etching" (deep RIE or DRIE) and etch tools are modified high plasma-density etchers, with additional features to facilitate high etch rates with some degree of profile control.
To achieve high etch rates, SF 6 is the source gas, sometimes with the addition of O 2 primarily to reduce the chances of sulfur build-up in the exhaust line. To obtain anisotropy, the sidewalls of the features being etched must be passivated. One approach is to cool the substrate [315] [316] [317] to typically below 220 K (Ref. 317 ) and slow the rate of isotropic etching by F-atoms (see Fig. 15 ). (Ion-assisted etching processes typically have little or no temperature dependence.) The cold temperatures also lead to a buildup of SiO x F y byproducts on the sidewalls, also suppressing isotropic etching. 315, 317 It produces smoother sidewalls than the switched process described next, but sufficient time is needed for wafer cooling and warming before and after the etch, respectively. Once the wafer is warmed, the protection layer is volatilized, and if additional etch time is needed after postetch inspection, undercutting will occur due to the absence of a protection layer on the sidewall. 317 A second method, known as the Bosch Process, 318,319 is a switched-gas process that utilizes alternate steps of etching with SF 6 as the feed gas, and polymer deposition, usually with c-C 4 F 8 as the feed gas. The switched process is the one more commonly used, and it requires fast acting mass-flowcontrollers to switch between etch and deposition steps, which occur every few seconds. An ICP etching tool is commonly used for the process and bias power is turned on only during the etch step and is kept low-only a few watts. Source power during the etch step depends on the total gas flow (to be discussed below). During the deposition step, polymer is being formed on the horizontal surface as well as on the sidewall. The etch step then removes the polymer from the horizontal surface, and proceeds with etching of silicon, while the remaining polymer on the sidewall, even though it is being eroded, provides protection against lateral etching. The resulting sidewalls show striations (Fig. 42) , and may be an issue when smooth sidewalls are needed (e.g., mirror surface).
It is desirable to use etch mask and etch stop materials (if needed) that are resistant to the fluorine plasma. SiO 2 , Al 2 O 3 , and photoresist etch masks etch very slowly with selectivities with respect to silicon of 250:1, >10,000:1, and 50:1, respectively. 320 SiO 2 and Al 2 O 3 can also be used as an etch stop material. Care must be used when choosing the etch stop and etch mask material such that the material chosen does not sputter during the etch to deposit nonvolatile products. This will cause "grass" to form in the etched areas due to micromasking. 321 In many instances, the etch depths are hundreds of lm, sometimes the full thickness of a silicon wafer (725 6 25 lm for 200 mm diameter wafer). Therefore, a high etching rate is important. Etching rates can be increased by increasing gas flow, coupled with an increase in source power (Fig. 43) . However, selectivity is reduced (the Si etching rate increases at a slower rate than does erosion of the mask), and it is claimed that this is due to increased ion flux. 229 Plasma source power affects ion energy distributions as well as ion flux. Ion-flux/energy control, offered on some etchers, is the key to achieving high etching rates without the adverse effects such as degraded selectivity mentioned above. One approach is to use pulse shape biasing. 322, 323 Etching of Si for MEMS devices has many of the same issues as for Si integrated circuit manufacturing. Etch rates are governed by the flux of reactants reaching the substrate surface. Therefore, etch rates drop as the total silicon area exposed to the plasma is increased (i.e., loading effect- Fig.  44 ). Therefore, Si etch rates for via patterning will be higher than for trench patterning. Although absolute etch rate is an important consideration, uniformity of the etching rate is important as well. Etching nonuniformity is less important when an etch stop layer is used, but selectivity and thickness of the etch stop layer must be adequate to offset the etching nonuniformity.
Another important structure in advanced MEMS as well as integrated circuits is a TSV, also known as through-wafer-via. 324 In MEMS fabrication, it enables electrical routing on the back of the wafer, freeing important "real estate" on the front of the wafer. The structure can also be used for bonding various MEMS modules to create a highly complex system, which combines MEMS and low and high-voltage control circuitry in one assembly. 325 In IC fabrication TSVs are used for 3D interconnects. There are number of ways to create electrical interconnect using TSV structures. They all involve creation of vias, depositing an insulating layer on the sidewalls, filling the via with conducting material, wafer thinning and stacking multiple wafers to create the 3D device.
E. MRAM and FeRAM
Magnetoresistive and ferroelectric random-access-memory devices known as MRAM and FeRAM (or FRAM), respectively, are classes of nonvolatile RAM devices made of materials that do not form volatile compounds when etched in plasmas near room temperature.
MRAM memory cells utilize magnetic tunnel junctions to form a nonvolatile storage cell, integrated into the backend processing of a CMOS device. Multiple stacked thin films form the cell. At a minimum, there are two magnetic layers separated by a thin dielectric layer. The magnetic moment orientation of one of the magnetic layers is fixed while the orientation of the moment of the second magnetic layer is free and it could be set by the control circuitry to be either parallel or antiparallel to the orientation of the fixed moment. The tunneling magnetoresistance will change based on the orientation of the moment of the free layer and that can be sensed by the CMOS circuitry. In reality, there are additional layers used to pin the moment of the fixed layer and prevent it from changing orientation during a "write" operation. The end result is a stack of thin (order of 1 nm or below) layers with (Fig. 45) .
Unlike the etching of ferroelectric devices (see below), the etching of these stacks cannot be carried out at elevated temperatures due the instability of the alloys used. Chlorine, bromine, and fluorine based chemistries can be used 266, 328, 329 to create compounds that are sputtered away, but postetch corrosion associated with residues on the wafer must be addressed. 266 In some instances (NiFeCO), enhanced etch-rate in Cl 2 /Ar plasmas with UV illumination has been observed, 330 but the mechanism is not well understood. Alternately, a CO/NH 3 mixture has been demonstrated as another option, since it can form carbonyls that are volatile, [331] [332] [333] [334] with some enhancement with the addition of Xe. 335 This process is not entirely chemical, however, and sputtering does take place, as evident from the thick sidewalls of redeposited sputtered material. 336 Another option to avoid patterning tough-to-etch materials is to use damascene structures. In this scenario, cavities are etched into the dielectric, and a stack of all of the above materials is deposited followed by planarization by CMP. 266 Although this solution seems simple, it basically transfers the difficulty from etch to the process of chemical-mechanical-polishing. There may be some other integration issues associated with this approach.
FeRAM, known also as FRAM or F-RAM, is a random access memory, where the memory cell consists of one capacitor and one access transistor, similar in structure to a DRAM. The dielectric in the capacitor is a ferroelectric material such as PZT, while materials such as Ir and IrO 2 are used for the electrodes. Etching of the stack at temperatures above 350 C in a capacitively coupled plasma reactor with magnetic confinement yielded vertical profiles. 63 Chlorine and fluorine based gases together with oxygen and argon were part of the etch chemistry.
VI. FUTURE
It has been said that "Studying the past is no sure guide to predicting the future. 337 " It has also been said, allegedly by Niels Bohr (and more recently, in a slightly different manner by others, including Yogi Berra), that "Prediction is very difficult, especially about the future. 338 " Nevertheless, for an industry with annual sale of roughly $300 Â 10 9 , 339 some judicial assessment of future technologies is needed. The semiconductor industry, like others, develops and lives by roadmaps. The corresponding roadmap is the International Technical Roadmap for Semiconductors (ITRS, published in earlier editions by the Semiconductor Industry Association as the National Technical Roadmap for Semiconductors), 340 revised annually, with complete reports published biennially. Over time, changes in the ITRS may be significant; however, it is a blueprint for evolution of devices, technology, and processes, and it gives guidance to research and development efforts. The discussion below is based loosely on future device and technology requirements, covered in the relevant chapters of the 2011 edition of the ITRS. [341] [342] [343] [344] [345] [346] A. Patterning
Aside from advances in lithographic techniques, such as using next-generation-lithography tools, there are numerous techniques to achieve sub-lithographic features. Some involve pure lithographic methods, such as double-exposure, 346 some rely on combinations of lithography and etching, and other approaches use solely etching techniques to define features smaller than the capability of the lithographic tool. A number of methods have been proposed; some are beginning to find their way into manufacturing, while some are at the early stage of development and assessment of their capabilities and limitations. Some techniques are more appropriate for less dense logic devices, while others are intended for dense structures.
The method of trimming is discussed in the section on gate patterning, and it has been in use in production for a number of years. It is adequate for isolated lines, but not for creating dense patterns. Aside from CD control issues, there are also limitations associated with thinning of the resist as the result of the trimming operation.
The method of double patterning (DP) [346] [347] [348] utilizes two lithography and etch steps to pattern a single layer, which is then used to transfer the pattern to the target layer (e.g., gate, trench, etc.). It is often referred to as the litho-etch-litho-etch (LELE) method. 346 The technique is illustrated in Fig. 46 ; after the first lithographic step [ Fig. 46(a) ], a hard mask is etched, followed by resist strip [ Fig. 46(b) ] and another lithographic step [ Fig. 46(c) ], where additional features are printed and etched to create a dense pattern after the resist is stripped [ Fig. 46(d) ]. The resulting hard mask is then used to etch the target layer [ Fig. 46(e) ].
Spacer double patterning approaches come in two varieties, positive and negative tones. In the positive tone approach (known also as self-aligned double patterning [349] [350] [351] or sidewall image transfer 352 ), the sidewall-spacer is used as the hard mask for pattern-transfer to create features well under the lithographic capability of the lithographic tool. The concept is not new; it was first used by utilizing sidewall deposits formed during trilevel etching, 353 where the etched hard-baked resist was ashed away, leaving the sidewalls as the mask for the pattern transfer. 354 Similarly, small platinum features, 50 nm wide, were fabricated by sputtering away platinum films deposited on amorphous-silicon sacrificial structures that were subsequently removed by chemical dry etching. 355 In the positive tone approach, the size of the features is controlled by the CVD and etch processes, while the space between the features is determined by lithography. The method, shown in Fig. 47 , can be used when all the features are of equal size, such as the fins in FinFETs. The first step in using this technique is to create structures (mandrels), on top of the target layer [ Fig. 47(a) ] that will be used to construct the sublithographic features. After CVD and etch, sidewall-spacers are created [ Fig. 47(b) ], followed by the removal of the mandrels [ Fig. 47(c) ], pattern transfer and spacer removal to create the desired structure in the target layer [ Fig. 47(d) ]. The mandrel material will depend on the application. To fabricate the fins for FinFETs, for instance, the mandrel could be amorphous carbon, patterned with photoresist (possibly after trimming, discussed above, and using SiO 2 or SiN as the pattern-transfer layer), while the spacer material is SiO 2 or SiN. The spacer remaining after the ashing can be used as the hardmask for the pattern transfer directly into the silicon to create the fins. Another application of the technique is metal gates on high-j dielectrics. In the negative tone approach, dense features can be created, and their widths need not be the same. In this case the spacers are disposable. 351, 356 Following the formation of the mask [e.g., SiO 2 , SiN, or polysilicon, Fig. 48(a) ] on top of the target layer (which could be either a device layer or a hardmask used for the pattern-transfer to the device layer), a spacer is formed by deposition and etchback [ Fig. 48(b) ]. A layer of oxide is then deposited to fill the gap between the feature [ Fig. 48(c) ], followed by etchback or CMP to expose the spacers [ Fig. 48(d)] . The spacers are then removed to create a dense pattern shown in Fig. 48(e) , which is then used to transfer the pattern to the target layer. The choice of materials will obviously depend on the application. Jung et al. used the technique to create a dense pattern in a target layer consisting of 50 nm SiON on top of 200 nm amorphous-carbon. 356 In this case, the mask was 200 nm polysilicon, the sacrificial spacer-sidewall was 20 nm fluorocarbon and the gap-fill material was spin-on glass.
Self-aligned quadruple patterning (SAQP) 349, 357 is an extension of the self-aligned double patterning described above. In this case, the first mandrel is used to create a denser pattern in a 2nd mandrel layer, as shown in Fig. 49 . After the formation of the first mandrel [ Fig. 49(a) ], spacers are formed [ Fig. 49(b) ], and the mandrels are removed [ Fig. 49(c) ]. The remaining features are then used to transfer the pattern to the hard mask and form a second mandrel [ Fig. 49(d) ]. After spacer formation [ Fig. 49(e) ], and mandrel removal [ Fig. 49(f) ], the pattern is transferred to the target layer [ Fig. 49(g) ].
In another variation, self-aligned triple patterning is claimed to offer some advantages over the other multipatterning schemes. 349, 357, 358 The technique involves the creation of two consecutive spacers, the first of which is sacrificial. The mandrels in this case are not sacrificial, and can be used, in addition to the spacers, as the hard mask for the pattern transfer. Details can be found in the references cited. a measure of variation in width of the line expressed in 3r, while LER is the variation from a straight line along the edge of the line (also expressed in 3r). 359, 360 The guidelines for low-frequency LWR are generally less than 8% of the CD, 346 and it can occur immediately after the resist is patterned, with amplification during the pattern-transfer. 361 Low-frequency line-edge roughness is linked to fluctuation in CD, 359, 360 while high frequency LER in polysilicon gate can lead to enhanced lateral diffusion, ultimately causing reduced channel-length. 362 The roughness is an issue that is becoming more critical as CD decreases, since LWR does not scale with linewidth shrink. In the etch process, it is associated with plasma interaction with the photoresist, 361 which roughens the surface, thus causing edge roughness, especially if the resist is chemically amplified. 363, 364 Ion bombardment, plasma radicals, and polymers deposited during the etch are claimed to be contributors to the resist roughness. 363 Also, photons and heat have been shown to synergistically cause LER. 361, 364 Although the etch process can be tuned to reduce the effect, 365 it is clear that more robust resists are necessary in the future to reduce LWR and LER in the patterned resist as well as in the postetch pattern. Nonchemically amplified resists are claimed to be the solution for the problem, but they may present a different set of challenges for the etching process. One option considered is the use of inorganic resists, 366 which, depending on the material, may be easy or difficult to remove after the etch. For instance, one candidate contains both Hf and Zr and exhibits >7:1 SiO 2 :resist selectivity, 367 so removal of the material after the pattern transfer may be a challenge.
C. New materials
Silicon continues to be the substrate material for the foreseeable future, but SiC is emerging as the material of choice in some applications. 368 In silicon devices, other materials are expected to be incorporated in the future. Alternate channel materials with high mobility are of great interest, and their integration to large-scale devices may be quite challenging. The list of materials of interest is long and includes n-InGaAs and p-Ge in the near future, and other III-V materials, n-Ge, carbon nanotubes, and graphene in future technologies. 344 All of these materials are expected to be grown epitaxially (there may be other options for graphene 369 ), but forming a good contact to some of these materials (e.g., graphene) is a major challenge.
The implementation of high-j materials as the gate dielectrics has addressed the high leakage current density associated with thin effective-oxide-thickness (EOT), but future devices may require even higher-j materials [e.g., TiO 2 (Ref. 344 )] than the current Hf based dielectric to reduce EOT even further. Removal of these materials from the substrate with no appreciable damage will have to be addressed.
Although there is some discussion on using carbon nanotubes 345 as a potential interconnect material, in all likelihood copper will be the conductor of choice for some time, and the focus for improved performance will be the reduction of the dielectric-constant of the interlevel dielectric. The porevolume in the dielectric is expected to increase, leading to j values below 2.0. Etching the highly porous material, postetch-cleans as well as its integration into current metallization scheme are doable, but may not be simple extension of current processes. One material is pure-silica-zeolite 370 which, depending on its porosity, could reach j values below 2.3, but with an increase in leakage current. Alternately, a nonporous low-j dielectric can be used. A polycarbosilanebased dielectric with a j value as low as 2.32 and low leakage current is one choice. 371 The advantage of this material is that once it cured, it resists copper diffusion into the dielectric (attributed to the lack of oxygen in the dielectric), thus enabling copper metallization without the need for a barrier material in the vias and trenches. As a result, lower resistivity interconnects can be realized with this material compared to a dielectric where a barrier material is needed. 342 A carbon-rich fluorocarbon material with k < 2 also has been reported. 372 The material is deposited by a plasma discharge of C 5 F 8 and Ar. It exhibits good adhesion to the SiCN barrier layer, and good thermal and mechanical properties. The ultimate goal for reduction of j value is a number close to 1.0. This will be discussed in the following section.
High-j dielectrics have been introduced to DRAMs as well to maintain the capacitance as the device dimensions shrink. Current capacitor structures utilize TiN as the electrode material with high-j insulator as the dielectric (TiN/insulator/TiN or TIT structure). 373 Hf x Al y O z dielectrics were used initially, 373, 374 but they have been replaced by ZrO 2 /Al 2 O 3 /ZrO 2 (ZAZ) 373, 375 at the 40-30 nm half-pitch technology nodes, to achieve EOT of 6.3 Å . 375 Future replacement with perovskite structure materials will have a higher j value. 376 Etching of these materials is not required, since they are incorporated in a capacitor structure that currently requires formation of cylindrical cavities in oxide, with no need to etch these dielectrics (see discussion on structures below).
Other memory devices in the near future, phase-change memories (PCM, known also as phase-change random access memory, or PRAM devices), utilize changes in resistivity of chalcogenides in their amorphous and crystalline states to create nonvolatile memories. 340, 344 These devices are seen as a future replacement of Flash memory, and in some applications, as replacements for DRAM and static random access memory. PCM devices are already commercially available for mobile applications, with endurance of 10 6 write-cycles in current product offering, 377 and a potential of increased reliability in future devices (6.5 Â 10 15 cycles in one recent publication). 378 It seems that most of the new materials to be introduced in future devices will not require new etching processes and they will be incorporated into structures formed in traditional materials.
D. New structures
The ITRS lists a large number of emerging devices that are in different stages of research or development. Obviously, only a small number will be commercialized. At this stage, it is not clear what role plasma etching will play in the fabrication of these devices. In the following discussion, we will concentrate on structures where plasma etching may be important.
3D structures are expected in most if not all devices in the near and distant future. FinFETs are now used in manufacturing of advanced ICs and are expected to become more ubiquitous. The current trigate structure is expected evolve to "Pi-Gate," "Omega-FET" and "Gate-All Around" structures (see discussion of FinFETs in Sec. V C). In forming these structures, control of the damage to the underlying substrate will be highly important. Various modifications of FinFETs have been implemented for DRAM applications. The combined structure of saddle FinFET (S-Fin) 379 and Recessed Channel Array Transistor 380 or R-FinFET seem to be the preferred structure for current and future generation of DRAMs. 381 The DRAM capacitor has evolved over time too. Since minimum capacitance of roughly 25-30 fF per cell is required regardless of the technology node (as determined by the sensing circuit, data retention requirements, and single disrupting events, such as alpha particles and cosmic radiation 382 ), the capacitor structure has evolved to maintain this minimum value with reduction in the area occupied on the device. The initial planar capacitor was replaced by a trench 382 and stacked capacitors. 382, 383 The latter has evolved with time to a 3D, cylindrical-cavity structure 383 with the use of high-j dielectrics. The capacitor structure is expected to change to a pillar type as device dimensions and dielectric thickness shrink further. 344 Bonding of multiple DRAM chips vertically (with the use of through-wafer-vias) to form high-capacity devices is also expected. [383] [384] [385] In interlevel metallization, the goal is to lower the dielectric constant with the ultimate goal of j close to 1.0. One way to achieve this is by using nonconformal interdielectric deposition that will lead to void formation between the metal lines. [386] [387] [388] [389] In this approach, the dielectric between the metal interconnect is etched away, followed by the nonconformal deposition of the dielectric. An alternate approach is to use a sacrificial material to build the entire metallization structure, to be removed (at least partially) later-on, 390 creating a structure that is generally devoid of solid dielectric between the lines. Reactive gas can be used to remove some sacrificial materials. 387, 391, 392 Other sacrificial materials can be removed by thermal decomposition, with the resulting gases diffusing through the capping layer. [393] [394] [395] Regardless of the method used, numerous etching, integration, packaging, and reliability issues must be addressed before this scheme becomes viable.
Quantum computing is an area of active research that promises to deliver future computing power far exceeding what is available today. 396, 397 Some of the approaches are based on a nuclear spin of 31 P donor in silicon, [398] [399] [400] quantum dots, [401] [402] [403] and ion traps, [404] [405] [406] [407] [408] [409] [410] and many of the fabrication steps are similar to the ones used in IC and MEMS fabrication today.
Clearly, regardless the path chosen from the choices above, device dimensions will continue to shrink, and the requirements for high-fidelity pattern transfer as well as reduction in damage and particle formation will become ever more stringent. As new 3D structures are introduced, selective and residue-free etch processes will be even more challenging.
E. Improved processes and equipment
One obvious source of concern for future device technologies is the particles that may be generated in the chamber and end up on the wafer before or during etching. Since the size of particles must be much smaller than the smallest device dimension, the fraction of particles in a given size distribution that qualify as "killer defects" keeps increasing over time. Particles are generated by the process, as deposits flake from the chamber walls and fall on the wafer. They can also arise from erosion of materials in the chamber. Process generated particles can be addressed by frequent cleaning of the chamber wall (e.g., between wafers), or by trapping the particles in a polymer coating. 54 The approach taken will be process dependent. Particles generated by the degradation of material in the chamber can be mitigated by choice of better materials and/or replacement of components in the chamber at frequent intervals. Chamber materials, including coatings, will likely become even more important in the future for both particle generation and for consistency in etching behavior over time.
Better control of ion and neutral fluxes as well as ion energy will be required to address issues such as CD control, ARDE, line-edge roughness, and lattice damage due to ion bombardment. In addition, improvements in the precise control of amount of material removed and etch uniformity will be needed.
Since vertical dimensions (with the exception of the photoresist mask) do not decrease as much as the width of devices shrinks, aspect ratios of contacts and trenches continue to increase. Therefore, ARDE control is important both to compensate for variation in the lithography-defined features as well as changing aspect-ratio with time. In principle, longer etch-time may be used to compensate for the etch-rate reduction; however, the thin photoresist may erode before the etch process is complete. It has been shown that control of the ion/neutral ratio is the key to reduction of aspect-ratio dependent etch rate of dielectrics. 215, 217 This was accomplished by changing process parameters, such as pressure and fluorocarbon feed gas fraction. 217 However, these conditions tend to lead to high polymer deposition, which can suppress etching entirely, and do not offer sufficient selectivity to current thinner photoresist. Bias power pulsing seems to offer an additional means for controlling ARDE, 53, 54 without changing pressure or gas flows, by operating in an ionlimited regime. 162, 411 In this case, the neutral flux stays nearly constant, but ion-flux is controlled by the on/off cycle of the bias. Although etching rates may still be decreased by the increased aspect-ratio, the required longer etch times are not prohibitive. With bias pulsing, photoresist erosion is reduced more than the dielectric etching rates, increasing selectivity to photoresist as compared to continuous-wave plasmas.
As discussed above, line-edge roughness has been always a concern, but it has become more acute as device dimensions shrink. Roughness along the edges of photoresist has been shown to be induced by a combination of exposure to UV light, ion bombardment, and heat. 80, 412 Preliminary results have shown 53 that time modulation of neutrals, where conditions are repeatedly switched from depositing polymer on the sides of the photoresist (and tops) to etching, can prevent rough edges on the photoresist. This is done while keeping the ion flux constant.
Using pulsed plasmas may be the potential solution to other concerns too, such as surface damage to the layer exposed to the plasma. In the past, the solution has been to remove the damaged layer either by chemical etching (e.g., downstream etching) or by oxidizing the surface and removing the oxide in HF solution. This loss of material is becoming less tolerable now and will be too high a price to pay in future device technologies.
Two general areas of research have re-emerged to address many of these issues: time-modulated plasmas and neutral beam etching. Pulsed plasmas have been an active area of research for many years. [413] [414] [415] [416] [417] [418] [419] [420] [421] [422] [423] [424] [425] [426] [427] [428] [429] [430] In most of these early studies, the plasma-generating power was modulated; in a few instances, pulsed bias was also investigated. RF or microwave power is modulated at frequencies of typically $10 kHz, and $50% duty cycle. Within $10 ls of the power being turned off, T e drops from several eV to $1 eV and then more slowly to <0.5 eV. During this time, the positive ion density decays only some near the center of the plasma, but can stay constant or even increase near the edge of the plasma. 429, 431 If the plasma is in a highly electronegative gas (e.g., Cl 2 ), then the electron density will decay rapidly in the afterglow (power off) period, due to dissociative attachment and formation of negative ions (e.g., Cl À ). Deep into the afterglow period in such plasmas, the negative ion density can greatly exceed the electron density, causing the sheaths to collapse and the negative ions can then reach surfaces, as they are no longer repelled by the negative potential difference between surfaces and the plasma. If, under these ion--ion plasma conditions, a positive bias is applied to the substrate (usually the positive period of RF bias at frequencies well below the ion response frequency), then negative ions can be accelerated to the substrate surface. 430 With negative bias, positive ions bombard the surface, so with equal portion positive and negative bias at low RF bias frequencies, the differential charging problems obtained with electrons as the negative charge carrier can be greatly reduced. 432 Apparently, this approach has not been used in commercial etching processes to date, but could become important again in the future.
Pulsed plasmas also allow very narrow ion energy distributions to be obtained for most of the afterglow period. 433, 434 By applying bias synchronously in the afterglow, a nearly monoenergetic IED can be obtained, as shown in Fig. 50 . The wafer will of course be bombarded by ions during the power-on portion of the cycle, but with the bias off, and the pressure high enough that T e < 2, the ion energy will not exceed $10 eV (the low energy portion of the IED in Fig. 50 ). This could allow very high selectivities to be obtained by tuning the IED to be above threshold for one material (e.g., Si) and below that for another (e.g., SiO 2 ), making it possible to obtain near-infinite selectivities. (Of course etching rates will be low, but for many applications this is not an important constraint.) As discussed above, when Si is etched in a Cl 2 /Ar plasma under these conditions (Fig. 20) , the etching rate increases with ion energy above a threshold energy in a manner that is consistent with beam studies, but with a substantial etching below the ion threshold energy. This component has been ascribed to etching assisted by the light generated by the plasma, mainly in the VUV. 167 It should be noted that pulsed plasmas produce little VUV light in the off portion, and therefore less average VUV light while maintaining high positive ion density. The ion flux during the off portion is reduced by $T e 1/2 , so there should be a net gain in the ion-to-VUV photon flux ratio for pulsed plasmas, perhaps suppressing photo-assisted etching, photoresist degradation, and defect generation in sensitive regions of circuits. This could also be of added importance in future devices with small dimensions.
Synchronization of source and bias powers 435 as well as tailored the waveforms 322 have also been explored specifically for finer control of etching processes. The idea behind the tailored bias waveform is to invert the problem of letting the waveform determine the IED. Instead choose a desired IED and then determine the waveform that will produce that IED, synthesize it, and apply it to the substrate. 436 Another way to suppress all forms of plasma damage associated with positive ions, electrons, or photons is to bring the substrate out of the plasma and use directed, energetic neutral beams instead of ions to induce etching. Since positive ions are neutralized before striking the surface, the energy dependence of the etching yields for neutrals is expected to be the same as those of ions. Neutral beam assisted etching was explored in the mid 1990s and then largely abandoned. [437] [438] [439] Recently, however, there has been a re-emergence of interest in neutral beam etching. These studies extract neutral beams by allowing ions extracted from a plasma to make glancing angle collisions with the walls of high aspect ratio holes in a grid. 440, 441 One such system for etching is shown in Fig. 51 . 442 Either positive or negative ions from an ICP are allowed to enter a high aspect ratio grid plate. For negative ions in a strongly attaching gas like chlorine, a pulsed plasma must be used and negative ions are extracted deep in the afterglow. The ions entering the grid are accelerated by an imposed potential, and are neutralized by specular collision with the grid wall, forming a highly directed, energetic neutral beam. The beam can retain a large portion of the velocity of the incoming ion. 443 This mostly neutral energetic beam, along with background gas, enters the processing chamber below. The etching mechanism is similar to the synergistic ionassisted process, but with few charged species. These conditions have been used to etch various nanostructures including Si IC FinFETS 440 and Si quantum dots, 440 as well as GaAs/AlGaAs heterojunction nanopillars. 444 A transmission electron micrograph (TEM) of the Si quantum dot structure is reproduced in Fig. 52 . The mask for this structure was ferritin, a 7 nm diameter spherical protein. The TEM reveals little if any disruption of the Si crystal lattice below the sidewall surfaces. The authors conclude that neutral beam etching creates such little damage because the VUV light levels are so much lower than in conventional plasma etching.
Atomic layer deposition (ALD) has become a mainstream technology for high-j dielectrics. In this process, two alternating half steps are carrier out to first deposit a metal and then oxidize it. Each half step is self limited. The metal deposition follows Langmuir-Hinshelwood kinetics, leaving the surface covered with a saturated layer of metal that is up to one monolayer. This metal layer is then completely oxidized in the second half step. Rapid gas pulsing brings the process from one step to the next. The process is repeated many times to grow precisely the same amount of material all over the wafer.
Atomic layer etching (ALET), the analog to ALD, is attractive for similar reasons. For example, if an etching process could be made self-limiting in the same manner, it could eliminate ARDE. ALEt (sometimes referred to as "digital etching") was first reported for GaAs etching with alternating Cl 2 adsorption and electron beam etching. 445 Sasaue et al. used ion bombardment to effect ALEt of silicon, but the etch rate per cycle was less than a monolayer. 446 ALEt of silicon with one monolayer etched per cycle was achieved by Athavale and Economou. 447 Their approach, depicted in Fig. 53 , consisted of four steps: 448 (1) Exposure of a clean substrate to a reactant gas, and adsorption of the gas onto the surface (chemisorption step). The reactant gas flow (chlorine in this case) is turned ON only during this chemisorption step. This process is self-limiting; chemisorption stops when all available surface sites are occupied. monolayer precision is not possible.) (2) Purging of excess Cl 2 gas with inert (Ar) gas. (3) Exposure of the surface to ion bombardment in an Ar plasma, to induce chemical sputtering of the surface chlorinated layer (etching step). Ideally, this process is also self-limiting; ions react only with substrate atoms bonded to the chemisorbed gas. Once this top chlorinated layer is removed, further etching (physical sputtering now) of the substrate must not occur or be very slow. (4) Evacuation of the chamber to exhaust the etching products. If the periods of chemisorption (step 1) and etching (step 3) are long enough, the etching rate approaches one atomic layer per cycle, where the atomic layer thickness is that of the chlorinated layer, and not necessarily one monolayer of the substrate. If the substrate surface remains (atomically) smooth during repeated ALEt cycling, it is possible to achieve the ideal condition of removal of exactly one monolayer of the substrate per cycle. This approach to ALEt process requires a very long $150 s per cycle. 449 While pulsed gas valves and fast switching mass flow controllers have improved in recent years, the cycle time is too low for this process to be widely applicable. Even with very fast, stable flow switching, the sticky halogen compounds [etching products 121, 122 and even Cl 2 (Ref. 143 )] demand long purging times. Recently, a new method has been proposed in which rapid ALEt is achieved by pulsed bias. 53, 450, 451 The idea is to form a self-limited halogenated surface layer in the plasma with no bias on the substrate so that no etching occurs (chemisorption step). A bias pulse is then initiated and the halogenated layer is sputtered faster than it can reform (etching step). Etching will greatly slow after this layer is removed provided the ion energy is below the sputtering threshold. Bias is then turned off and a halogenated layer forms again (chemisorption step).
Finally, it is very difficult to predict what is beyond silicon CMOS devices and what role plasma etching will play in that technology. Such devices will have dimensions much smaller than conventional ICs. The smallest components will likely consist of single monolayers (e.g., graphene 452 ), molecules 453 (e.g., carbon nanotubes 454 ), or even single atoms. 455 As these emerging technologies find their way to commercial devices, the need for conventional circuitry and processing will likely remain for a long time. For example, an IC with carbon nanotube FETs still needs metallization, interlayer dielectric layers, vias, contacts, etc. If some future device required placing single molecules such as 1,4-benzenedithiol between two electrodes in trillions of locations, the need would still exist to fabrication those electrodes and create a separation between the electrodes (in this example, 0.85 nm) required for the covalent bond linkage. 453 It is highly unlikely that a fully bottom-up approach will ever be able to produce an advanced integrated circuit, hence some form of top-down fabrication is likely to continue and highly selective plasmas processes will be called upon to carry out patterning even on this scale. Low energy ion bombardment would be essential, but ultimately more processes may demand novel sidewall deposition of composite layers followed by isotropic etching to remove material sandwiched between two closely spaced layers.
VII. SUMMARY
In the 1970s, plasma etching became an essential method for pattern transfer for silicon integrated circuits. As circuitry has become ever more complex with no sign of reaching the end of the roadmap for reduction in feature dimensions, the need for and importance of plasma etching continues to increase.
The need for plasma etching began roughly forty years ago when the undercutting of etch masks inherent in wet etching methods was no longer tolerable for transistor and interconnect formation. Silicon etching for transistor fabrication began in fluorine atom-generating plasmas such as CF 4 / O 2 , but it was quickly realized that the undercutting by F atoms was not desirable and much better profiles were possible in chlorine-containing plasmas. Parallel plate, capacitively coupled plasma reactors were used first for silicon etching but have largely been replaced by higher density inductively coupled or microwave-generated plasmas. Further improvements were realized with the addition of HBr to Cl 2 . The etching of poly-Si and single crystal Si proceeds by a mechanism in which positive ions are accelerated across the sheath that develops adjacent to the wafer. The voltage drop across this sheath (and hence ion energy) is controlled by a separate bias power applied to the substrate stage. This energetic ion bombardment causes a disruption of a halogenated chemisorbed layer and induces chemical reactions that lead to the formation of volatile products.
Anisotropic etching of aluminum for interconnects was also developed with chlorine-based CCP plasmas and later also migrated to higher density ICPs. Here the mechanism is quite different. Cl and Cl 2 react readily with Al in the absence of ion bombardment, which will lead to severe undercutting of masks. To prevent this, species such as BCl 3 feed gas or CCl x fragments of photoresist erosion are introduced and bare Al surfaces become coated with a passivating layer that prevents chemical etching of Al by Cl and Cl 2 . Positive ion bombardment keeps horizontal surfaces relatively clean, allowing chemical etching to occur, with some added enhancement by ion bombardment. On vertical surfaces, the passivation layer prevents etching and anisotropic profiles are obtained.
Silicon dioxide etching for providing patterned insulating layers between interconnecting Al wires and Si transistors began around the same time. Fluorocarbon-containing plasmas emerged and remain the only way of achieving anisotropic etching that is selective toward Si. SiO 2 etching evolved from CCP to ICP and back to CCP plasmas. The reemergence of CCP for SiO 2 etching has been accompanied by the use of radio frequency power at two (or even three) frequencies. The mechanism for etching of SiO 2 has been well studied, due to its importance and highly complex nature. Etching occurs in the presence of a thin fluorocarbon film that also inhibits unwanted etching of Si. Ion bombardment of this fluorocarbon layer causes reactions to occur that lead to the generation of volatile SiF 4 , CO, CO 2 , and perhaps other etching products that must then diffuse out of the layer.
Recently, insulating materials with dielectric constants lower than SiO 2 have emerged. These films usually contain Si, C, and O and often have voids to further reduce the dielectric constant. Fluorocarbon plasmas etch these materials with a mechanism that is similar to SiO 2 . The same equipment that is used to etch SiO 2 is also used to etch these low-j materials.
Following the development of processes for Si, Al, and SiO 2 patterning, plasma etching was quickly called upon for etching a host of other conducting and insulating materials. For the first 25 years or so, plasma etching was used strictly for transferring patterns from photoresists to these materials, some of which (e.g., SiO 2 and amorphous Si) were used as hard masks for delineating underlying layers. In the last roughly 5-10 years, however, many applications have begun to emerge where plasma etching is taking on more of the task traditionally carried out by photolithography. These involve using a starting structure with a relatively wide linewidth and then creating one, two, or even three narrower lines by trimming processes or by depositing thin layers on the sides of lines that are then removed. All indications are that the number of such processes will only increase in the future.
The future needs in plasma etching will be for evermore tighter control of process variability, higher selectivity and less damage. This may require one or more of the following: migration to pulsed plasmas, lower ion energies, tighter control of ion energy distributions, and reduced photon fluxes. Evolutionary transitions to atomic layer etching or neutral beam etching could become necessary if sensitive devices can no longer tolerate monolayer-scale damage produced by continuously immersing substrates in the plasma.
Future devices will certainly have smaller critical dimensions, will incorporate new materials and structures, and will be fabricated on larger wafers. Although self-assembly is considered as for some structures and materials, dry etching will still be used for most of the pattern transfer of the evershrinking lithographic features in the foreseeable future. In some cases, new materials will be incorporated in cavities formed in traditional semiconductor materials, and in other cases, these materials will require dry etching, and new etching processes will have to be developed. The choice of structures and materials will be influenced greatly by the capabilities of the dry-etching processes and equipment on hand. Control of selectivity (to the substrate as well as the mask), profile and CD control, lattice damage, plasma damage (which may be enhanced by photon flux), particle formation, process reproducibility, and equipment reliability will dominate future etching technologies and equipment.
Plasma etching technology has evolved from a manually loaded quartz tube with a coil wound around it to sophisticated automatic multimillion dollar machines, with advanced equipment and process control. This evolution continues.
