Quilt Packaging (QP), a novel chip-to-chip communication paradigm for system-in-package integration, is presented. By forming protruding metal nodules along the edges of the chips and interconnecting integrated circuits (ICs) through them, QP offers an approach to ameliorate the I/O speed bottleneck. A fabrication process that includes deep reactive ion etching, electroplating, and chemical-mechanical polishing is demonstrated. As a low-temperature process, it can be easily integrated into a standard IC fabrication process. Three-dimensional electromagnetic simulations of coplanar waveguide QP structures have been performed, and geometries intended to improve impedance matching at the interface between the on-chip interconnects and the chip-to-chip nodule structures were evaluated. Test chips with 100 µm wide nodules were fabricated on silicon substrates, and s-parameters of chip-to-chip interconnects were measured. The insertion loss of the chip-to-chip interconnects was as low as 0.2 dB at 40 GHz. Simulations of 20 µm wide QP structures suggest that the bandwidth of the inter-chip nodules is expected to be above 200 GHz.
Introduction
As state-of-the-art transistor features shrink to 45 nm or smaller, and incorporate strained and SiGe layers on silicon [1] , chip density and performance are improved. However, not all of the performance enhancement can be passed to the system level due to packaging limitations, especially at multi-GHz clock rates [2] . Besides, the demand by consumer electronics for high speed, low power consumption (longer battery life), low cost, and better portability presents challenges to conventional packaging techniques. Several approaches, including system-on-chip (SoC), system-in-package (SiP) [3] and system-onpackaging (SoP) [4] [5] have been actively pursued. SoC combines several kinds of functional blocks into one chip to form a complete system that offers the most compact and light-weight design, but also faces the challenges of long design times, integration complexity, low yield, and intellectual property issues. SiP incorporates multiple chips into a single package, which alleviates some of the problems faced in SoC. There are about 30 IC and packaging companies gearing up to produce SiP-based multichip modules [4] . Some of these technologies being developed to especially address throughput issues include IBM's "Transfer and Join" [6] , Sun Microsystem's "Proximity Communications" [7] , SiliconPipe's "Off-theTop" [8] , and Freescale's "Redistributed Chip Package" [9] . SoP is a variation of SiP that utilizes thin film technology to combine passives into the package.
We have proposed a novel SiP method, called "Quilt Packaging" (QP) for interconnecting ICs that promises to circumvent the critical speed bottleneck in chip-to-chip communications [10] . Potentially, thousands of small metal nodules can be formed at the chip edges during IC fabrication, and once the ICs are separated the nodules protrude beyond of the edges of the chips as contacts. Chipsets are then connected by butting chips against each other and soldering the nodules. A basic schematic of two chips communicating through QP is shown in Figure 1 The notch at the forefront is schematic only and reveals the depth of the metal nodules into the substrate. Figure 1 (b) shows a three-chip system interconnected by QP. The bonding pads shown in the figure (or, alternatively, solder bumps) suggest that QP can be used in combination with existing packaging techniques. Compared with the techniques currently pursued by industry, QP provides the shortest path (< 50 µm) between chips and leads to minimum delay, wider bandwidth and lower electrical noise [11] . Additionally, as will be shown in the fabrication process for QP, nodules are defined by standard photolithography and can achieve very high input/output (I/O) density. Furthermore, as much as 50% of generated heat and silicon die area can be attributed to the I/O buffers for high performance application specific integrated circuits. By connecting through QP, package capacitancesand therefore pin drivers -can be eliminated, effecting a reduction in power consumption.
Fabrication of QP
A fabrication process flow for QP structures is presented in Figures 2 (a) -(g) . After the IC front-end processing on the wafer is completed, nodules are defined along the edges by standard photolithography processing, and etched by deep reactive ion etching (DRIE) to create trenches for the nodules (Figure 2a) . A layer of plasma enhanced chemical vapor deposition (PECVD) SiO 2 is deposited to form an isolation layer inside the trenches between the silicon substrate and the metal contacts formed later (Figure 2b) . A thin Ti/Cu seed layer is then sputtered, and copper electroplating is performed to fill the trenches (Figure 2c ). ). DRIE etches all the way through the wafer to undercut part of the nodules and separate the chips. The thick photoresist mask is consumed partly in the meantime (Figure 2f ). Sn, electrolessly plated on the chips, coats only the copper surface at the protruding nodule sidewall. After stripping the photoresist, two or more chips are connected by soldering them together to form the QP structure ("quilt") ( Figure 2g ).
We use the Bosch DRIE process [12] , which alternates between an SF 6 etch cycle and a C 4 F 8 sidewall passivation cycle. Both anisotropic etching to define the nodules and reentrant etching to expose copper nodules while separating the chips can be achieved by tuning the process parameters [13] .
There are at least two ways that die separation for QP can be achieved. The first is called "dicing by thinning" [14] [15], in which the wafers are thinned by back side grinding and polishing to separate the dice along the previously-etched trenches on the front side of the wafers. The vertical alignment of nodules depends on the control of wafer thickness after grinding, which is estimated at better than 1 µm [16] . We used the second method, which is to DRIE all the way through the wafer for die separation. Figure 3 shows the cross section of a separated chip that has an undercut profile so as to ensure that the nodules will contact first as the dice are pushed together. The thickness of the wafer shown here is about 600 µm. The etch profile was achieved by the Bosch process in an Alcatel A601E inductively coupled plasma etcher with SF 6 flow rate at 300 sccm, C 4 F 8 at 130 sccm, with cycle durations at 7 s and 2 s respectively. The automatic pressure control position was at 23%. The plasma was generated by a source power of 1800 W, and the substrate power was 80 W. The temperature on the surface of the wafer during processing was maintained at 20 deg. C.
By using DRIE die separation, the edges of ICs are no longer required to be straight, which is a limitation in conventional dicing saw separation. Self-aligned structures can be formed to help align the chips. Two approaches, interlocking self-aligning silicon structures and keyed copper nodules, were demonstrated. The devices we tested used the keyed nodules approach. Shown in Figure 2 (e), the thick positive photoresist used as the protection mask layer was AZ® P4620 spun four times to form a layer about 16 µm thick, which was sufficient to survive the long DRIE die-separation step. The electroless Sn plating solution is from Transene. Figure 5 shows die photos of two chips connected through QP. Coplanar waveguide (CPW) microwave test patterns were fabricated. The center signal lines have widths of 100 µm (Figure 5a) . A closer look at the CPW QP connection is shown in Figure 5 (b). The detailed design of CPW QP prototypes will be discussed below.
The fabrication of QP is a low-temperature process, and can be easily integrated into a standard IC fabrication process. Only two additional masks, one for defining the nodules and the other one for die separation, are needed. 
Microwave Measurements and Simulations
Since the QP structure results in a short extension of interconnects from one chip to another in almost the same planar geometry, the electromagnetic discontinuities inherent in conventional wire-bond or flip-chip packaging technologies can be nearly eliminated, which leads to resonance-free, high-speed interconnects that minimize signal and ground bounce, overshoot and ringing. The only sources of electromagnetic discontinuity in QP interconnects are from the thick, short copper nodules (~ 80 µm per chip) inlaid inside the silicon substrate and the nodule bridge between the two chips (< 25 µm per chip).
Three CPW QP geometries along with on-chip interconnect (without nodules), as shown in Figure 6 , were fabricated. Figure 6 (a) is a "simple QP" prototype. Figure 6 (b) is "QP improved 1," which has a tapered nodule signal line to reduce capacitance and improve impedance matching between the on-chip interconnect and the nodule structure. Figure 6 (c) is "QP improved 2," which has both a tapered signal nodule and a tapered ground plane to further reduce the interface discontinuity. The silicon substrate has a relative permittivity of 11.9, resistivity of about 10 Ω • cm, and thickness of around 600 µm. The isolation SiO 2 has a thickness of 1 µm. The metal (Cu) on-chip interconnect is 0.6 µm thick. The depth of the nodules for both signal and ground is about 20 µm. Test chips with signal nodule widths of 100 µm were fabricated following the processes described in Figure 2 , except for one variation. Instead of soldering by melting plated Sn, the chips were pushed together, fixed in place, and about 2 µm of electroless Sn was plated at the interfaces to ensure good electrical contact. The connections in Figure 5 were made with this variation. The QP geometries are summarized in Table I . QP structures I, II, III correspond to simple QP, QP improved 1 and QP improved 2, respectively, as shown in Figures 6 (a) -(c) . Figure 7 . In all cases, the "improved" QP geometries provide better impedance matching and loss characteristics than the "simple QP" structure. A better than 18 dB return loss and less than 0.2 dB insertion loss at 40 GHz are achieved.
Table I Dimensions of the 100 µm QP interconnects (Unit: µm)
Quilt Packaging offers a highly-optimized chip-to-chip interconnect in terms of performance and overall benefits. In a system using QP technology, signals that originate near the edges of the dice need to traverse a very short distance between them, so that latency would be extremely low. A worst-case scenario would be signals that originate deep in the interior of a chip and must terminate deep within the receiving chip. Even in such cases, the combination of using a low-loss upper metal level interconnect in conjunction with QP will offer a significant advantage over a "standard" signal path through a solder bump to a package substrate and back again. The advantage to QP is that the connection between ICs is now direct without the need to traverse package structures such as leads, bumps, and package wiring.
Here, we compare our measured performance of QP to alternative structures. Devlin et al. [17] showed that a 25.4 µm (1 mil) diameter, 0.3 nH bond wire introduced an insertion loss of 2 dB at 40 GHz. Braunisch et al. [18] demonstrated solder joints with an insertion loss of more than 1 dB at 40 GHz. When considering the die-package interface with additional pad capacitance (C = 300 fF), the insertion loss at 40 GHz increased to about 4 dB. Mantysalo and Ristolainen [19] investigated stacked 3-D package interconnects. A single solder-plated polymer ball introduced 0.3 dB insertion loss and 15 dB return loss at 10 GHz, while a ball-via-ball structure introduced more than 0.5 dB insertion loss and about 12 dB return loss at 10 GHz. Pfeiffer and Chandrasekhar [20] showed that a 30 µm high, 90 µm wide gold stud bump for flip-chip interconnects introduced 0.35 dB insertion loss at 30 GHz. Above 30 GHz, the insertion loss was predominated by the onset of a structural resonance. Banerjee and Drayton [21] designed and measured both monolithic (on the same substrate) and hybrid (on different substrates, similar to QP) packages built on high-resistivity silicon substrates (> 2000 Ω • cm) using double aluminum wirebonds. The insertion loss due to the wirebonds was 0.15 dB for the monolithic package at 50 GHz and 0.5 dB for the hybrid package at 40 GHz, respectively. Lahiji et al. [22] presented low-loss multiwafer vertical interconnects built on 100 µm thick, high-resistivity silicon (> 2000 Ω • cm) and GaAs substrates. At 20 GHz, the vertical vias in the two silicon designs showed 0.12 and 0.38 dB insertion loss and 12.9 and 17.3 dB return loss, respectively, and in the GaAs design, insertion loss was 0.2 dB and return loss was 13.6 dB. It is not surprising that excellent performance was obtained in 3-dimensional packaging systems, given the short distances involved in these direct chip-to-chip interconnects. Considering the costs and benefits of the myriad options in chip-to-chip interconnects, QP offers extremely high performance as well as many other advantages [10] .
From Figure 7 , we see that the insertion loss has a moderate increase at low frequencies (< 3 GHz). This is attributed to dielectric loss in the low-resistivity substrate. QP interconnects fabricated on high-resistivity substrates (~ 8000 Ω • cm) do not exhibit this low-frequency dispersion.
It should be noted that due to the 100 µm nodule size, the characteristic impedance of the QP interconnects is not optimized. For the nodules, Z 0 ≈ 47 Ω, and so the measured insertion loss and return loss include the effects for this modest impedance mismatch.
Table II Dimensions of the 20 µm QP interconnects (Unit: µm)
QP structures with 20 µm wide signal nodules on low-and high-resistivity silicon substrates were simulated over a broader frequency range (1 -200 GHz) using a threedimensional full-wave electromagnetic simulator [23] . The detailed dimensions used in the simulations are listed in Table II . QP structures I, II and III again correspond to the simple QP, QP improved 1 and QP improved 2, respectively. Figure 8 shows the simulated return loss and insertion loss of the CPW QP structures after de-embedding. The insertion loss is less than 1.2 dB over the whole frequency range, which demonstrates that the bandwidth of QP structures is expected to be well beyond 200 GHz.
We are now working on the fabrication of QP structures with 50 µm and 20 µm wide nodules for measurements up to 110 GHz. tapered nodule signal and ground lines were designed and fabricated. Test patterns with 100 µm wide signal nodules were measured up to 40 GHz. On low-resistivity silicon substrates (10 Ω • cm), insertion loss as low as 0.2 dB was measured. High-resistivity silicon substrates can eliminate most of the dielectric loss and improve insertion loss. Our data confirm that the QP interconnect scheme is capable of wide-band chip-to-chip communications.
Nodules are defined by standard photolithography and DRIE. There is no physical limit preventing nodules from scaling down to or below 20 µm wide. Simulations show that the bandwidth of QP is well above 200 GHz. Reliability issues including thermal conductivity and expansion, and mechanical strength of QP require further investigation.
