I. INTRODUCTION

I
NTERCONNECT has long been perceived to be a bottleneck in present and future high-performance digital integrated circuits because of its inability to keep pace with advances in transistor speeds [1] . Consequently, alternative solutions such as on-chip optical interconnects have been proposed in order to avoid the problems associated with global on-chip wires altogether [2] . However, due to technology incompatibility, cost considerations, and nonnegligible delays in converting the signals between optical and electrical domains, this has not yet been shown to be practical. In this brief, we explore the velocity limitations for systems built with conventional silicon processing and show that data transmission at near the speed of light is possible in an all-electrical system.
II. LIMITATIONS OF CONVENTIONAL REPEATER-INSERTION APPROACH
The most common means of global communication is through the use of wires with appropriately spaced repeaters [3] . Based on a 0.18-m 1.8-V CMOS six-level aluminum interconnect technology, simulations were carried out in HSPICE for minimum-width (0.44-m), minimum-spacing (0.46-m) lines with varying number of repeaters for a fixed overall distance ( Fig. 1) [4] . The optimized minimum delay, including both wire and buffer delays, is about 1.35 ns for propagation over a length of 20 mm, translating into an effective velocity of about one tenth of the speed of light in silicon dioxide . This optimized delay configuration yields a power dissipation of 30 mW, assuming 1-GHz switching, with an average transition time of 200 ps. While the buffer delays can be improved by using more advanced technology and/or expending more power, the intrinsic wire delay is fundamentally limited. To understand the reasons for this, the power spectral density of a 500-ps digital pulse is compared with the frequency-dependent characteristics of the aforementioned minimum-sized wire without repeaters (Fig. 2) . The digital signal is broadband in nature, while the phase velocity for signals propagating along the wire changes dramatically over this frequency range. For transmission line systems, the phase velocity, rather than the group velocity, is meaningful to consider [5] . The telegrapher's equation describes the signal propagation as a function of time: (1) where is the distance along the wire, is time, is voltage, and , , and are the resistance, inductance, and capacitance 0018-9200/03$17.00 © 2003 IEEE per unit length, respectively. This intuitively suggests that at lower frequencies, the first term on the right side of (1) dominates, and the wire behaves as a distributed network. In this regime, signals travel slowly by diffusion and undergo frequency dispersion. As the frequency increases, the inductive component of the wire [second term on the right side of (1)] begins to dominate over the resistance, and the wire behaves more as a waveguide. The high-frequency regime allows for propagation of an electromagnetic wave; consequently, the peak phase velocity is the speed of light in the dielectric surrounding the interconnect (Table I) . This is the key characteristic of the wire that can be exploited in order to achieve high-speed signal propagation. Fig. 3 shows the experimental velocity and attenuation characteristics of an on-chip 1-mm-long coplanar waveguide, with 4.8-m-wide signal, 20-m ground wires, and 5-m spacing, which is obtained through -parameter measurements [6] .
In the example given in Fig. 2 , most of the spectral components of the 500-ps pulse are in the regime of the minimum-sized wire, traveling at speeds much less than the speed of light. It is not until over 100 GHz that the velocity reaches its maximum and becomes independent of frequency. The effect on digital signals is that different frequency components travel at different speeds and attenuate at different levels, resulting in a significantly distorted output. Repeater insertion does not change the frequency characteristics of the interconnect, but can improve speed by amplifying the faster high-frequency components of the signal, when placed at appropriate intervals. This comes at the cost of increased power dissipation and a delay overhead from the intrinsic delay of the repeater.
III. EXPLOITING THE LC NATURE OF WIRES
The characteristics shown in Fig. 3 suggest that a high-speed system can be built by taking advantage of the wave nature of interconnect. At the same time, it is beneficial to eliminate the low-frequency portion of the signal that lags behind and contributes to inter-symbol interference (ISI). This can be achieved by modulating the digital data with a sufficiently high-frequency carrier, and as a result, concentrating all of the signal power in the faster regime. In order for this system to be realizable, wires must have a low crossover frequency between the and regimes. By explicitly emphasizing the parasitic inductance and reducing the resistance, this transition can be shifted into the single-gigahertz range. This design strategy of deliberately using the parasitic inductance may be foreign to designers, since the inductance for on-chip wires is often difficult to estimate. In this case, the on-chip transmission lines used have explicit ground returns that provide well-controlled and predictable inductance values.
While the high-frequency regime offers high-speed frequency-dispersionless propagation, the interconnect is substantially more lossy at higher frequencies, as demonstrated by Fig. 3 . Therefore, the wire must be optimized to minimize loss, while occupying a reasonable amount of area. Thicker top layers of metal and dielectric facilitate the realization of on-chip transmission lines [7] , [8] . For example, a microstrip structure with a 6-m-wide 2-m-thick copper signal wire on a 2-m-thick dielectric provides a loss of about 0.5 dB/mm [6] . This totals 10-15-dB loss for a cross-chip global wire. This amount of attenuation is tolerable for a system to reliably recover the signal. In this case, no gain elements, such as repeaters or amplifiers, are present or necessary along the length of interconnect. Fig. 4 shows the impact of using modulated signaling in combination with the use of low-loss interconnect. Using optimized interconnect lowers the crossover frequency between the and regimes to a few gigahertz. Using a high-frequency carrier pushes the signal spectral components to lie predominately in the high-speed inductance-dominated region.
IV. SYSTEM IMPLEMENTATION
A simple implementation of this system uses direct conversion from baseband to RF, as illustrated in Fig. 5 . The carrier frequency must be in the regime and sufficiently above the desired bandwidth of the signal. Because the overall latency of the system not only includes the time of flight across the interconnect, but also includes the transmitter and receiver delays, minimizing the delay of these components is an important criterion in designing the circuits so that the speed gains are not negated in the overall system.
As a demonstration for transmission of 1-GHz data across chip, a 7.5-GHz local oscillator (LO) carrier has been chosen. A passive ring mixer is used to perform this upconversion while minimizing power consumption [9] . This ring mixer then drives an interconnect wire that extends uninterrupted across a chip-edge length of tens of millimeters. This modulation system requires no repeaters, has low power consumption, and simplifies floorplanning since the interconnect does not need a dedicated channel of silicon to accommodate any repeaters. The transmission line is a differential microstrip topology, each with a 1-m-thick aluminum 16-m-wide signal line over 2.1-m intervening SiO layer. This microstrip topology confines the electromagnetic fields to minimize crosstalk. Additionally, this microstrip configuration suffers from less current crowding because the current flows across the entire width of the wire, even at high frequencies. On the receiver end, a double-balanced mixer downconverts the signal back to baseband with a voltage gain of 2. A sense amplifier follows to restore the signal to digital logic levels [10] . A simplified circuit schematic is shown in Fig. 6 . Additional buffers, as depicted in Fig. 5 , are used exclusively for testing and measurement purposes.
V. EXPERIMENTAL RESULTS
To demonstrate the feasibility of across-chip speed-of-light data transmission, the system has been fabricated in a TSMC 0.18-m standard logic CMOS technology with six levels of aluminum/silicon dioxide interconnect. The differential LO carrier signals and digital input pulse are generated off-chip, while the transmitter, receiver, and all other components shown in Fig. 7 . Sample as-measured input and output waveforms for system described in Fig. 6 . Fig. 7 shows sample as-measured input and output waveforms, and Fig. 8 shows a sample eye diagram of the output operating at 1 GHz. The overall delay from the test chip input to the output is 322 ps for the falling edge and 490 ps for the rising edge. This includes a delay of 123 ps due to the inverters needed for driving signals on and off of the chip for testing and measurement. Subtracting these inverter delays yields an average delay of 283 ps with a power consumption of 16.1 mW (including the power delivered by the external LO, but excluding the testing inverters). This total delay over the length of 20 mm corresponds to an effective signal propagation speed of nearly one half of . Fig. 9 shows a die photo of the 0.8-mm 5-mm test system. The interconnect is laid out in a serpentine manner around dummy fill metal.
VI. SCALABILITY OF THE SYSTEM
One of the main features of this design is the low-loss transmission line that uses wide wires to reduce the loss seen at high frequencies. Alternatively, the wire width may be reduced at the expense of increased power dissipation. As the wire width is reduced, the resistance increases, but so does the inductance, at a slightly slower rate. This indicates the crossover frequency between the and regimes is relatively insensitive to the wire width, and there is little or no delay penalty to scaling down the wire. The higher inductance translates into a higher impedance, which reduces the amount of power required to drive the interconnect. On the other hand, the loss along the length of the interconnect, which, at high frequencies, is approximately proportional to the ratio of the resistance to the characteristic impedance, has increased. Therefore, these effects compete to determine the required power. Simulations show that reducing the wire width to 4 m requires an additional 2 mW of power to produce the same signal level at the receiver input [6] .
In addition, this system design will benefit from impending technology enhancements, such as low-dielectrics, copper, more metal layers, and faster devices.
VII. COMPARISONS WITH OTHER TECHNIQUES
Some of the performance improvement in delay for modulated signaling can be attributed to the use of optimized low-loss on-chip transmission lines. Fig. 10 shows that using the same wide microstrip lines in conjunction with repeaters improves the optimal delay from 1.4 ns using minimum-sized wires to 0.4 ns, over a three times improvement. On the other hand, this comes at the cost of higher power dissipation; the power has increased from 30 to 50 mW, assuming 100% activity.
On-chip optics promise speed-of-light transmission without the frequency dispersion and loss associated with electrical wires [2] . For on-chip optical interconnects to be viable, an integrated method of transforming signals from electrical to optical domains and back again is necessary. Detailed simulations have been performed in [11] that quantify the delays due to this transformation overhead. We observe that the modulated signaling approach on electrical wires, in terms of delay, performs just as well or even better than the on-chip (Table II) . Furthermore, the metal waveguide in a silicon dioxide environment has a faster time of flight than an integrated optical polysilicon waveguide.
VIII. SUMMARY
This brief demonstrates, for the first time, the feasibility of all-electrical near speed-of-light across-chip communication by designing modulated signaling systems around optimized low-loss on-chip transmission lines. The measured effective speed is roughly one half the speed of light in oxide, for a 20-mm-long interconnect. The system consumes 16.1 mW of power and supports a bandwidth of 2 Gb/s. Furthermore, the microstrip transmission line size may be reduced in exchange for increased power consumption. When compared to conventional repeater techniques, the modulated signaling approach offers a five times improvement in delay.
