A bus-transceiver test chip in 0.13 µm CMOS achieves 3 Gb/s/ch over 10 mm long uninterrupted differential interconnects of only 0.8 µm pitch. As crosstalk would impede this high data rate, twists are used. Analysis shows that the optimal positions of the twists depend on the termination of the interconnect. Theory and measurements show that a single twist at 50% of the length in the even interconnects, two twists at 30% and 70% in the odd interconnects, and equal source and load impedances are very effective in mitigating the crosstalk.
Introduction
On-chip communication is receiving increasing attention, as (global) interconnects are rapidly becoming a speed, power and reliability bottleneck for digital systems [1]. In [2] we demonstrate a bus-transceiver test chip in 0.13 µm CMOS that uses 10 mm long uninterrupted differential interconnects of only 0.8 µm pitch (82 MHz RC-limited bandwidth) and achieves 3 Gb/s/ch. However, in [2] we do not analyze the twists that we use to cancel crosstalk. These twists are necessary because the small spacing (0.4 µm) and the 10 mm long parallel interconnects in the bus cause considerable crosstalk. This crosstalk limits the achievable data rate if it is not mitigated.
Twists are also used in CMOS memory cells to cancel crosstalk between bitlines [3]. Recently, the use of twists in on-chip global interconnects was proposed [4]. That work uses eight evenly spaced twists, thereby overlooking via resistance. In our chip, we show that only one twist in the even interconnects and two twists in the odd interconnects are sufficient. Furthermore, it turns out that the optimal positions for these twists depend on the termination of the interconnect. In section 2, the optimal positions for the twists, depending on the termination, are calculated. Section 3 shows measurement results from our test chip.
Optimal twist positions and termination
Interconnect model
Fig. 1 shows a model of the global bus (cross-section). The global bus is placed in metal 5, as we assume the top metal layer (metal 6) to be reserved for power and clock.
The transfer functions of Fig. 2 show two properties of the interconnect. First, the interconnect has a limited bandwidth of only 100 MHz (82 MHz for a differential interconnect), which limits the achievable data rate. In order to reach a data rate of 3 Gb/s, we make use of a low-ohmic $R_L$ and pulse-width (PW) equalization [2]. Second, the neighboring interconnect creates severe crosstalk. Especially for frequencies above 1 GHz, the transfer functions $H_A$ and $H_V$ are almost equal. Therefore, in order to achieve the data rate of 3 Gb/s, it is necessary to mitigate the crosstalk.
Twist analysis
The neighbor-to-neighbor crosstalk in the bus is reduced by using differential interconnects with twists. Fig. 3 shows how the twists are organized (interconnects in metal 5 and part of the twists in metal 4). The differential interconnects alternately have one twist or two twists. The positions of the twists are at $x_1 l_T$, $x_2 l_T$ and $x_3 l_T$, where $l_T$ is the total length of the interconnect.
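Taken together, the three twist positions cut the bus into four sections along its length. As a minimal sketch of that bookkeeping, the following computes the section lengths from the fractional positions. The function name `section_lengths` is hypothetical, and the example values (twists at 0.3, 0.5 and 0.7 of a 10 mm line) are the ones quoted in this paper.

```python
def section_lengths(twist_fractions, total_length_mm):
    """Return the lengths (in mm) of the sections between consecutive twists.

    twist_fractions: fractional twist positions x_i along the line (0 < x_i < 1).
    total_length_mm: total interconnect length l_T in millimetres.
    """
    fractions = sorted(twist_fractions)
    if any(not 0.0 < x < 1.0 for x in fractions):
        raise ValueError("twist positions must lie strictly between 0 and 1")
    # Section edges: start of the line, every twist, end of the line.
    edges = [0.0] + fractions + [1.0]
    return [(b - a) * total_length_mm for a, b in zip(edges, edges[1:])]


# Twists at x1 = 0.3, x2 = 0.5, x3 = 0.7 on a 10 mm interconnect.
lengths = section_lengths([0.3, 0.5, 0.7], 10.0)
print(lengths)
```

For these positions the four sections are approximately 3 mm, 2 mm, 2 mm and 3 mm long, up to floating-point rounding.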
In this section, we show how to calculate the transfer functions $H_A$ and $H_V$ for differential interconnects with twists. First, we calculate the transfer functions $H_{A+} = \mathrm{out}{+}/V_{S1}$ and $H_{A-} = \mathrm{out}{-}/V_{S1}$ with $V_{S2} = 0$ (see 
$M$ depends on the signal that is on the neighboring interconnects (Miller multiplication of the capacitance $C_M'$). The twists divide the interconnect into four sections. For every section $k$, $M$ can have a different value. These values of $M$ are shown in the table below. Also, the length of every section is given. With these values for $M$ and $l_k$, the s-parameters [5] of every section $k$ are (see bottom of Fig. 3):
Signal-to-Crosstalk-Ratio
With the help of these transfer functions, the optimal positions for x 1 , x 2 and x 3 can be found. We define the signal-to-crosstalk-ratio (SCR) as follows:
where $X(f)$ is the power spectral density of the input signal. value for $x_2$ is shifted towards 0.7 and the peak value of the SCR decreases. Note that the optimal case, one twist at $x_2 = 0.5$ with $R_L = R_S$, nicely coincides with the fact that, for the highest bandwidth, both $R_S$ and $R_L$ should be chosen low-ohmic [2]. Differential-mode (DM) crosstalk can be cancelled with the twist at $x_2$, but common-mode (CM) crosstalk remains. This can be removed by the twists at $x_1$ and $x_3$. Fig. 5 shows the SCR for both DM crosstalk and CM crosstalk as a function of $x_1$ and $x_3$ ($x_2 = 0.5$ and $R_L = R_S$). The figure shows that the DM crosstalk is cancelled if $x_3 = 1 - x_1$. On this line, the CM crosstalk is minimal at $x_1 \approx 0.3$ and $x_3 \approx 0.7$. So, the optimal twist positions are $x_1 = 0.3$, $x_2 = 0.5$ and $x_3 = 0.7$, with $R_L = R_S$. However, the SCR remains adequate (>35 dB) for large variations in $x_i$ and $R_L$ (Figs. 4 and 5).
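Once the signal and crosstalk transfer functions are sampled over frequency, an SCR can be evaluated numerically. The sketch below assumes a ratio of spectrally weighted powers, i.e. the integral of $|H_A(f)|^2 X(f)$ divided by the integral of $|H_V(f)|^2 X(f)$, expressed in dB. This form, the helper names `trapezoid` and `scr_db`, and the flat test data are illustrative assumptions and not necessarily the exact definition used here.

```python
import math


def trapezoid(y, x):
    """Trapezoidal-rule integral of samples y over abscissa x."""
    return sum(0.5 * (y[i] + y[i + 1]) * (x[i + 1] - x[i])
               for i in range(len(x) - 1))


def scr_db(freqs, h_signal, h_crosstalk, psd):
    """Assumed SCR: ratio of PSD-weighted signal power to crosstalk power, in dB.

    freqs:       frequency samples in Hz (ascending).
    h_signal:    samples of the signal transfer function (real or complex).
    h_crosstalk: samples of the crosstalk transfer function (real or complex).
    psd:         samples of the input power spectral density X(f).
    """
    signal_power = trapezoid(
        [abs(h) ** 2 * p for h, p in zip(h_signal, psd)], freqs)
    crosstalk_power = trapezoid(
        [abs(h) ** 2 * p for h, p in zip(h_crosstalk, psd)], freqs)
    return 10.0 * math.log10(signal_power / crosstalk_power)


# Illustrative data only: flat spectra, crosstalk 100x smaller in amplitude.
freqs = [0.0, 1e9, 2e9, 3e9]
example = scr_db(freqs, [1.0] * 4, [0.01] * 4, [1.0] * 4)
print(round(example, 1))
```

With an amplitude ratio of 100 between signal and crosstalk, this assumed definition gives an SCR of about 40 dB, which is a convenient sanity check before using measured or simulated transfer functions.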
3D EM-field simulations
In order to check the optimal positions, two differential interconnects have been drawn in a 3D EM-field simulator. The length $l_T$ is only 1 mm to limit the simulation time. Note that for $l_T$ = 1 mm, the crosstalk voltage is much lower than for $l_T$ = 10 mm. One of the differential interconnects has one twist and the other has two twists. Fig. 6 shows the simulated crosstalk voltage (step response) for different positions of the twists ($R_S$ = 50 Ω). For DM crosstalk, the optimal position of the twist ($x_2$) is at 0.5 for an $R_L$ of 50 Ω and between 0.6 and 0.7 for an $R_L$ of 20 kΩ. This agrees with the theory, as the model of the previous section predicts 0.5 and 0.64, respectively. For CM crosstalk, the optimal positions of the twists ($x_1$ and $x_3$) are at 0.3 and 0.7 for an $R_L$ of 50 Ω and at 0.35 and 0.8 for an $R_L$ of 20 kΩ. Again, this agrees with the theory, which predicts $x_1$ = 0.27 and $x_3$ = 0.73 for an $R_L$ of 50 Ω and $x_1$ = 0.37 and $x_3$ = 0.82 for an $R_L$ of 20 kΩ.
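The agreement between simulation and model can be made explicit by tabulating the positions quoted above. The short sketch below does this. The dictionary layout and the names `cm_positions`, `dm_positions` and `max_cm_deviation` are hypothetical, and the simulated DM optimum for 20 kΩ is kept as the quoted range rather than a single value.

```python
# CM-crosstalk twists: (3D EM simulation, analytical model) per load resistance.
cm_positions = {
    "50 ohm": {"x1": (0.30, 0.27), "x3": (0.70, 0.73)},
    "20 kohm": {"x1": (0.35, 0.37), "x3": (0.80, 0.82)},
}

# DM-crosstalk twist x2: simulated optimum as a (low, high) range, plus model value.
dm_positions = {
    "50 ohm": {"simulated_range": (0.5, 0.5), "model": 0.50},
    "20 kohm": {"simulated_range": (0.6, 0.7), "model": 0.64},
}


def max_cm_deviation(table):
    """Largest |simulation - model| difference over all CM twist positions."""
    return max(abs(sim - model)
               for rows in table.values()
               for sim, model in rows.values())


for load, entry in dm_positions.items():
    low, high = entry["simulated_range"]
    inside = low <= entry["model"] <= high
    print(f"R_L = {load}: model x2 = {entry['model']:.2f}, "
          f"within simulated range: {inside}")

print(f"largest CM deviation: {max_cm_deviation(cm_positions):.2f}")
```

For the numbers quoted here, the model value of $x_2$ lies inside the simulated range for both loads, and the CM twist positions differ by at most about 0.03 of the line length.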
Measurements
On a test chip [2], a bus of seven 10 mm long differential interconnects is measured. The seven channels (see width equalization is used to achieve a data rate of 3 Gb/s. This data rate is measured on channel 4, as described in [2]. In this paper, we show the results of measurements on channels 1 and 6. These measurements show the effectiveness of the twists. Fig. 8 shows the measured transfer function of ch. 6 and the crosstalk transfer functions from ch. 5 and ch. 7 to ch. 6. As expected, the crosstalk from ch. 5 is lower than the crosstalk from ch. 7 (the double twist in ch. 5 reduces CM crosstalk, see top of Fig. 8), and the crosstalk from both ch. 5 and ch. 7 is reduced for the differential output (the single twist in ch. 6 reduces DM crosstalk, see bottom of Fig. 8).
The transfer functions of Fig. 9 have a smaller bandwidth due to the high-ohmic termination of ch. 1. There is more crosstalk from ch. 2 on out1+ than on out1−, because out1− has no signal-carrying neighbor. The bottom graph shows that the crosstalk is not reduced for the differential output (no twist in ch. 1). In Fig. 10 
Conclusions
By using pulse-width equalization and low-ohmic termination, we achieve a data rate of 3 Gb/s over 10 mm long differential interconnects with a bandwidth of only 82 MHz [2]. However, because of the small spacing, long interconnects and high data rate, the crosstalk is considerable. Therefore, in order to achieve 3 Gb/s, the crosstalk also has to be mitigated. The twists that we use for this are analyzed in this paper. Our analysis shows that the optimal positions of the twists depend on the termination of the interconnect. Differential-mode crosstalk can be cancelled with only one twist at 50% by choosing equal load and source resistances. Two twists in the neighboring interconnects at 30% and 70% reduce common-mode crosstalk. Measurements show the effectiveness of the twists.
Figure 10: Single-ended (SE) and differential (DIFF) eye-diagram measurements.
Acknowledgements
This research is supported by the Technology Foundation STW, applied science division of NWO and the technology programme of the Ministry of Economic Affairs. The authors thank Philips Research for chip fabrication.
