Radio Frequency Modulated Signaling Interconnect for Memory-to-Processor
and Processor-to-Processor Interfaces: An Overview
Jason Y. Du, Student Member, IEEE
 
Abstract— With the evolution of heterogeneous computing 
systems, such as network-on-chip, high-performance distributed
computing, accelerator-rich architectures and cluster computing,
high-speed, energy-efficient and low-latency memory-to-processor
and processor-to-processor interfaces have become the key
enabling technology. Simultaneously, CMOS scaling has pushed
transistor switching speed up to the sub-THz range. Compared
with conventional baseband signaling interconnect,
radio-frequency or even millimeter-wave modulated signaling
interconnect offers unique features: ultra-low-power operation,
dynamic allocation of bandwidth and low latency. In this work,
we give an overview of the different generations of
radio-frequency interconnect (RF-I) technology and compare them
with conventional baseband signaling interconnect technologies.
The limitations and potential of RF-I are also discussed at the end.
 
Index Terms — interconnect, memory-to-processor, 
processor-to-processor, radio frequency modulated signaling, 
energy efficiency, radio-frequency interconnect (RF-I) 
 
I. INTRODUCTION 
To meet ever-increasing computation-intensive applications and
the demand for low-power, low-cost and high-performance systems,
the number of heterogeneous computing systems on a single chip
has increased enormously, including network-on-chip (NoC),
high-performance distributed computing (HPDC), accelerator-rich
architectures (ARA) and cluster computing (CC) [1]. The key
common requirement of these systems is a high-speed,
energy-efficient and low-latency interconnect technology that
supports communication from memory to processor, processor to
processor, accelerator to memory, accelerator to accelerator,
and accelerator to processor. Microsoft’s Project Catapult is a
good example of hyper-scale cloud-based acceleration that uses
conventional baseband high-speed interconnects on Altera’s
Stratix V D5 FPGA [2].
At the same time, thanks to aggressive CMOS scaling, transistor
switching speed increases substantially with each technology
node. According to ITRS reports [3], fT and fmax will exceed
800 GHz and 1 THz, respectively, in 10 nm CMOS technology. With
the advance of CMOS radio-frequency and millimeter-wave
circuits, ever higher bandwidth will become available. Many
recently published works have demonstrated millimeter-wave
operation above 60 GHz [4-9] and even up to the THz range
[10-14]. With more frequency bands available, CMOS-based
circuits are driving all kinds of radio-frequency applications.
For instance, CMOS RF circuits are used for wireless and
wireline communication [15-21], human-machine interfaces
[22-24], navigation [25], etc.
Power consumption and heat dissipation are critical issues in
modern high-performance computing platforms [2]. For example,
Fig. 1 shows that the power consumed by the serial interface is
almost comparable to that of the computing cores. Another
example, published by Intel, projects that serial interface
power will exceed 50% of total CPU power in the near future as
I/O data rates keep rising.
In this paper, conventional baseband signaling interconnects and
equalization techniques are reviewed in Sections II and III. In
Sections IV and V, we introduce the radio-frequency signaling
concept and summarize the development of radio-frequency
interconnect (RF-I) technology generation by generation.
Section VI concludes the paper.
 
II. CONVENTIONAL BASEBAND SIGNALING INTERCONNECT  
The data rate of peripheral serial input/output (I/O) for PC
and mobile computing platforms continues to scale to meet
high-bandwidth applications including high-resolution
displays, camera sensors and large-capacity external storage
[26]. In Fig. 2 (a), the blue curve is technology scaling, and
the red curve is the number of functions per chip, which
increases dramatically with technology scaling.

Fig. 1 Trend of power consumption of interconnect (data sources
shown: Oracle T3 processor; Intel, JSSC 2013)
More interestingly, Fig. 2 (b) shows that the CPU clock rate has
not changed much over 15 years, mainly because power and heat
dissipation have become severe problems. Similarly, the number
of I/O pads per chip has remained relatively constant, but for a
different reason: packaging cost. Nowadays, packaging plus
testing already account for a larger share than fabrication in
the semiconductor industry, especially for high-speed,
high-pin-count chips. As a result, there is a large gap of 15 to
20 times between the I/O data rate and the internal data rate.
Interconnect design has evolved considerably over the years, as
Fig. 3 shows. In the 1980s, when data rates were around tens of
Mb/s, the interconnect (on-chip routing wire, cable or copper
traces on a printed circuit board (PCB)) could be modeled as a
simple lumped capacitor. The transceiver was a CMOS
inverter-based driver and receiver.
Then, in the 1990s and 2000s, data rates reached hundreds of
Mb/s to the Gb/s level, and the same interconnects (on-chip
routing wire, cable or copper traces on PCB) had to be modeled
as transmission lines, which exhibit distributed effects such
as characteristic impedance mismatch, inter-symbol interference
(ISI), incident/reflected wave superposition, and so on. During
that period, many parallel interfaces with multi-drop features,
such as PCI and older generations of DDR, were replaced by
point-to-point serial interfaces.
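The impedance-mismatch effect named above can be quantified with
the standard voltage reflection coefficient, gamma = (Z_L - Z_0) /
(Z_L + Z_0). The short sketch below is illustrative only: the
50-ohm line and the load values are assumptions, not figures from
any design cited in this paper.

```python
def reflection_coefficient(z_load: float, z0: float = 50.0) -> float:
    """Voltage reflection coefficient at a resistive termination."""
    return (z_load - z0) / (z_load + z0)


# Assumed example terminations on an assumed 50-ohm line:
# matched, mildly mismatched, and nearly open.
for z_load in (50.0, 75.0, 1e9):
    gamma = reflection_coefficient(z_load)
    print(f"Z_L = {z_load:g} ohm -> gamma = {gamma:+.3f}")
```

A matched load reflects nothing, while a nearly open end reflects
almost the full incident wave, which then superimposes on later
symbols.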
Today, the data rates of mobile and PC computing platform
interfaces are about to exceed 10 Gb/s. Progress in cables,
vias and novel substrate materials has improved signal
integrity. However, compared with the increase in data rate,
these improvements are far from enough to overcome the bandwidth
limitation of the communication channel, so the link has to
rely on complicated and power-hungry equalization techniques.
Moreover, a transmission-line channel model is no longer
accurate enough: all the discontinuities and non-ideal effects,
including vias, bumps, bonding wires, pads, package traces,
connectors, etc., must be modeled carefully. It is difficult to
meet the data rate requirement and the power and cost budgets at
the same time, which makes this one of the most active areas in
both academia and industry.
 
III. STATE-OF-THE-ART EQUALIZATION SOLUTIONS
Fig. 4 illustrates a common data-link transmitter (TX) and
receiver (RX) architecture that combines all of the equalization
techniques discussed in this section.

Fig. 2 (a) Technology trend with functions per chip, (b) data bandwidth,
and CPU clock rate (axes: technology node (nm), functions per chip
(Gbit), normalized bandwidth; annotated gap: 15x ~ 20x)

Fig. 3 Evolution of serial interface over the several decades
(annotations: Today, 3GHz; source: B. Casper, Intel)

Fig. 4 Conventional comprehensive combination of equalization
techniques (blocks: TX FIR EQ with Z^-1 delay taps, channel,
RX CTLE + DFE EQ)

In the time domain, a single bit sent onto the channel arrives
with a long tail, which causes severe inter-symbol interference
(ISI). From the frequency-domain point of view, if the data
bandwidth is lower than the available channel bandwidth, for
example 2 Gb/s data over a channel with a 3-dB bandwidth of
3 GHz, no degradation is observed. However, if we try to send
more than 10 Gb/s over the same channel, the eye diagram closes
completely due to strong ISI and the bit error rate (BER)
becomes unacceptable. Similarly, a low-cost connector and cable
channel contains many discontinuities, such as bumps, vias,
package traces, PCB traces and connector transitions. All of
these can create strong resonances in the signal path, and they
are also sensitive to fabrication variations. For this
particular low-cost cable, a single transmitted bit spreads
over more than 20 unit intervals (UI) at the receiver input. In
this situation, if two or more bits are sent onto the channel
consecutively, the receiver cannot tell ones from zeros without
equalization. To make matters worse, the location and depth of
the resulting frequency notches are very sensitive to PCB and
connector fabrication variations. Equalizing so many non-ideal
channel effects is challenging and power-hungry.
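To make the single-bit "long tail" concrete, the sketch below
samples the pulse response of an idealized single-pole low-pass
channel. This model is an assumption chosen for simplicity: it is
far more benign than the measured channels discussed above, which
also contain reflections and notches, so it only illustrates the
trend of growing ISI as the bit rate approaches and passes the
channel bandwidth. The function names and the 40 Gb/s test rate
are hypothetical.

```python
import math


def pulse_response_cursors(bit_rate_hz, bandwidth_hz, n_post=5):
    """Sampled one-UI pulse response of a single-pole low-pass channel.

    Returns [main cursor, post-cursor 1, ..., post-cursor n_post],
    sampled at the end of each unit interval (UI).
    """
    ui = 1.0 / bit_rate_hz
    a = math.exp(-2.0 * math.pi * bandwidth_hz * ui)  # decay per UI
    return [(1.0 - a) * a**k for k in range(n_post + 1)]


def worst_case_eye(cursors):
    """Main cursor minus the sum of post-cursor ISI magnitudes."""
    return cursors[0] - sum(abs(c) for c in cursors[1:])


# 3 GHz channel bandwidth as in the text; the bit rates are examples.
for rate in (2e9, 10e9, 40e9):
    cursors = pulse_response_cursors(rate, 3e9)
    print(f"{rate / 1e9:g} Gb/s: eye opening = {worst_case_eye(cursors):+.3f}")
```

A non-positive eye opening means that some bit pattern drives the
sampled value to the wrong side of the decision threshold.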
Several equalization methods are commonly used: finite impulse
response (FIR) filter equalization at the transmitter side, and
continuous-time linear equalization (CTLE) and nonlinear
decision feedback equalization (DFE) at the receiver side. Each
has its own advantages and disadvantages. In practice, they are
combined to reach an optimal operating point in terms of energy
efficiency and maximum data bandwidth.
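As an illustration of one of these methods, the sketch below
shows how a DFE removes post-cursor ISI by subtracting the
contribution of already-decided bits. It is a minimal model under
stated assumptions: the three-tap pulse response `h` is invented
for the example, the DFE taps are assumed to match the channel
exactly, and noise is ignored.

```python
def channel(bits, cursors):
    """Map bits to +/-1 symbols and apply the sampled pulse response."""
    symbols = [1.0 if b else -1.0 for b in bits]
    out = []
    for n in range(len(symbols)):
        out.append(sum(h * symbols[n - k]
                       for k, h in enumerate(cursors) if n - k >= 0))
    return out


def slicer(samples):
    """Plain threshold receiver with no equalization."""
    return [1 if y > 0 else 0 for y in samples]


def dfe(samples, cursors):
    """Decision-feedback equalizer with ideal post-cursor taps."""
    decisions = []  # past decisions as +/-1, most recent last
    for y in samples:
        isi = sum(h * decisions[-k]
                  for k, h in enumerate(cursors)
                  if 1 <= k <= len(decisions))
        decisions.append(1.0 if y - isi > 0 else -1.0)
    return [1 if d > 0 else 0 for d in decisions]


# Assumed channel: the post-cursors together outweigh the main cursor,
# so the unequalized eye is closed.
h = [0.4, 0.35, 0.2]
bits = [1, 1, 0, 1, 0, 0, 1, 1, 1, 0]
rx = channel(bits, h)
print("slicer ok:", slicer(rx) == bits)
print("DFE ok:   ", dfe(rx, h) == bits)
```

The feedback loop is also why a DFE costs power at high data
rates: each decision must be fed back within one UI.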
 
IV. CONCEPT OF RADIO FREQUENCY MODULATED SIGNALING 
The fundamental idea of multi-band signaling is the same as in
a cable TV system or a wireless orthogonal frequency-division
multiplexing (OFDM) system. However, both cable TV and wireless
OFDM are relatively narrow-band systems, while a serial
interface is broadband. The channel conditions of a serial
interface are also very different.
Fig. 5 uses PAM-8 and 64-QAM as an example. Fifteen parallel
data streams, each running at 1 Gb/s, serve as the data source.
Three of them feed the PAM-8 modulator, whose time-domain output
stays at baseband but has multiple levels. Six of them feed a
64-QAM modulator, whose output is carried on RF carrier
frequency f1. Similarly, the remaining six are modulated onto
another RF carrier frequency f2. All of these waveforms are
then summed together. In the frequency domain, the result
contains one baseband component, one RF band at f1 and another
RF band at f2.
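The 15-bit example above can be sketched numerically as follows.
This is a simplified model, not the circuit of any cited
transceiver: the sample count `N` and the choice of 3 and 6
carrier cycles per symbol (`K1`, `K2`) are assumptions made so
that the three bands are exactly orthogonal over one symbol, and
the bit-to-level mapping is plain binary rather than Gray coded.

```python
import math

N = 64          # samples per symbol period (assumed)
K1, K2 = 3, 6   # carrier cycles per symbol for f1 and f2 (assumed)


def bits_to_level(bits):
    """Map a bit group to a bipolar level, e.g. 3 bits -> -7, -5, ..., +7."""
    value = int("".join(str(b) for b in bits), 2)
    return 2 * value - (2 ** len(bits) - 1)


def level_to_bits(level, width):
    value = (round(level) + (2 ** width - 1)) // 2
    return [int(c) for c in format(value, f"0{width}b")]


def modulate(bits15):
    bb = bits_to_level(bits15[0:3])                                      # PAM-8
    i1, q1 = bits_to_level(bits15[3:6]), bits_to_level(bits15[6:9])      # 64-QAM, f1
    i2, q2 = bits_to_level(bits15[9:12]), bits_to_level(bits15[12:15])   # 64-QAM, f2
    wave = []
    for n in range(N):
        p1 = 2 * math.pi * K1 * n / N
        p2 = 2 * math.pi * K2 * n / N
        wave.append(bb + i1 * math.cos(p1) - q1 * math.sin(p1)
                    + i2 * math.cos(p2) - q2 * math.sin(p2))
    return wave


def demodulate(wave):
    def corr(k, fn):
        # Correlate with one carrier over a full symbol period.
        return 2.0 / N * sum(s * fn(2 * math.pi * k * n / N)
                             for n, s in enumerate(wave))

    levels = [sum(wave) / N,                           # baseband level
              corr(K1, math.cos), -corr(K1, math.sin),
              corr(K2, math.cos), -corr(K2, math.sin)]
    bits = []
    for level in levels:
        bits += level_to_bits(level, 3)
    return bits


tx_bits = [1, 0, 1, 0, 1, 1, 1, 0, 0, 0, 0, 1, 1, 1, 0]
print("recovered:", demodulate(modulate(tx_bits)) == tx_bits)
```

Because each band is recovered by correlation with its own
carrier, the three bands share one wire without interfering in
this idealized, noise-free setting.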
The merits of frequency-domain multi-band signaling over
time-domain baseband signaling are emphasized in Fig. 6: (1)
multi-band signaling offers simultaneous and orthogonal
communication channels in the frequency domain; (2) it adapts
easily to channel frequency notches through a careful choice of
carrier frequencies; (3) it relaxes the equalization
requirement because of the self-equalization effect.
 
V. RF-I TECHNOLOGY DEVELOPMENT OVERVIEW  
The first radio-frequency interconnect (RF-I) transceiver was
published at the 2009 Symposium on VLSI Circuits by Dr. Tam
[27], as shown in Fig. 7. It used 30-GHz and 50-GHz carriers
(mm-wave frequencies) and achieved a 10 Gb/s aggregate data
rate. However, the channel was only a 5 mm on-chip transmission
line for point-to-point communication. Such a link is practical
for network-on-chip (NoC) and accelerator-rich architecture
(ARA) applications, but the channel is too short for
high-performance distributed computing (HPDC) and cluster
computing (CC) applications. The modulation scheme was
non-coherent on-off keying (OOK), which is the simplest
modulation scheme and has the lowest power and hardware
overhead.
The second generation of RF-I was demonstrated by Dr. Kim at
ISSCC 2012 [28], as shown in Fig. 8. It used an 18 GHz RF
carrier and achieved an 8 Gb/s aggregate data rate. The channel
was much more challenging than that of the first-generation
RF-I: it included 5-cm PCB traces on FR-4 material. The link
was still point-to-point with on-off keying modulation.
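The appeal of non-coherent OOK in these first two generations is
that the receiver needs no carrier phase information. The sketch
below illustrates the principle with a square-law energy
detector; it is not the circuit of [27] or [28], and the sample
counts, carrier cycles per bit and threshold are assumptions.

```python
import math

SAMPLES_PER_BIT = 32   # assumed
CYCLES_PER_BIT = 4     # assumed carrier cycles per bit


def ook_modulate(bits, phase0=0.0):
    """Turn the carrier on for a 1 and off for a 0."""
    wave = []
    for b in bits:
        for n in range(SAMPLES_PER_BIT):
            phase = 2 * math.pi * CYCLES_PER_BIT * n / SAMPLES_PER_BIT
            wave.append(b * math.cos(phase + phase0))
    return wave


def energy_detect(wave, threshold=0.25):
    """Non-coherent detection: square, average over each bit, compare."""
    bits = []
    for start in range(0, len(wave), SAMPLES_PER_BIT):
        chunk = wave[start:start + SAMPLES_PER_BIT]
        mean_square = sum(s * s for s in chunk) / len(chunk)  # 0.5 for a 1
        bits.append(1 if mean_square > threshold else 0)
    return bits


data = [1, 0, 0, 1, 1, 0, 1, 0]
# The receiver recovers the bits for any unknown carrier phase.
for phase0 in (0.0, 1.0, 2.5):
    print(phase0, energy_detect(ook_modulate(data, phase0)) == data)
```

The price of this simplicity is one bit per symbol per band,
which is why later generations moved to coherent QPSK and QAM.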

Fig. 5 Concept of radio frequency modulated signaling (bands shown:
baseband, e.g. PAM-8; RF-Band1 at f1, e.g. 64-QAM; RF-Band2 at f2,
e.g. 64-QAM; summed into the multi-band signal)

Fig. 6 Concept comparison of radio frequency modulated signaling and
baseband signaling (panels: time-domain baseband signal; frequency-domain
multi-band signal; labels: Ts, B, f1, f2, IQ mixer)

Fig. 7 On-chip RF-I transceiver in 2009 VLSI (Sai-Wang Tam, 2009 VLSI;
30G band 4Gbps, 50G band 4Gbps)

Fig. 8 On-board RF-I transceiver in 2012 ISSCC (Yanghyo Kim, 2012
ISSCC; RF band 4Gbps, BB band 6.5Gbps)
A more advanced multi-band RF interconnect transceiver was
realized in [15], as shown in Fig. 9. It showed that, to extend
the communication distance and make the whole serial interface
more industry-friendly, the carrier frequencies have to be
reduced from the millimeter-wave range to below 10 GHz, because
at millimeter-wave frequencies it is difficult to achieve high
energy efficiency due to skin-effect metal loss and dielectric
loss. This implementation uses five carrier frequencies and
achieves a 4 Gb/s aggregate data rate over a 2-inch
transmission line on an FR-4 PCB for point-to-point
communication. It was the first demonstration of coherent
modulation with QPSK in a multi-band signaling serial interface
transceiver.
The updated multi-band RF serial interconnect transceiver uses
3 GHz and 6 GHz carrier frequencies to achieve a 10 Gb/s
aggregate data rate per differential pair [16], as shown in
Fig. 10. The channel is 2-inch copper traces on an FR-4 PCB.
More significantly, both point-to-point and multi-drop channel
communication are demonstrated with a 16-QAM coherent
modulation scheme.
The latest generation of RF-I transceiver also uses 3 GHz and
6 GHz carrier frequencies and achieves a 16 Gb/s aggregate data
rate per differential pair [17], as shown in Fig. 11. Its most
advanced feature is that it is cognitive of different data
transmission channel conditions: it can learn difficult
channels, such as a multi-drop bus memory interface channel or
a low-cost cable/connector channel. At the same time, it
achieves an energy efficiency below 1 pJ/bit. The TX learns an
arbitrary channel response by sending a sweep of
continuous-wave tones, detecting the power level at the
receiver (RX) side, and then adapting the modulation scheme,
data bandwidth and carrier frequency to the detected channel
information. The supported modulation schemes range from
NRZ/QPSK to PAM-16/256-QAM. The highly reconfigurable TX can
handle low-cost serial channels, such as low-cost connectors,
cables or multi-drop buses (MDB), that have deep and narrow
notches in the frequency domain (e.g., 40 dB loss at the
notches). The adaptive multi-band scheme relaxes the
equalization requirement and improves energy efficiency by
avoiding frequency notches and making full use of the available
signal-to-noise ratio (SNR) and channel bandwidth.
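The sweep-then-adapt procedure described above can be sketched as
a simple planning step. Everything numeric here is hypothetical:
the SNR thresholds, the reference SNR at 0 dB loss, the candidate
carriers and the example sweep result are assumptions for
illustration and are not taken from [17].

```python
def plan_bands(measured_loss_db, candidate_carriers_ghz, snr_at_0db_loss=40.0):
    """Pick bits per symbol for each candidate carrier from swept-tone loss.

    measured_loss_db: {frequency_GHz: loss_dB} obtained from the CW sweep.
    Returns {carrier_GHz: bits_per_symbol}; 0 means the band is skipped.
    """
    # Hypothetical SNR thresholds (dB) -> bits per symbol:
    # 256-QAM = 8, 64-QAM = 6, 16-QAM = 4, QPSK = 2.
    thresholds = [(34.0, 8), (28.0, 6), (22.0, 4), (16.0, 2)]
    plan = {}
    for fc in candidate_carriers_ghz:
        snr = snr_at_0db_loss - measured_loss_db[fc]
        plan[fc] = next((bits for need, bits in thresholds if snr >= need), 0)
    return plan


# Assumed sweep result with a 40 dB notch at 4.5 GHz.
sweep = {3.0: 4.0, 4.5: 40.0, 6.0: 10.0}
print(plan_bands(sweep, [3.0, 4.5, 6.0]))
```

In this example the carrier sitting in the notch is dropped, and
the two clean bands are assigned the densest constellation their
estimated SNR supports.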
 
VI. CONCLUSION  
In this paper, radio-frequency modulated signaling interconnect
is introduced and compared with conventional baseband
signaling. We give an overview of the different generations of
radio-frequency interconnect (RF-I) technology. Table I
compares the generations in terms of data rate, modulation
scheme, channel condition, latency and energy efficiency.

Fig. 9 On-board RF-I transceiver with QPSK in 2015 CICC (W. H. Cho,
2015 CICC)

Fig. 10 On-board multi-band RF serial interconnect transceiver with
16-QAM supporting multi-drop bus (MDB) channel in 2016 ISSCC (W. H. Cho,
2016 ISSCC)

Fig. 11 Cognitive RF-I transmitter with reconfigurable coherent
modulation up to 256-QAM in 2016 VLSI. (a) Layout with digital control
+ PRBS, carrier generator and analog front-end; dimension labels
300 um, 50 um and 40 um; total active area = 0.016 mm^2; breakdown by
block: summation 34%, clock buffer 19%, DAC 19%, divider 14%, phase
interpolator 9%, PRBS 3%, controller 1%, mixer 1%. (b) Output
waveforms for QPSK, 16-QAM, 64-QAM and 256-QAM; scale labels 1 ns and
600 mV, 150 mV, 50 mV, 15 mV.

TABLE I
DIFFERENT GENERATIONS OF RF-I COMPARISON

Paper           | Data rate/diff. pair | Modulation scheme | Channel condition                   | Latency                    | Energy efficiency
2009 VLSI [27]  | 8 Gb/s               | OOK               | on-chip trace                       | Low (w/o clock forwarding) | 15 pJ/bit
2012 ISSCC [28] | 10 Gb/s              | OOK               | on-board trace                      | Low (w/o clock forwarding) | 10 pJ/bit
2015 CICC [15]  | 10 Gb/s              | QPSK              | on-board trace                      | Low (w/o clock forwarding) | 2 pJ/bit
2016 ISSCC [16] | 10 Gb/s              | 16-QAM / PAM-4    | on-board trace                      | Low (w/ clock forwarding)  | 1 pJ/bit
2016 VLSI [17]  | 16 Gb/s              | 256-QAM / PAM-16  | on-board trace, MDB, low-cost cable | Low (w/ clock forwarding)  | < 1 pJ/bit
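The energy-efficiency column is power divided by data rate, since
1 mW / (1 Gb/s) = 1 pJ/bit. As a quick check, the sketch below
applies this to the power and data-rate figures that appear in the
titles of [16] and [17]; treating those title figures as covering
the same circuit blocks as Table I is an assumption.

```python
def energy_per_bit_pj(power_mw: float, data_rate_gbps: float) -> float:
    """Energy efficiency in pJ/bit: mW divided by Gb/s."""
    return power_mw / data_rate_gbps


# (power in mW, data rate in Gb/s), taken from the reference titles.
links = {
    "2016 ISSCC [16]": (38.0, 40.0),  # 38 mW, 40 Gb/s over 4 lanes
    "2016 VLSI [17]": (14.7, 16.0),   # 14.7 mW, 16 Gb/s
}
for name, (power_mw, rate_gbps) in links.items():
    print(f"{name}: {energy_per_bit_pj(power_mw, rate_gbps):.2f} pJ/bit")
```

Both results land at or just under 1 pJ/bit, in line with the
last two rows of Table I.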
  
REFERENCES 
[1] Jason Cong, Mau-Chung Frank Chang, Glenn Reinman, and Sai-Wang 
Tam. 2009. Multiband RF-interconnect for reconfigurable 
network-on-chip communications. In Proceedings of the 11th 
international workshop on System level interconnect prediction (SLIP 
'09). ACM, New York, NY, USA, 107-108. 
[2] A. M. Caulfield et al., "A cloud-scale acceleration architecture," 2016 
49th Annual IEEE/ACM International Symposium on Microarchitecture 
(MICRO), Taipei, Taiwan, 2016, pp. 1-13. 
[3] http://www.itrs2.net/itrs-reports.html  
[4] H. Wu et al., "A Current-Mode mm-Wave direct-conversion receiver with 
7.5GHz Bandwidth, 3.8dB minimum noise-figure and +1dBm P1dB, out 
linearity for high data rate communications," 2013 IEEE Radio 
Frequency Integrated Circuits Symposium (RFIC), Seattle, WA, 2013, 
pp. 89-92. 
[5] Z. Z. Chen et al., "A wide-band 65nm CMOS 28–34 GHz synthesizer 
module enabling low power heterodyne spectrometers for planetary 
exploration," 2015 IEEE MTT-S International Microwave Symposium, 
Phoenix, AZ, 2015, pp. 1-3. doi: 10.1109/MWSYM.2015.7166709 
[6] H. Wu, N. Y. Wang, Y. Du and M. C. F. Chang, "A Blocker-Tolerant 
Current Mode 60-GHz Receiver With 7.5-GHz Bandwidth and 3.8-dB 
Minimum NF in 65-nm CMOS," in IEEE Transactions on Microwave 
Theory and Techniques, vol. 63, no. 3, pp. 1053-1062, March 2015. 
[7] Y. Li et al., "A multi-band low-noise transmitter with digital carrier 
leakage suppression and linearity enhancement," in IEEE Transactions on 
Circuits and Systems I: Regular Papers, vol. 60, no. 5, pp. 1209-1219, 
May 2013. 
[8] A. Tang et al., "A 95 GHz centimeter scale precision confined pathway 
system-on-chip navigation processor for autonomous vehicles in 65nm 
CMOS," 2015 IEEE MTT-S International Microwave Symposium, 
Phoenix, AZ, 2015, pp. 1-3. 
[9] A. Tang et al., "CMOS (Sub)-mm-Wave System-on-Chip for exploration 
of deep space and outer planetary systems," Proceedings of the IEEE 
2014 Custom Integrated Circuits Conference, San Jose, CA, 2014, pp. 
1-4. 
[10] A. Tang et al., "A 65nm CMOS 140 GHz 27.3 dBm EIRP transmit array 
with membrane antenna for highly scalable multi-chip phase arrays," 
2014 IEEE MTT-S International Microwave Symposium (IMS2014), 
Tampa, FL, 2014, pp. 1-3. 
[11] A. Tang et al., "A 200 GHz 16-pixel focal plane array imager using 
CMOS super regenerative receivers with quench synchronization," 2012 
IEEE/MTT-S International Microwave Symposium Digest, Montreal, 
QC, Canada, 2012, pp. 1-3. 
[12] Y. Zhao et al., "A 0.56 THz Phase-Locked Frequency Synthesizer in 65 
nm CMOS Technology," in IEEE Journal of Solid-State Circuits, vol. 51, 
no. 12, pp. 3005-3019, Dec. 2016. 
[13] R. Al Hadi, Y. Zhao, Y. Li, Y. Du, and M.-C. F. Chang, "Retroactive 
terahertz displacement sensor in a standard 65nm CMOS technology", in 
Proc. OSA Conf. Lasers and Electro-Optics (CLEO), San Jose, CA, Jun. 
2016, pp. 1−3. 
[14] R. A. Hadi et al., "A spectral profiling method of mm-wave and terahertz 
radiation sources," 2016 IEEE MTT-S International Microwave 
Symposium (IMS), San Francisco, CA, 2016, pp. 1-3. 
[15] W. H. Cho et al., "A 5.4-mW 4-Gb/s 5-band QPSK transceiver for 
frequency-division multiplexing memory interface," 2015 IEEE Custom 
Integrated Circuits Conference (CICC), San Jose, CA, 2015, pp. 1-4. 
[16] W. H. Cho et al., "A 38mW 40Gb/s 4-lane tri-band PAM-4 / 16-QAM 
transceiver in 28nm CMOS for high-speed Memory interface," 2016 
IEEE International Solid-State Circuits Conference (ISSCC), San 
Francisco, CA, 2016, pp. 184-185. doi: 10.1109/ISSCC.2016.7417968 
[17] Y. Du et al., "A 16Gb/s 14.7mW tri-band cognitive serial link transmitter 
with forwarded clock to enable PAM-16 / 256-QAM and channel 
response detection in 28 nm CMOS," 2016 IEEE Symposium on VLSI 
Circuits (VLSI-Circuits), Honolulu, HI, 2016, pp. 1-2. doi: 
10.1109/VLSIC.2016.7573523  
[18] Y. Du et al., "A 16-Gb/s 14.7-mW Tri-Band Cognitive Serial Link 
Transmitter With Forwarded Clock to Enable PAM-16/256-QAM and 
Channel Response Detection," in IEEE Journal of Solid-State Circuits , 
vol.PP, no.99, pp.1-12 doi: 10.1109/JSSC.2016.2628049 
[19] B. Hu et al., "A Capacitor-DAC-Based Technique For 
Pre-Emphasis-Enabled Multi-Level Transmitters," in IEEE Transactions 
on Circuits and Systems II: Express Briefs , vol.PP, no.99, pp.1-1 
[20] Du, Yuan. (2016). Cognitive Serial Interface with Multi-Band Signaling 
and Channel Learning Mechanism. UCLA: Electrical Engineering 0303. 
Retrieved from: http://escholarship.org/uc/item/8vs373c5 
[21] Y. Li et al., "Carrier synchronisation for multiband RF interconnect 
(MRFI) to facilitate chip-to-chip wireline communication," in Electronics 
Letters, vol. 52, no. 7, pp. 535-537, 2016. 
[22] L. Du et al., "A 2.3 mW 11 cm Range Bootstrapped and Correlated 
Double Sampling 3D Touch Sensing Circuit for Mobile Devices," in 
IEEE Transactions on Circuits and Systems II: Express Briefs, 2017. 
[23] L. Du et al., "6.7 A 2.3mW 11cm-range bootstrapped and 
correlated-double-sampling (BCDS) 3D touch sensor for mobile 
devices," 2015 IEEE International Solid-State Circuits Conference - 
(ISSCC) Digest of Technical Papers, San Francisco, CA, 2015 
[24] L. Du et al., "Invited: Airtouch: A novel single layer 3D touch sensing 
system for human/mobile devices interactions," 2016 53rd 
ACM/EDAC/IEEE Design Automation Conference (DAC), Austin, TX, 
2016, pp. 1-6.  
[25] Lv Jingjing, Du Li, "Vehicular Collision Avoiding System Based on Two 
Ultrasonic Receivers," Value Engineering, vol. 22, 2010. 
[26] J. Jaussi et al., "26.2 A 205mW 32Gb/s 3-Tap FFE/6-tap DFE 
bidirectional serial link in 22nm CMOS," 2014 IEEE International 
Solid-State Circuits Conference Digest of Technical Papers (ISSCC), San 
Francisco, CA, 2014, pp. 440-441. 
[27] S. W. Tam, E. Socher, A. Wong and M. C. F. Chang, "A simultaneous 
tri-band on-chip RF-interconnect for future network-on-chip," 2009 
Symposium on VLSI Circuits, Kyoto, Japan, 2009, pp. 90-91. 
 
 
