A 440pJ/bit 1Mb/s 2.4GHz multi-channel FBAR-based TX and an integrated pulse-shaping PA by Paidimarri, Arun et al.
A 440pJ/bit 1Mb/s 2.4GHz Multi-Channel FBAR-based TX
and an Integrated Pulse-shaping PA
Arun Paidimarri, Phillip M. Nadeau, Patrick P. Mercier, Anantha P. Chandrakasan ∗
Massachusetts Institute of Technology, Cambridge, MA 02139, USA
Abstract
A 2.4GHz TX in 65nm CMOS defines three channels us-
ing three high-Q FBARs and supports OOK, BPSK and MSK.
The oscillators have −132dBc/Hz phase noise at 1MHz offset,
and are multiplexed to an efficient resonant buffer. Optimized
for low output power ≈−10dBm, a fully-integrated PA im-
plements 7.5dB dynamic output power range using a dynamic
impedance transformation network, and is used for amplitude
pulse-shaping. Peak PA efficiency is 44.4% and peak TX effi-
ciency is 33%. The entire TX consumes 440pJ/bit at 1Mb/s.
Introduction
Body Area Networks (BANs) for continuous health mon-
itoring applications require radios that are both reliable and
energy efficient in order to minimize device size and extend
battery lifetime. Due to short transmit distances and an energy-
asymmetric star topology, a low output power of −10dBm is
sufficient for the sensor node [1]. Low PA power consumption
places stringent power constraints on LO generation and mod-
ulation to maintain high overall TX efficiency. Recent work
shows that high-Q direct-RF resonators, such as FBAR and
SAW, provide low-power and stable LOs [2, 3], avoiding slow-
starting, power-hungry PLLs. Low tuning range, however, lim-
its operation to a single channel. Typical PAs are not optimized
for low output power, and this presents additional challenges.
This work proposes a high-Q RF resonator-based frequency
generation architecture that scales to multiple channels by mul-
tiplexing resonators. A three-channel FBAR-based TX operat-
ing in the 2.4GHz ISM band demonstrates the idea. Addition-
ally, a PA optimized for low output power is proposed, with an
integrated tunable impedance transformation network capable
of amplitude pulse-shaping for improved spectral efficiency.
Multi-Channel Transmitter Architecture
Fig. 1 shows the TX architecture. Three FBAR oscillators
are multiplexed to an efficient resonant buffer which directly
drives the PA. The buffer stage also incorporates matched de-
lays with inverted phase to provide BPSK modulation. Since
the output power is low, overall TX efficiency is critical, and
informs the design choices for LO generation, modulation, and
channel multiplexing. A low voltage design (0.7V) coupled
with rail-to-rail swing on all RF nodes is used for improved
power efficiency. Full swing on the oscillator minimizes short-
circuit current in the buffers, while full swing at the input of the
PA maximizes overdrive. The TX is designed for a datarate of
1Mb/s and supports three simple modulation schemes: OOK,
BPSK and MSK, all with pulse-shaping capability.
Fig. 1 also shows the schematic of a Pierce oscillator [2]
used for one channel. This inverter-based circuit provides
rail-to-rail output swing and reduces power consumption by
∗The authors acknowledge the financial support of the Interconnect Focus
Center, one of six research centers funded under the FCRP, an SRC entity. The
authors thank TSMC University Shuttle Program for chip fabrication and Dr.
Richard Ruby and Avago Technologies for providing FBARs.
-10dBm
PA
2.4GHz
ULP-TX
Resonant
BufferFB
A
R
O
sc
ill
at
or
s
M
U
X
n
Filtering QuantizerFPGA
Data
Pulse
Shaping
FBAR
off-chip
En
En
En
CMSK
2
/ 180o0o
Fig. 1. TX architecture with the FBAR oscillator of one channel
2x over an NMOS-only implementation via current re-use.
The oscillator consumes 150𝜇W. A digitally controlled capac-
itor bank, CMSK, tunes the oscillator center frequency over a
600kHz range with a 9.5kHz step-size, which is sufficient for
1Mb/s GMSK modulation. The measured phase noise at the
antenna is −132dBc/Hz at 1MHz offset. The oscillators have
a startup time <4𝜇s, permitting aggressive TX duty-cycling.
The outputs of the three FBAR oscillators are multiplexed
using transmission gates onto a resonant buffer that isolates the
oscillator from the variable Miller capacitance of the PA. The
low input-capacitance of the buffer also minimizes loading on
the oscillator. Resonance of the large PA gate capacitance with
an on-chip inductor minimizes power consumption since the
buffer only needs to compensate losses in the tank. As pulse-
shaping is handled by the PA and linearity is not a concern, the
buffer is implemented as a CMOS inverter and gives rail-to-rail
swing to the PA. Full-swing input from the oscillator leads to
low short-circuit current. The buffer consumes 100𝜇W. When
compared to a non-resonant buffer, resonance results in greater
than 2.5x lower buffer power, and 30% lower system power.
Integrated Pulse-Shaping Power Amplifier
The proposed push-pull PA is illustrated in Fig. 2. Tran-
sistors M1 and M2 are biased at ≈𝑉𝑇𝑁 and ≈(𝑉𝐷𝐷 + 𝑉𝑇𝑃 )
respectively with on-chip resistive-divider DACs to trade-off
short-circuit current and on-resistance. With the PA driven
strongly by rail-to-rail inputs from the buffer, drain node𝐷 has
a full 0 to 𝑉𝐷𝐷 swing, which gives the highest PA efficiency.
However, in order to radiate −10dBm, a 50Ω antenna requires
only 200mVp-p swing. To achieve this while also maintaining
rail-to-rail swing at𝐷, an impedance up-converter to 𝑍𝐿>50Ω
is used. The push-pull topology is better than inductor-biased
topologies (such as class E) since node𝐷 only requires a swing
of 𝑉𝐷𝐷 (and not 2𝑉𝐷𝐷) for maximum efficiency operation.
A fully-integrated tapped-capacitor matching network is
used for the impedance up-conversion. Digitally tunable ca-
pacitors 𝐶1 and 𝐶2 are simultaneously varied to maintain res-
onance, while up-converting to various values of 𝑍𝐿 as shown
by the governing equations and examples in Fig. 2. A 7.5dB
output power tuning range is achieved with negligible increase
in matching network complexity, while rail-to-rail swing at
node D provides maximum efficiency in all these settings. In
C1
C2
L
50Ω
ZL
VDD+VTP
VTNDACN
DACP
VOUT
VBUFF
En En
M1
Antenna
M2
En En
D
𝐿 𝐶1 𝐶2 𝑍𝐿 𝑃𝑂𝑈𝑇
(nH) (fF) (fF) (Ω) (dBm)
960 1010 300+j0 -7
6.44 850 1810 600+j0 -10
780 2820 1200+j0 -13
𝑍𝐿 ≈ 50
(
1 + 𝐶2
𝐶1
)2
+ 𝑗0
𝜔2 ≈ 𝐶1+𝐶2
𝐿𝐶1𝐶2
𝑃𝑂𝑈𝑇 ≈ 𝑉
2
𝐷𝐷
8ℜ(𝑍𝐿)
Output Power Control
and Digital Amplitude
Pulse-Shaping

Fig. 2. Integrated PA with pulse-shaping using dynamic impedance
transformation and its governing equations
addition, these capacitor bank settings can be switched dynam-
ically, at rates>10MHz, and are hence used for efficient ampli-
tude pulse-shaping. This is opposed to a power-hungry linear
mixer + PA approach. Traditional methods of efficient pulse-
shaping like supply modulators [4] can be used in addition to
this technique to expand the output power range even further.
Measurement Results
The TX is fabricated in a 65nm CMOS process. A 0.7V sup-
ply powers the RF circuits and a 1V supply powers the digital
switches in the multiplexers and capacitor banks. Unless other-
wise specified, all measurements reported are for a PA supply
of 0.7V. The peak PA efficiency is 43% for an output power of
−7dBm. When the PA operates at 0.5V, peak PA efficiency is
44.4% for an output power of−9.3dBm. Overall TX efficiency
is plotted in Fig. 3 as the impedance transformation ratio is var-
ied. At 0.7V PA supply, peak TX efficiency is 28.6% and at 1V,
it is 33%, the highest for all frequency-stable transmitters with
sub-mW outputs operating in the GHz frequencies (Fig. 3).
−16 −14 −12 −10 −8 −6 −4 −20
5
10
15
20
25
30
35
 RFIC’06
 ISSCC’05
 JSSC’09
 CICC’04
 ISSCC’08 BIOCAS’06 
 VLSI’10
TX
 E
ffi
cie
nc
y 
(%
)
Output Power (dBm)
 
 
VDD,PA = 0.5V
VDD,PA = 0.7V
VDD,PA = 1.0V
Fig. 3. Overall TX efficiency versus output power.
−90
−80
−70
−60
−50
−40
−30
−20
Sp
ec
tru
m
 (d
Bm
/R
BW
)
Center 2.450 GHz
Res BW 47 kHz
Span 100 MHz
10 MHz/divVBW 470 kHz
Channel 0 Channel 1 Channel 2
5d
B/
di
v
9 dB
6 dB
1MHz/div

Fig. 4. Spectra of Gaussian pulse-shaped and phase-scrambled
1Mb/s OOK for the three channels measured from the chip
Fig. 4 shows the multi-channel capability of the TX. Spec-
tra of 1Mb/s Gaussian pulse-shaped (𝛼=0.3) phase-scrambled
OOK (10x oversampling) of the three channels measured from
the chip are superimposed. Spurs from the 10x oversampling
are all below−30dBc. Pulse-shaping reduces the first sidelobe
−150
−100
−50
0
50
100
150
O
ut
pu
t V
ol
ta
ge
 (m
V)
Time (μs) 1 μs/div
+1 +1 −1 +1 −1 −1 +1 +1 −1
Fig. 5. 1Mb/s BPSK modulated waveform with SRRC filtering
−60
−55
−50
−45
−40
−35
−30
Sp
ec
tru
m
 (d
Bm
/R
BW
)
 
 
BPSK − Pulse Shaping Off
BPSK − Pulse Shaping On
Center 2.491 GHz
Res BW 4.7 kHz
Span 10 MHz
1 MHz/divVBW 47 kHz
13 dB
Fig. 6. 1Mb/s BPSK spectra with and without pulse-shaping
by 6dB and the second sidelobe by 9dB. At a −12.5dBm aver-
age output power, the TX consumes 440pJ/bit, which is lower
than all resonator-based transmitters at GHz frequencies.
Measured transient waveform and spectra for 1Mb/s BPSK
are shown in Figs. 5, 6. Square Root Raised Cosine (SRRC)
filtering (𝛽=0.3) at 8x oversampling suppresses the first side-
lobe by 13dB. The TX consumes 530pJ/bit while transmit-
ting −11dBm in this mode. In GMSK mode, the TX con-
sumes 550pJ/bit at −10dBm output power. The digitally fil-
tered GMSK modulation reduces the first sidelobe by 7dB and
second sidelobe by greater than 20dB as compared to MSK.
Technology 65nm CMOS
Supply 0.7V (RF), 1V(Switch)
Num. Channels 3
Startup Time 4𝜇𝑠
Data Rate 1Mb/s
Phase Noise -132dBc/Hz (1MHz off.)
PA Peak Eff. 44.4%
TX Peak Eff. 33%
POUT -17dBm to -2.5dBm
Energy per bit and Average POUT
OOK (Gauss.) 440pJ/bit at -12.5dBm
BPSK(SRRC) 530pJ/bit at -11dBm
GMSK 550pJ/bit at -10dBm
Fig. 7. TX performance summary and packaging photograph
Fig. 7 summarizes the TX specifications and shows the
packaging photograph of the CMOS die co-packaged with
three FBARs in a QFN40. The TX core occupies an area of
0.324𝑚𝑚2. The TX has also been used to successfully trans-
mit packetized ECG data to a commercial receiver.
In conclusion, a TX architecture optimized for the short-
distance link budgets of BANs is presented. The PA is opti-
mized for operation at −10dBm with integrated pulse-shaping
capability. High-Q FBAR based LO generation is extended to
three-channel operation. Integrated on-chip resonators [5] will
enable area-efficient expansion to larger number of channels.
References
[1] IEEE P802.15 TG6 Body Area Networks (BAN) draft standard,
accessed Dec. 2011, http://www.ieee802.org/15/pub/TG6.html
[2] Y. Chee, et al., Proc. Symp. VLSI Circuits, pp. 43-44, Jun. 2006.
[3] M. K. Raja, et al., Proc. A-SSCC’08, pp. 341-344, Nov. 2008.
[4] F. Raab, et al., Microwave Theory and Techniques, IEEE Trans-
actions on, vol. 50, no. 3, pp. 814-826, 2002.
[5] D. Weinstein, et al., Nano Letters, pp. 1234-1237, Apr. 2010.
