Low Power Analog to Digital Converters in Advanced CMOS Technology Nodes by Liu, Qiyuan
  
 
 
 
LOW POWER ANALOG TO DIGITAL CONVERTERS IN ADVANCED CMOS 
TECHNOLOGY NODES 
 
A Dissertation 
by 
QIYUAN LIU  
 
Submitted to the Office of Graduate and Professional Studies of 
Texas A&M University 
in partial fulfillment of the requirements for the degree of 
 
DOCTOR OF PHILOSOPHY 
 
 
Chair of Committee,      Jose Silva-Martinez 
Committee Members,     Sebastian Hoyos 
     Jiang Hu 
     Jay Porter 
Head of Department,     Miroslav M. Begovic 
 
 
May 2017 
 
Major Subject: Electrical Engineering 
 
Copyright 2017 Qiyuan Liu
ii 
ABSTRACT 
The dissertation presents system and circuit solutions to improve the power 
efficiency and address high-speed design issues of ADCs in advanced CMOS 
technologies. 
For image sensor applications, a high-performance digitizer prototype based on 
column-parallel single-slope ADC (SS-ADC) topology for readout of a back-illuminated 
3D-stacked CMOS image sensor is presented. To address the high power consumption 
issue in high-speed digital counters, a passing window (PW) based hybrid counter 
topology is proposed. To address the high column FPN under bright illumination 
conditions, a double auto-zeroing (AZ) scheme is proposed. The proposed techniques are 
experimentally verified in a prototype chip designed and fabricated in the TSMC 40 nm 
low-power CMOS process. The PW technique saves 52.8% of power consumption in the 
hybrid digital counters. Dark/bright column fixed pattern noise (FPN) of 0.0024%/0.028% 
is achieved employing the proposed double AZ technique for digital correlated double 
sampling (CDS). A single-column digitizer consumes total power of 66.8μW and 
occupies an area of 5.4 μm x 610 μm. 
For mobile/wireless receiver applications, this dissertation presents a low-power 
wide-bandwidth multistage noise-shaping (MASH) continuous-time delta-sigma 
modulator (CT-ΔΣM) employing finite impulse response (FIR) digital-to-analog 
converters (DACs) and encoder-embedded loop-unrolling (EELU) quantizers. The 
proposed MASH 1-1-1 topology is a cascade of three single-loop first-order CT-ΔΣM 
 iii 
stages, each of which consists of an active-RC integrator, a current-steering DAC, and 
an EELU quantizer. An FIR filter in the main 1.5-bit DAC improves the modulator’s 
jitter sensitivity performance. FIR’s effect on the noise transfer function (NTF) of the 
modulator is compensated in the digital domain thanks to the MASH topology. Instead 
of employing a conventional analog direct feedback path, a 1.5-bit EELU quantizer 
based on multiplexing comparator outputs is proposed; this approach is suitable for high-
speed operation together with power and area benefits. Fabricated in a 40-nm low-power 
CMOS technology, the modulator’s prototype achieves a 67.3 dB of signal-to-noise and 
distortion ratio (SNDR), 68 dB of signal-to-noise ratio (SNR), and 68.2 dB of dynamic 
range (DR) within 50.5 MHz of bandwidth (BW), while consuming 19 mW of total 
power (P). The proposed modulator features 161.5 dB of figure-of-merit (FOM), defined 
as FOM = SNDR + 10 log10 (BW/P). 
 
 iv 
DEDICATION 
 
To my parents and Xin 
 
 
 v 
ACKNOWLEDGEMENTS 
 
First of all, I would like to express my sincere appreciation to my advisor, Dr. Jose 
Silva-Martinez. Without his guidance, my research and dissertation will not be possible. 
During my research and study, he is always patient and supportive when I have problems. 
I have learnt from him, not just limited to technical aspect, but also from his personality. 
He is an extremely nice person and also a wonderful friend. 
I would like to thank Dr. Sebastian Hoyos, Dr. Jiang Hu, Dr. Jay Porter, and Dr. 
Shuguang Cui for their critical comments on my research. I would also like to thank Dr. 
Edgar Sanchez-Sinencio, Dr. Karman Entesari, and Dr. Samuel Palermo as instructors of 
my classes. 
I would like to thank Alexander Edward, Carlos Briseno-Vidrios, Haoyu Qian as 
reliable teammates and great friends collaborating on research projects. I would like to 
thank my colleagues and friends at Texas A&M for their help and support. Many thanks 
also go to the former Analog & Mixed Signal Center (AMSC) alumni who provide many 
valuable suggestions and help on my career. 
I would like to thank Mario Caresosa and Jun Cao for guiding me during my intern 
experience at Broadcom. I would like to thank Martin Kinyua and Eric G. Soenen for 
mentoring me during the internship at TSMC.  
Last but not least, I would like to express deep appreciation to my family for their 
support. I cannot thank more to my parents for bringing me to the world and shaping me 
to who I am today. 
 vi 
CONTRIBUTORS AND FUNDING SOURCES 
 
Contributors 
This work was supervised by a dissertation committee consisting of Professors Jose 
Silva-Martinez, Sebastian Hoyos, and Jiang Hu of the Department of Electrical and 
Computer Engineering and Professor Jay Porter of the Department of Engineering 
Technology and Industrial Distribution. 
The work for the Section III of the dissertation was completed by the student, in 
collaboration with Alexander Edward, under the advisement of Dr. Jose Silva-Martinez 
of the Department of Electrical and Computer Engineering together with Dr. Martin 
Kinyua and Dr. Eric G. Soenen from TSMC Technology, Inc. 
The work for the Section IV of the dissertation was completed by the student, in 
collaboration with Alexander Edward and Dadian Zhou, under the advisement of Dr. 
Jose Silva-Martinez of the Department of Electrical and Computer Engineering. 
The rest part of the dissertation was completed by the student, under the advisement 
of Dr. Jose Silva-Martinez of the Department of Electrical and Computer Engineering. 
Funding Sources 
Graduate study was supported by Texas Instrument Excellence Fellowship and 
Broadcom Corporation Fellowship. 
 
 vii 
TABLE OF CONTENTS 
 
 Page 
ABSTRACT ....................................................................................................................... ii 
DEDICATION .................................................................................................................. iv 
ACKNOWLEDGEMENTS ............................................................................................... v 
CONTRIBUTORS AND FUNDING SOURCES ............................................................. vi 
TABLE OF CONTENTS ................................................................................................. vii 
LIST OF FIGURES ........................................................................................................... ix 
LIST OF TABLES ......................................................................................................... xiii 
I. INTRODUCTION .......................................................................................................... 1 
1.1. Motivation ...................................................................................................... 1 
1.2. Research Contribution .................................................................................... 3 
1.3. Dissertaton Organization ................................................................................ 5 
II. ANALOG-TO-DIGITAL CONVERTER TOPOLOGIES ........................................... 6 
2.1. Introduction .................................................................................................... 6 
2.2. Nyquist ADCs ................................................................................................ 6 
2.3. Oversample ADCs ........................................................................................ 18 
2.4. Hybrid ADCs ................................................................................................ 24 
2.5. Conclusion .................................................................................................... 31 
III. A LOW POWER DIGITIZER FOR BACK-ILLUMINATED 3D-STACKED 
CMOS IMAGE SENSOR READOUT WITH PASSING WINDOW AND 
DOUBLE AUTO-ZEROING TECHNIQUES ........................................................... 33 
3.1. Introduction .................................................................................................. 33 
3.2. Sensor Architecture ...................................................................................... 36 
3.3. Proposed Techniques .................................................................................... 38 
3.4. Circuit Implementation ................................................................................. 47 
3.5. Measurement Results ................................................................................... 57 
3.6. Conclusion .................................................................................................... 65 
 viii 
IV. A 50-MHZ BW 67.3-DB SNDR MASH 1-1-1 CT ΔΣ MODULATOR WITH 
FIR DAC AND EELU QUANTIZER IN 40-NM CMOS ......................................... 66 
4.1. Introduction .................................................................................................. 66 
4.2. Modulator Architecture ................................................................................ 69 
4.3. Circuit Implementation ................................................................................. 79 
4.4. Measurement Results ................................................................................... 92 
4.5. Conclusion .................................................................................................... 98 
V. CONCLUSION ......................................................................................................... 100 
REFERENCES ............................................................................................................... 101 
 
 
 ix 
LIST OF FIGURES 
 
 Page 
Fig. 1.1 Resolution and bandwidth requirements for different applications. ..................... 2 
Fig. 2.1. Quantization noise ADC. ..................................................................................... 6 
Fig. 2.2. Sloping ADC architecture. ................................................................................... 7 
Fig. 2.3. Pulse-position modulation sloping ADC architecture (adapted from [1]). .......... 8 
Fig. 2.4. Pulse-position modulation sloping ADC timing scheme (adapted from 
[1]). ..................................................................................................................... 9 
Fig. 2.5. Multi-ramp single-slope ADC timing scheme (adapted from [2]). ..................... 9 
Fig. 2.6. Flash ADC architecture. ..................................................................................... 11 
Fig. 2.7. SAR ADC architecture. ..................................................................................... 12 
Fig. 2.8. Synchronous conversion for SAR ADCs (adapted from [3]). ........................... 13 
Fig. 2.9. Asynchronous conversion for SAR ADCs (adapted from [3]). ......................... 13 
Fig. 2.10. Monotonic capacitor switching procedure (adapted from [4]). ....................... 14 
Fig. 2.11. Time-interleaved SAR ADC (adapted from [5]). ............................................ 15 
Fig. 2.12. Pipeline ADC architecture. .............................................................................. 16 
Fig. 2.13. Time-interleaved pipeline ADC architecture (adapted from [8]). ................... 18 
Fig. 2.14. Delta-Sigma ADC. ........................................................................................... 19 
Fig. 2.15. Quantization noise: (a) oversample ADC; (b) delta-sigma ADC. ................... 19 
Fig. 2.16. Block diagram of second-order DT-ΔΣM. ....................................................... 21 
Fig. 2.17. Second-order DT-ΔΣM implementation. ......................................................... 22 
Fig. 2.18. Block diagram of first order CT-ΔΣM: (a) time-domain model; (b) 
linearized time-domain model splitting signal feedforward and output 
feedback paths. ................................................................................................. 23 
 x 
Fig. 2.19. Pipelined SAR ADC (adapted from [11]). ....................................................... 25 
Fig. 2.20. Noise-shaping SAR using FIR and IIR filter (adapted from [12]). ................. 26 
Fig. 2.21. Noise and mismatch shaping SAR with segmented DAC (adapted from 
[13]). ................................................................................................................. 27 
Fig. 2.22. A-synchronous SAR based CT-ΔΣM with digital delta-sigma truncator 
(adapted from [14]). .......................................................................................... 28 
Fig. 2.23. SAR-assisted CT-ΔΣM with 2nd-order noise shaping (adapted from 
[15]). ................................................................................................................. 28 
Fig. 2.24. Hybrid delta-sigma/pipelined (HDSP) modulator (adapted from [16]). .......... 29 
Fig. 2.25. The 0-N MASH ΔΣM concept (adapted from [17]). ....................................... 30 
Fig. 2.26. The 1-0 MASH ΔΣM with nonlinear memory error calibration (adapted 
from [18]). ......................................................................................................... 31 
Fig. 3.1. Block diagram of (a) 3D image sensor chip; (b) 96-column digitizer 
prototype. .......................................................................................................... 38 
Fig. 3.2. Global counter latching scheme: (a) without passing window; (b) with 
passing window. ............................................................................................... 40 
Fig. 3.3. CDS timing scheme for one horizontal scan: (a) under bright 
illumination condition; (b) under dark illumination condition. ........................ 43 
Fig. 3.4. Column comparator topologies. ......................................................................... 45 
Fig. 3.5. (a) Pixel source follower; (b) Ramp buffer. ....................................................... 48 
Fig. 3.6. Proposed digital counter: (a) architecture; (b) timing scheme; (c) delay 
paths. ................................................................................................................. 49 
Fig. 3.7. Proposed comparator with double AZ scheme. ................................................. 52 
Fig. 3.8. Column FPN caused by comparator delay variations without CDS: (a) 
VCP/VCN does not cross at comparator common-mode voltage Vcm; 
(b) VCP/VCN crosses at comparator common-mode voltage Vcm; (c) 
column FPN under various input signal level for (a) and (b); (d) sample 
images with column FPN of 0.83% (33.91 LSB) and 0.34% (14.11 
LSB) respectively. ............................................................................................ 54 
 xi 
Fig. 3.9. Column FPN caused by comparator delay variations with CDS: (a) 
VCP/VCN does not cross at comparator common-mode voltage Vcm; 
(b) VCP/VCN crosses at comparator common-mode voltage Vcm; (c) 
column FPN under various input signal level for (a) and (b); (d) sample 
images with column FPN of 0.65% (26.78 LSB) and 0.004% (0.163 
LSB) respectively. ............................................................................................ 55 
Fig. 3.10. Proposed ramp generator. ................................................................................ 56 
Fig. 3.11. Microphotograph of image sensor digitizer chip. ............................................ 57 
Fig. 3.12. Measured random noise performance. ............................................................. 59 
Fig. 3.13. Measured column FPN performance. .............................................................. 59 
Fig. 3.14.  Measured column FPN under condition of (a) dark illumination; (b) 
bright illumination. ........................................................................................... 60 
Fig. 3.15. Measured linearity performance. ..................................................................... 61 
Fig. 3.16. Power consumption of single-column digital counter. .................................... 62 
Fig. 3.17. Power breakdown of single-column digitizer. ................................................. 63 
Fig. 4.1. Architecture of a MASH 1-1-1 CT-ΔΣM including all inter-stage 
connection possibilities. .................................................................................... 71 
Fig. 4.2. Equivalent circuit to compute equivalent discrete loop transfer function 
using impulse invariant transformation. ........................................................... 74 
Fig. 4.3. Equivalent model of the CT-ΔΣM shown in Fig 4.1 with NCFs. ...................... 75 
Fig. 4.4. The proposed MASH 1-1-1 CT-ΔΣM. ............................................................... 75 
Fig. 4.5. NCFs of the proposed MASH 1-1-1 CT-ΔΣM. ................................................. 76 
Fig. 4.6. Simulated output spectrum of the proposed MASH 1-1-1 CT-ΔΣM: (a) 
w/ FIR compensation logic in NCF1; (b) w/o FIR compensation logic in 
NCF1. ................................................................................................................ 77 
Fig. 4.7. Quantization NF as function of RC time-constant variations. ........................... 78 
Fig. 4.8. Clock jitter effect on the NF with and without FIR DAC. ................................. 79 
Fig. 4.9. Top-level circuit implementation of the proposed MASH 1-1-1 CT-
ΔΣM. ................................................................................................................. 80 
 xii 
Fig. 4.10. Circuit implementation for OA: (a) NCFF topology; (b) transconductor 
Gm1; (c) transconductor Gm4 and Gm14; (d) error amplifier (EA) 
required for CMFB loop. .................................................................................. 82 
Fig. 4.11. Post-layout simulated OA1’s Bode plot: (a) gain; (b) phase. .......................... 83 
Fig. 4.12. ELD compensation path: (a) with conventional analog implementation; 
(b) with loop-unrolling digital implementation; (c) operation flow 
example. ............................................................................................................ 87 
Fig. 4.13. Loop-unrolling quantizer employing: (a) MUX-ENCODER topology; 
(b) ENCODER-MUX topology. ....................................................................... 89 
Fig. 4.14. 1.5-bit encoder-embedded loop-unrolling (EELU) quantizer topology. ......... 90 
Fig. 4.15. 1.5-bit FIR DAC implementation. ................................................................... 91 
Fig. 4.16. 500-run Monte-Carlo simulated distribution of SQNR with 1% current 
mismatch among segments of FIR DAC. ......................................................... 92 
Fig. 4.17. Microphotograph of the prototype chip. .......................................................... 93 
Fig. 4.18. Measured single-tone FFT spectrum. .............................................................. 94 
Fig. 4.19. Measured SNR and SNDR versus the modulator’s input signal level. ............ 95 
Fig. 4.20. Measured two-tone FFT spectrum. .................................................................. 96 
Fig. 4.21. Measured NF versus the first stage RC time-constant control code. ............... 97 
Fig. 4.22. Measured power consumption breakdown. ..................................................... 98 
 
 xiii 
LIST OF TABLES 
 
 Page 
Table 3.1: Performance summary and comparison with the state of the art .................... 64 
Table 4.1: Coefficients for the proposed MASH 1-1-1 CT-ΔΣM .................................... 74 
Table 4.2: Digitally tunable capacitors ............................................................................ 84 
Table 4.3: Performance summary and comparison with  state-of-the-art CT-ΔΣMs ....... 99 
 1 
I. INTRODUCTION 
 
1.1. Motivation  
At a time of the information technology revolution, one of the main driving sources 
for the revolution of modern human society is the trend of moving functionality into the 
digital domain. Faster digital signal processers (DSPs) capable of performing numerous 
complex functions are developed thanks to advanced CMOS technologies. However, the 
“bridge” between the analog and the digital world, analog-to-digital converters (ADCs), 
are facing design challenges to meet the wide-bandwidth, high-resolution and low-power 
consumption targets. With the added speed of new generations of DSPs implemented in 
advanced technology nodes, the bandwidth of ADCs is becoming the bottleneck for the 
overall system. Reduced supplies mean reduced signal range, which generally demands 
higher accuracy for the same dynamic range (DR) target. As a potentially power hungry 
component, the ADC power needs to be reduced to improve the battery life of portable 
devices.  
ADCs are widely used in the market for sensors, wireline/wireless communication, 
computers and consumer electronics. The resolution and bandwidth requirement for 
different applications are shown in Fig. 1.1. Different applications demand different 
ADC topologies and the specifications and design challenges are also different.  
 
 2 
Bandwidth (MHz)
· Medical
· Bluetooth
· Sensors
· Wired LAN
· DTV/DOCSIS
· LTE-A
· WiFi (802.11ac)
· Serial Link
· Optical Link
4
6
8
10
12
14
10-1 100 101 102 103 104
 
Fig. 1.1 Resolution and bandwidth requirements for different applications. 
 
One focus of the dissertation is on low-power ADC design for image sensor readout 
applications where the demanded resolution is around 12 bit. If global ADC architecture 
is employed for the entire 16-Mpixel, 8000 (H) x 2000 (V) array at a frame rate of 60 fps, 
the ADC sampling speed needs to be close to GHz Sample/s range. The simple 
calculation assumes the entire readout period is used for the analog-to-digital conversion. 
However this is not the real case in the application and thus even higher speed ADCs 
may be demanded. Another topology of using one single-slope ADC dedicated for each 
column can significantly lower down the speed requirement. The single-slope ADC 
array topology has the benefit of being very simple with only the comparator and the 
ramp as analog circuits. The counter and the logic are all implemented in digital. As an 
ADC array, the ramp can be shared and generated globally which can further improve 
 3 
the power efficiency of the topology. However, even with such benefits, the readout 
circuit still consumes a large portion of power in the image sensor chip especially under 
high-frame mode, due to the high-speed refreshing scheme in conventional counter 
implementation. Thus, to be able to achieve low-power readout for the next generation 
imagers operating at 120 fps or higher, effective techniques are demanded.  
Another application of focus is on the low-power ADC design in mobile/wireless 
receivers. In LTE-Advanced, with the carrier aggregation technique added to 
conventional LTE, the up and down link speed has been improved based on aggregating 
the component carriers with bandwidth of 1.4, 3, 5, 10, 15 or 20 MHz.  The same carrier 
aggregation concept is applied to WiFi 802.11ac to boost its link speed. The next 
generation 5th-Generation (5G), further improvement in the link speed is demanded 
which mains wider band ADCs in receivers will be in huge demand. Conventionally, 
single-loop continuous-time delta-sigma ADC is the most popular choice in the 
mobile/wireless receiver due to its high dynamic range capability, implicit anti-aliasing 
behavior, and tolerance of out-of-band blockers. However, there is still stability and 
overload recovery issue in high-order modulators demanding new techniques to handle. 
1.2. Research Contribution 
The dissertation presents system and circuit solutions to improve the power 
efficiency and address high-speed design issues of ADCs in advanced CMOS 
technologies.  
 4 
For image sensor applications, a high-performance digitizer prototype based on column-
parallel single-slope ADC (SS-ADC) topology for readout of a back-illuminated 3D-stacked 
CMOS image sensor is presented. To address the high power consumption issue in high-
speed digital counters, a passing window (PW) based hybrid counter topology is proposed. 
In this approach, the memory cells in the digital counters of SS-ADCs are disconnected from 
the global bus during non-relevant timing. To address the high column FPN under bright 
illumination conditions, a double auto-zeroing (AZ) scheme is proposed. In this technique, 
the AZ process is employed twice at reset and signal level, respectively. The double AZ 
scheme not only allows the comparator to serve as a crossing detector around the common-
mode level, but it also enables low-voltage comparator design. The proposed techniques are 
experimentally verified in a prototype chip designed and fabricated in the TSMC 40 nm low-
power CMOS process. The PW technique saves 52.8% of power consumption in the hybrid 
digital counters. Dark/bright column fixed pattern noise (FPN) of 0.0024%/0.028% is 
achieved employing the proposed double AZ technique for digital correlated double 
sampling (CDS). A single-column digitizer consumes total power of 66.8μW and occupies 
an area of 5.4 μm x 610 μm. 
For mobile/wireless receiver applications, this dissertation presents a low-power 
wide-bandwidth multistage noise-shaping (MASH) continuous-time delta-sigma 
modulator (CT-ΔΣM) employing finite impulse response (FIR) digital-to-analog 
converters (DACs) and encoder-embedded loop-unrolling (EELU) quantizers. The 
proposed MASH 1-1-1 topology is a cascade of three single-loop first-order CT-ΔΣM 
stages, each of which consists of an active-RC integrator, a current-steering DAC, and 
an EELU quantizer. An FIR filter in the main 1.5-bit DAC improves the modulator’s 
 5 
jitter sensitivity performance. FIR’s effect on the noise transfer function (NTF) of the 
modulator is compensated in the digital domain thanks to the MASH topology. Instead 
of employing a conventional analog direct feedback path, a 1.5-bit EELU quantizer 
based on multiplexing comparator outputs is proposed; this approach is suitable for high-
speed operation together with power and area benefits. Fabricated in a 40-nm low-power 
CMOS technology, the modulator’s prototype achieves a 67.3 dB of signal-to-noise and 
distortion ratio (SNDR), 68 dB of signal-to-noise ratio (SNR), and 68.2 dB of dynamic 
range (DR) within 50.5 MHz of bandwidth (BW), while consuming 19 mW of total 
power (P). The proposed modulator features 161.5 dB of figure-of-merit (FOM), defined 
as FOM = SNDR + 10 log10 (BW/P). 
1.3. Dissertaton Organization 
The dissertation is organized as follows: Chapter II discusses about different ADC 
implementation topologies. Chapter III presents a low power digitizer for back-
illuminated 3D-stacked CMOS image sensor readout with passing window and double 
auto-zeroing techniques. Chapter IV presents A 50-MHz BW 67.3-dB SNDR MASH 1-
1-1 CT ΔΣ modulator with FIR DAC and EELU quantizer. Chapter V concludes the 
dissertation. 
 6 
II. ANALOG-TO-DIGITAL CONVERTER TOPOLOGIES 
 
2.1. Introduction 
There are several major types of ADC architectures that could be used for broadband 
communications. Each type entails different trade-offs among resolution, speed, power 
and area. Overall, there are two main categories to classify these ADCs according to the 
ratio between the sampling frequency and signal bandwidth: Nyquist ADCs and 
Oversampling ADCs. In recent years, Hybrid ADCs combining different stand-alone 
ADC topologies have shown great potential to achieve better performance for certain 
application areas. 
2.2. Nyquist ADCs 
In Nyquist ADCs the sampling frequency is twice the value of the maximum input 
frequency wanted to be digitized as shown in Fig. 2.1.  
 
Frequency
Q
u
a
n
ti
za
ti
o
n
 
N
o
is
e
Fs/2 Fs
BW
 
Fig. 2.1. Quantization noise ADC. 
 
 7 
There are many different types of Nyquist ADCs that can be used. The most popular 
ones are, sloping, flash, successive approximation register (SAR), and pipeline ADCs. 
2.2.1. Sloping ADC 
In a sloping ADC as shown in Fig. 2.2, the voltage signal is first compared with a 
reference of ramp signal and thus converted into a pulse width signal. The pulse width 
signal is measured and converted into digital bits through a time-to-digital (TDC) 
converter. One simple example of TDC is a ripple counter which starts counting at start 
of the ramp signal and ends when the input voltage crosses with the ramp signal.  
 
comparator
Vin
RAMP
Time to Digital 
Converter Dout
 
Fig. 2.2. Sloping ADC architecture. 
 
The sloping ADC typically provides benefits of high-resolution and low area. 
However, the speed of operation is usually low due to the monotonic thermometer code 
like counting behavior. The worst condition happens when the input signal is full-scale. 
The counter needs to count up to close to its full-count, which takes time and power.  
One clear target to address the issue is to lower down the number of counting cycles to 
complete the conversion process. In [1], a pulse position modulation sloping ADC 
architecture employing two-step TDC topology is proposed, as shown in Fig. 2.3. The 5 
 8 
bit course time quantizer, formed using a counter measures the number of reference 
clock cycles while the counter_enable signal is high, thus measures 𝑡𝑐. The fine TDC 
measures the time  𝑡𝑓 defined as the time between the stop signal and clk_stop rising 
edges. The timing scheme is shown in Fig. 2.4 and the reference clock frequency is 
lowered by 16 times to achieve the same time resolution. Another possible solution is to 
employ a multiple ramp topology [2]. The basic concept of a multi-ramp single slope 
ADC is that the ramp voltage, which spans the entire input voltage range in the single 
slope architecture, is divided into m steps, each of which spans 1/m of the input range. 
The timing operation of the multi-ramp architecture is shown in Fig. 2.5. In the coarse 
phase, the comparator is connected to a single course ramp voltage, and the first 
conversion is performed. Next, the coarse conversion result is fed back into the ramp 
generator to decide the range for the fine conversion range. In the fine conversion, the 
ramp only has to span 1/m times the ADC input range, and therefore the conversion can 
be much faster.   
 
comparator
Vin
RAMP
Synchronizer M MSBsCoarse Counter
Fine Time
Measurement
counter_enable
clk_stop
N LSBs
TDCstop
clk
 
Fig. 2.3. Pulse-position modulation sloping ADC architecture (adapted from [1]). 
 
 9 
tm
tc
tf
vin
ramp
start
stop
clk
clk_stop
counter_enable
 
Fig. 2.4. Pulse-position modulation sloping ADC timing scheme (adapted from [1]). 
 
coarse ramp
coarse phase fine phase
Dc Df
t
v
ramp1
ramp2
ramp3
ramp4
 
Fig. 2.5. Multi-ramp single-slope ADC timing scheme (adapted from [2]). 
 
 10 
2.2.2. Flash ADC 
A flash ADC uses parallel comparators to compare the input signal with all transition 
points between adjacent quantization intervals. The result of these comparisons 
highlights the limit at which the input is larger than on of the thresholds giving 
information that can then be transformed into digital codes. The output is obtained in 
thermometer code, which is typically converted into a binary digital output. Since the 
comparators operate in parallel, the latency of the output is less than one clock cycle, 
making this architecture suitable for speeds of several GHz. However, a flash ADC 
suffers from limited resolution, the number of comparators and references required 
increases exponentially with the number of bits. For an N-bit flash ADC, (2
N
-1) 
comparators and reference voltages are required as shown in Fig. 2.6. 
Also, the offset requirement for the comparators and matching of the references 
increases exponentially, demanding more area and power consumption making the 
design of high resolution flash unreliable. Thus, it is not practical to implement a flash 
ADC with resolution higher than 7 bit. 
 
 11 
-Vref
+Vref Vin
Dout
TH
ER
M
O
M
ET
ER
 T
O
 B
IN
A
R
Y
 D
EC
O
D
ER
1
2N-1
 
Fig. 2.6. Flash ADC architecture. 
 
2.2.3. SAR ADC 
A SAR ADC uses a single comparator to quantize the input signal based on binary 
search algorithm. As shown in Fig. 2.7, the ADC consists of a comparator, a SAR digital 
decision logic, and DAC. During the conversion process, the DAC output gradually 
approaches the input voltage.  
SAR ADCs require multiple clock cycles to complete the digital output. For N-bit 
SAR ADC, N clock cycles are necessary. Therefore, the SAR ADCs are typically slower 
compared to flash ADCs. The SAR logic adjusts the reference voltage provided the 
DAC, which defines the resolution of the ADC and is normally implemented by 
capacitors or resistors. Thus, for high resolution the area of the SAR is dominated by the 
DAC, which is sized depending on the matching requirements. To achieve high-speed 
 12 
operation, however, smaller capacitor array is more desirable. Thus, there is a 
fundamental trade-off between speed and accuracy for SAR ADCs.  
 
Dout
SAR LOGIC
DAC
Vin
 
Fig. 2.7. SAR ADC architecture. 
 
The conventional implementation of the SA logic relies on a synchronous clock to 
divide the time into a signal tracking phase and conversion phase which processes from 
the MSB to the LSB as shown in Fig. 2.8. Each clock cycle has to tolerate the worst case 
comparison time, which is composed of maximum DAC settling time and comparator 
resolving time depending on the minimum resolvable input level. Therefore, the power 
and speed limitation of a synchronous SAR design come largely from the high-speed 
internal clock. Using asynchronous processing of the internal comparisons removes the 
need for such a clock and substantially improves the power efficiency of SAR ADCs. 
The concept of asynchronous processing is to trigger the internal comparison from MSB 
to LSB like dominoes [3]. As shown in Fig. 2.9, whenever the current comparison is 
complete, a ready signal is generated to trigger the next comparison process.  
 
 13 
MSB MSB1 MSB2 LSB
tracking phase
internal 
clk
synchronous conversion phasetracking phase
 
Fig. 2.8. Synchronous conversion for SAR ADCs (adapted from [3]). 
 
MSB MSB1 MSB2 LSB
tracking phaseasynchronous conversion phasetracking phase
 
Fig. 2.9. Asynchronous conversion for SAR ADCs (adapted from [3]). 
 
Efforts have also been made in the SAR algorithm to improve its switching energy 
efficiency and reduce the total capacitance. In [4], a monotonic capacitor switching 
procedure is proposed as shown in Fig. 2.10. After the sampling switches turn off, the 
comparator directly performs the first comparison without switching any capacitor. The 
subsequent switching subsequent switching sequence is also more efficient than the 
conventional scheme which leads to an average switching energy saving of 81%. 
However, in the monotonic switching topology the input common-mode gradually 
converges to ground. Thus a comparator with low sensitivity to the signal-dependent 
offset caused by the input common-mode voltage variation is demanded.  
 
 14 
2C C C
2C C C
Vip-Vin>0
2C C C
2C C C
Vip-Vin>0
2C C C
2C C C
Vip-Vin>0
2C C C
2C C C
Vip-Vin>0
Vref
2C C C
2C C C
Vip-Vin>Vref/2
Vref Vref
Vref Vref
2C C C
2C C C
Vip-Vin>-Vref/2
Vref Vref
Vref Vref Vref
Vref Vref Vref
Vref Vref Vref
2C C C
2C C C
Vip-Vin>0
Vref
Vref Vref Vref
Vref Vref
Vref Vref
Vref Vref
Vref Vref
Vref Vref Vref
Vref
CVref2
CVref2
(1/4)CVref2
(3/4)CVref2
(3/4)CVref2
(1/4)CVref2
Yes
No
Yes
No
Yes
No
 
Fig. 2.10. Monotonic capacitor switching procedure (adapted from [4]). 
 
Compared with flash ADC, SAR topology is very attractive for its superior power 
efficiency. Time interleaving techniques are proposed to speed up the operation of SAR 
ADCs into GHz Sample/s. In [5], a 10 bit time-interleaved SAR ADC with background 
time-slew calibration in 40 nm CMOS technology is presented, as shown in Fig. 2.11. 
The entire interleaved ADC consists of 16 main ADCs and 8 auxiliary ADCs for skew 
correction, which operate at 1/16 and 1/8 of the sampling frequency, respectively. The 
reported ADC demonstrates a digital timing-skew correction technique incorporated 
with a delta-sampling technique, and achieves a 2.6 GHz sampling rate and a wide signal 
bandwidth up to Nyquist. 
 15 
 
ADCm8
Ym8
ADCm16
Ym16
ADCa8
Ya8
Vin
ADCm1
Ym1
ADCm9
Ym9
ADCa1
Ya1
OFF-CHIP DIGITAL CALIBRATION 
(OFFSET, GAIN AND TIME SKEW)
 
Fig. 2.11. Time-interleaved SAR ADC (adapted from [5]). 
 
2.2.4. Pipeline ADC 
A pipeline ADC takes advantage of the speed of a flash ADC, and eliminates the 
exponential increase in complexity by cascading multiple stages of low resolution flash 
ADCs. A sample and hold (S&H) function in each stage allows all stages to operate 
concurrently, giving a throughput of one output sample per clock cycle. Fig. 2.12 shows 
that besides the sub ADC, a DAC, a subtractor, and an OpAmp are required for each 
stage.  
The operation is described as follow: First, the input signal is quantized by the sub 
ADC each stage includes a DAC to convert the quantized signal to analog, a residue 
calculator obtains the difference between the analog input signal and the quantized 
 16 
signal, then the residue is amplified to adjust the swing for the full-scale of the next 
stage, the amplified residue becomes the input of the next stage. The process is 
successively replicated until the last pipelined stage. The number of stages depends on 
the number of bits solved per stage, and the total number of bits target. Since all 
pipelined stages work simultaneously, the conversion speed of the pipeline ADC is close 
to the conversion speed of its single stage. However, since the input signal is quantized 
successively by the stages, the output has a latency delay equal to the number of stages. 
The limitation of the pipeline ADC is the high power consumption required for the 
calculation of the residue. Pipeline ADCs can reach resolution between 8 bit to 12 bits 
and speeds up to several GHz Sample/s. 
 
N BitsSub-ADC
(N BITs)
Sub-DAC
(N BITs)
 Gain
Vin Vout
STAGE #1
(N BITs)
STAGE #2
(N BITs)
STAGE #3
(N BITs)
STAGE #4
(N BITs)
a1 a2 a3 a4
Dout
Vin
Z-3 Z-2 Z-1
 
Fig. 2.12. Pipeline ADC architecture. 
 17 
In early implementations, the resolution of the sub-ADC is typically chosen to be 1.5 
bit per stage for two reasons [6]. The first reason in to maximize the bandwidth of the 
S&H/residue-amp switched capacitor circuit which limits the overall conversion rate.  
Second, with the use of digital correction algorithm in 1.5 bit per stage pipeline 
architecture, the overflow of present stage output from the input range of the following 
stage can be prevented even with the presence of a large comparator offset up to 1/4 of 
the reference voltage, so that this offset error amplified down the pipeline can be 
detected for correction. 
However, in recent implementations, 3 or 4 bit per stage topology gains more 
popularity. Compared with 1.5 bit per stage implementations, adding more bits in the 
first stage leads to less requirement on latter stages and thus results in lower power 
dissipation overall.  However, as the feedback factor for the residue amplifier drops, 
higher gain-bandwidth product (GBW) is demanded. Thus, precision amplifiers 
dominate the power consumption in most high-speed pipeline ADCs. In [7], a digital 
background calibration technique is proposed, enabling to replace precision amplifiers 
by simple power efficient open-loop stages. In the multibit first stage of a 12-bit 75-
MS/s proof-of-concept prototype, 60% residue amplifier power savings over a 
conventional implementation is achieved. 
Pipeline ADC can also be time-interleaved to further improve the operation speed. In 
[8], a 5.4 GS/s 12 bit two-way interleaved pipeline ADC is presented, as shown in Fig. 
2.13. It employs a complementary switched-capacitor amplifier topology to double the 
GBW without increasing power. The ping-pong amplifier sharing configuration doubles 
 18 
the sampling rate, whereas the sub DAC equalization digitally corrects the settling error 
and other dynamic errors in the ping-pong architecture.   
 
÷2
Buffer SHA Pipeline ADC
Buffer SHA Pipeline ADC
CLK
CLK0
CLK1
Vin
M
IS
M
A
TC
H
 C
O
R
R
EC
TI
O
N
 
Dout
 
Fig. 2.13. Time-interleaved pipeline ADC architecture (adapted from [8]). 
 
2.3. Oversample ADCs 
In oversample ADCs, the bandwidth of the input signal is less than half the 
bandwidth of the sampling frequency. However, the most widely-used architecture 
taking good advantage of the oversampling benefit is delta-sigma ADC. 
2.3.1. Delta-Sigma ADC 
A delta-sigma ADC achieves high resolution by combining the techniques of: 
oversampling and closed loop noise shaping. Fig. 2.14 shows the basic architecture of a 
delta-sigma ADC. A delta-sigma modulator consists of a loop filter, a sub ADC, a DAC, 
and a digital filter. 
 
 19 
Loop 
Filter
Sub-
ADC
DAC
Vin DigitalFilter Dout
 
Fig. 2.14. Delta-Sigma ADC. 
 
The effect of oversampling spreads the quantization from the sub ADC over a wider 
frequency. Therefore, if the oversampling ratio increases the quantization noise inside 
the desired bandwidth will reduce as shown in Fig. 2.15 (a). Moreover, the quantization 
noise level inside the desired bandwidth is shaped by the effect of the feedback loop 
around the sub ADC, such that most of the noise is shifted out of the bandwidth of 
interest as shown in Fig. 2.15 (b). The former made the delta-sigma ADC a perfect 
architecture for high resolution applications. However, the speed of the ADC is typically 
limited comparing with Nyquist ADCs due to the oversample behavior. Nonetheless, 
advance in process technology has allowed delta-sigma ADCs to reach the hundreds of 
MHz of bandwidth range, employing continuous-time implementation topologies. 
 
Frequency
Q
u
a
n
ti
za
ti
o
n
 N
o
is
e
Fs/2 Fs
BW
digital filter
(a) (b)
Frequency
Q
u
a
n
ti
za
ti
o
n
 N
o
is
e
Fs/2 Fs
BW
digital filter
 
Fig. 2.15. Quantization noise: (a) oversample ADC; (b) delta-sigma ADC. 
 20 
2.3.2. Discrete-Time Delta-Sigma Modulator (DT-ΔΣM) 
In discrete-time delta-sigma modulators, the input voltage is sampled first and then 
processed by the modulator.  Fig. 2.16 shows the block diagram of the second-order 
discrete-time delta-sigma modulator (ΔΣM). A forward path delay is included in both 
integrators, thus simplifying the implementation of the modulator with straightforward 
sampled-data analog circuits. Each integrator is preceded by an attenuation of 0.5, to 
address the swing issue at the integrator output nodes. The circuit implementation of the 
ΔΣM employing switch capacitor integrators and a single-bit comparator is shown in Fig. 
2.17. The comments on the non-idealities affecting the performance of the DT-ΔΣM are 
as follows [9]: 
a. OFFSET: Offset is minor concern in many signal acquisition systems, as long as 
the quantization is uniform. The offset at the input of the first integrator is the 
only significant contributor because offsets in the second stage and comparator 
are suppressed by the large low-frequency gain of the integrator.  
b. SAMPLING JITTER: Oversampled ADC put considerably less stringent 
requirements on the filter than Nyquist ADCs since the signal is sampled at a 
frequency far exceeding its bandwidth. 
c. NOISE: Noise injected at the modulator input is the dominant source. Input-
referred noise from the comparator undergoes the same second-order noise 
shaping as the quantization noise. The noise sources at the input of the second 
integrator are subjected to first-order noise shaping behavior. Out-of-band noise 
 21 
is eliminated by the decimation filter, but high-frequency noise at multiples of 
the sampling frequency will be aliased in band. 
d. INTEGRATOR DC GAIN: The performance penalty incurred is on the order of 
1 dB when the integrator dc gain is comparable to the oversampling ratio (OSR). 
e. INTEGRATOR BANDWIDTH: Integrator implementations using operational 
amplifiers (OPAMP) with bandwidths considerably lower than the sampling 
frequency, thus with correspondingly inaccurate settling, will not impair the 
ΔΣM performance as long as the settling process is linear. 
f. INTEGRATOR SLEW: Slewing distortion appears when the peak rate of change 
in the impulse response exceeds the maximum slew rate the integrator can 
support. 
g. COMPARATOR HYSTERESIS:  The sensitivity of ΔΣMs to comparator 
hysteresis is several orders of magnitude smaller than that of Nyquist ADCs 
attributed to the negative feedback with high loop gain topology. 
 
Vin
E1
Y1
ADC
DAC
Z-10.5Z-10.5
 
Fig. 2.16. Block diagram of second-order DT-ΔΣM. 
 
 22 
S1
S2 S3
4S
S1
S2
S3
S4
S1
S2 S3
S4
S1
S2
S3
S4
Vref+
Vref-
Vref+
Vref-
Vin
Y1
C1
C1
C1
C1
C2
C2
 
Fig. 2.17. Second-order DT-ΔΣM implementation. 
 
2.3.3. Continuous-Time Delta-Sigma Modulator (CT-ΔΣM) 
As mentioned above, in DT-ΔΣMs, high-frequency noise, aliases and blockers at 
multiples of the sampling frequency will fold in band. However, in CT-ΔΣM, the 
situation is quite different. The CT-ΔΣM does not demand a dedicated sampler at the 
ADC input since the loop filter directly process the input signal. The impulse sampling 
operation occurs at the input of the quantizer after the loop filter. In addition, the 
feedback path is operating to convert the quantized impulse into time-domain waveforms 
for integration in the loop filter.  A simple example on the analysis of a first order CT-
ΔΣM is possible based on the time-domain model shown in Fig. 2.18 (a). By splitting the 
feedforward path for the signal and the feedback path for the output impulse sequence, 
the model in Fig. 2.18 (a) can be redrawn as in Fig. 2.18 (b). In the feedback path, based 
on impulse invariant, it is possible to find z-domain filter H(z) equivalent to the original 
s-domain paths including the DAC, s-domain loop filter H(s) and the impulse sampler at 
 23 
the quantizer input. The impulse invariant methodology significantly simplifies the 
analysis and is widely used in the design of CT-ΔΣMs. 
 
1 1/sTs
1
Ts
Z-1
1
E1
Y1
Vin
(a)
(b)
Z-1/(1-Z-1)
1 1/sTs
1
Ts
Z-1
1
E1
Y1
Vin
 
Fig. 2.18. Block diagram of first order CT-ΔΣM: (a) time-domain model; (b) linearized 
time-domain model splitting signal feedforward and output feedback paths. 
 
Compared to DT-ΔΣMs, the main advantages of CT-ΔΣMs [10] are:  
a. Implicit anti-aliasing filter 
b. Noise shaping on S&H 
c. Possibly higher sampling rate 
d. Less digital switching noise 
 24 
For the reasons above, CT-ΔΣMs are generally more favored in mobile/wireless 
applications. However, there are still design challenges mainly in the following aspects: 
a. Sensitive to clock jitter 
b. Sensitive to excessive loop delay (ELD) 
c. Sensitive to DAC waveform error 
d. Less accurate analog transfer function 
2.4. Hybrid ADCs 
Different ADC topologies were briefly discussed highlighting their architecture, 
advantages and issues. Rather than stick to certain fixed ADC topology, designers have 
the option to break the boundary and combine some of the topologies and techniques to 
further improve the ADC performance. 
2.4.1. Pipelined SAR ADC 
SAR ADC architectures are popular for achieving high energy efficiency but they 
suffer from resolution and speed limitations. On the other hand, pipeline ADC 
architectures usually achieve high resolution and speed but have lower energy-efficiency. 
Fig. 2.19 shows one possible combination of the two topologies named pipelined SAR 
ADC [11]. The flash sub-ADC of a conventional pipeline ADC is replaced by a SAR 
sub-ADC. The sub-DAC input sampling capacitor array is also utilized by the SAR sub-
ADC by connecting a single comparator to the top plate of the input capacitor array.  
The problem of sampling path mismatch in conventional pipeline ADC is eliminated 
 25 
because the SAR sub-ADC and the sub-DAC share the same sampling path. The single 
comparator quantizes the static signal stored on the capacitor array, hence aperture error 
is also absent. The sub-DAC capacitor array is also used for sampling purpose and there 
is no need for an active front-end S&H circuit. At the end of the entire successive 
approximating cycle, the voltage left as the input of the comparator is the residue which 
needs to be amplified and processed by later stages. 
 
GainSampling 
CDAC
SAR Logic Digital Calibration
Vin SAR ADC
Dout
 
Fig. 2.19. Pipelined SAR ADC (adapted from [11]). 
 
Fig. 2.19 shows a state-of-the-art implementation of a 13 bit 50 MS/s pipelined-SAR 
ADC in 65 nm CMOS [11], A fully differential ring amplifier with high gain, fast slew 
based charging and an almost rail-to-rail swing is used for a switched capacitor (SC) 
inter-stage residue amplifier to achieve accurate amplification without calibration. A 
floated detect-and-skip (FDAS) witching scheme is introduced to reduce the switching 
energy loss and improve the linearity of the SAR capacitor DAC. The prototype ADC 
achieves signal to noise and distortion ratio (SNDR), signal to noise ratio (SNR) of 70.9 
dB and 71.3 dB, respectively at Nyquist input frequency. 
 
 26 
2.4.2. Noise Shaping SAR ADC 
Although SAR ADCs are highly efficient, comparator noise and other effects limit 
the most efficient operation to below 10 to 12 bit effective number of bits (ENOB). In 
[12], a noise-shaping concept from ΔΣM is introduced to filter out the in-band noise, as 
shown in Fig. 2.20. The noise shaping scheme shapes both comparator noise and 
quantization noise, thereby decoupling comparator noise from ADC performance. The 
loop filter is comprised of a cascade of a two-tap charge-domain finite impulse response 
(FIR) filter and infinite impulse response (IIR) based on a low-quality integrator. 
 
IIRFIR
Main 
CDACVin Dout
 
Fig. 2.20. Noise-shaping SAR using FIR and IIR filter (adapted from [12]). 
 
In [13], a similar noise shaping concept together with mismatch error shaping for the 
capacitor DAC are employed to achieve 105 dB of in-band spurious free dynamic range 
(SFDR) without calibration, as shown in Fig. 2.21. The prototype incorporates the 
concepts of flash, SAR and ΔΣM and can be configured into conventional and 
oversampling modes. 
 
 27 
Vin
Noise 
Shaping
Tri-Level
DWA
Monotonic
MES
R
e
sid
u
e
SARFlash
Dout
MSB LSB
 
Fig. 2.21. Noise and mismatch shaping SAR with segmented DAC (adapted from [13]). 
 
2.4.3. Delta-Sigma Modulator Employing SAR as Quantizer 
Conventionally, flash ADC is typically the choice to implement the quantizer inside 
of ΔΣM. However, replacing the flash quantizer with an a-synchronous SAR quantizer 
has shown potential to further improve the power efficiency of CT-ΔΣM [14], [15]. In 
[14], by using a 6 bit a-synchronous SAR quantizer, the sampling frequency is lowered, 
which reduces the design efforts both in system level and circuit level as shown in Fig. 
2.22. However, as it is not practical to feedback a 64 level DAC signal, a digital delta-
sigma truncator is employed to truncate the feedback into only 9 levels. Compared with 
the same order loop filter without delta-sigma truncation, only 2 dB of 2 dB degradation 
in in-band noise is noticed. 
 
 28 
Loop
Filter
ASAR
ADC
DAC
Vin DoutF(z)
DIGITAL TRUNCATOR
Fs
Fs
L1 
Levels L2 
Levels
 
Fig. 2.22. A-synchronous SAR based CT-ΔΣM with digital delta-sigma truncator 
(adapted from [14]). 
 
The noise shaping SAR concept can also be used to build the quantizer inside a CT-
ΔΣM. Fig. 2.23 shows the block diagram of the modulator using a 4th-order feed-
forward architecture with a 4-bit asynchronous SAR quantizer [15]. The 2nd-order noise 
coupling and ELD structures are integrated into the 4-bit asynchronous SAR. Thus, 6th-
order of total noise shaping is obtained and the measured SNDR of the CT-ΔΣM is 
improved from 53.9 dB to 75.3 dB enabling noise coupling and DAC calibration. 
 
Loop
Filter
Sub-
ADC
DAC
Vin Dout
2nd-order 
noise coupling
Z-1
Quantization Noise
ELD
Ts
 
Fig. 2.23. SAR-assisted CT-ΔΣM with 2nd-order noise shaping (adapted from [15]). 
 29 
2.4.4. Hybrid Delta-Sigma/Pipelined Modulator 
Similar to employing a SAR ADC as the quantizer of a ΔΣM, pipeline ADC can also 
be integrated in a delta-sigma loop and operates as a high-resolution quantizer. In [16], a 
hybrid delta-sigma/pipelined (HDSP) modulator is presented. The proposed modulator 
takes advantage of the high resolution and distributed pipelined quantization and 
combines it with the noise shaping property of a ΔΣM. The latency issue of the pipeline 
ADC is addressed by adding additional feedback paths from the last stage of the pipeline 
ADC to its internal nodes as shown in Fig. 2.24. Since the quantizer noise of the pipeline 
ADC is available in analog form as the residue, it can be used for generating the 
feedback terms. 
 
Loop
Filter
DAC
Vin
STG1
Z-1/2 
STGn-1
Z-1/2 
STGn
Z-1/2 
An A1
Z-1/2 Z-n/2 
Adder Dout
 
Fig. 2.24. Hybrid delta-sigma/pipelined (HDSP) modulator (adapted from [16]). 
 
 
30 
2.4.5. 0-N/N-0 MASH Delta-Sigma ADC 
Instead of allocating a pipeline ADC as the quantizer of ΔΣM, other topologies by 
cascading ΔΣM and Nyquist ADC is presented in [17], [18]. There are basically two 
category of topologies, namely 0-N MASH and N-0 MASH. Fig. 2.25 shows the block 
diagram of a 0-N MASH concept [17].  The first stage is a zero-order quantizer, while 
the second stage is an N-th order single-loop ΔΣM with an internal quantizer. The main 
benefit of 0-N MASH topology is allowing a larger input-signal which results in an 
improvement in the achievable performance and an enhancement in the modulator 
efficiency. In addition, its ability to let the ΔΣM process quantization noise only, relaxes 
the headroom and linearity requirements of its opamps. 
E1
DAC
Vin
Loop
Filter
E2
DAC
Nth-order ΔΣ 
modulator
H1(z)
H2(z)
Dout
Fig. 2.25. The 0-N MASH ΔΣM concept (adapted from [17]). 
 31 
Fig. 2.26 shows a 15 MHz 1-0 MASH ΔΣM with nonlinear memory error calibration 
[18]. A two-tap sequential polynomial derived from an output-referred error analysis 
accurately models the no-ideality of a first-order ΔΣM. The model parameters are 
extracted by correlating various moments of the ADC digital output with a one-bit 
pseudorandom noise (PN) superimposed on the input, largely reducing the circuit 
overhead associated with the nonlinear calibration. The proposed calibration scheme 
effectively improves the third-order intermodulation product (IM3) by 30 dB to 87.1 dBc. 
 
Vin LoopFilter
E1
DAC
a2
E2
a3
E3
a4
E4
a5
E5
CAL1 c5c4c3c2
GAIN ERROR CALIBRATION
Dout
NON-LINEAR INTEGRATOR 
& DAC CALIBRATION
 
Fig. 2.26. The 1-0 MASH ΔΣM with nonlinear memory error calibration (adapted from 
[18]). 
 
2.5. Conclusion 
In this section, various ADC topologies are visited including both conventional 
stand-alone ADC and Hybrid ADC topologies. Each type of ADC has its own unique 
advantage and shortcoming. In image sensing readout applications, implementation 
based on different ADC topologies, like sloping, SAR, pipeline, pipelined SAR, ΔΣM 
 32 
and so on, have been reported in literature. In mobile/wireless communication receivers, 
even though CT-ΔΣM is the current dominant option, there is still fierce competition 
from other ADC topologies, like SAR, pipeline, pipelined SAR etc. In addition, 
processing analog signal in the domains other than voltage, like time, charge, and 
current, also adds more diversity in architectures for the analog-to digital conversion.   
 33 
III. A LOW POWER DIGITIZER FOR BACK-ILLUMINATED 3D-STACKED CMOS 
IMAGE SENSOR READOUT WITH PASSING WINDOW AND DOUBLE AUTO-
ZEROING TECHNIQUES 
 
3.1. Introduction 
Recent innovations on back-illuminated 3D-stacked process for CMOS image 
sensors [19]–[30] have stimulated a new leap forward in the image sensor industry. The 
separation of the conventional image sensor process for pixels and the logic process for 
the readout circuits enables independent performance optimization leading to cost 
reduction. Instead of sticking to an image sensor process, advanced logic technology 
nodes are applicable choices for the image sensor digitizer which infers both power and 
benefits with process scaling. 
 
 
 
 
 
 
* Reprinted, with permission, from “A low power digitizer for back-illuminated 3D-stacked 
CMOS image sensor readout with passing window and double auto-zeroing techniques,” Q. Liu, 
A. Edward, M. Kinyua, E. G. Soenen, and J. Silva-Martinez, IEEE Journal of Solid-State 
Circuits, accepted in Jan 2017 and to be published. © 2017 IEEE.  
  
 34 
The most critical building block of an image sensor digitizer is the analog-to-digital 
converter (ADC). Diverse topologies of ADCs integrated at different levels have been 
reported targeting high-speed, low-power and area-efficient implementation [19], [22], 
[28]–[30], [31]–[39]. Binary-search algorithm-based ADCs like cyclic [30], [31], [32] 
and successive approximation registers (SARs) [35]–[37] are typically faster compared 
with ramp-based ADCs [19], [21], [28], [29], [38], [39] and oversampling ADCs [33], 
[34]. However, cyclic ADCs demand amplifiers with precise gain while SAR ADCs 
require well-matched capacitor digital-to-analog converters (DACs) to achieve high 
resolution. Forcing cyclic/SAR ADCs into a column-parallel array with a small pixel 
pitch results in significant layout complexity and column non-uniformity [35]. 
Furthermore, issues in reference voltage and power supply distribution can degrade far-
end ADC performance [37]. Employing a single chip-level pipelined SAR ADC to read 
out the entire pixel array avoids the area and the analog voltage distribution issues, but 
the frame rate of the image sensor is limited by the speed of the ADC [36]. Column-
parallel oversampling ADCs with noise filtering can achieve low random noise and wide 
dynamic range, but complicated decimation filters are usually demanded. On the 
contrary, single-slope ADCs (SS-ADCs), working in column-parallel array and sharing 
the same ramp signal, have superior fitness for fine-pitch-pixel readout. With only 
comparators in the analog domain, SS-ADCs benefit from the 3D-stacked process 
revolution because of their simplicity. Another advantage of an SS-ADC is its good 
differential non-linearity (DNL) performance without large spikes, which is common 
problem in cyclic/SAR ADCs [37]. In addition, the digital CDS technique has proven 
 35 
effective in SS-ADC, resolvisng a number of limiting issues like resetting KT/C noise, 
pixel and readout FPN, clock skew, and ramp delay [38]. 
However, employing SS-ADC architecture as a digitizer for CMOS image sensor 
application comes with two obstacles. A high-speed SS-ADC demands a fast-counting 
clock, which implies high power consumption in the digital counters lumped to each 
column [38]. In [29], [39], by sharing a global 5-bit least significant bit (LSB) counter 
between every 248 columns and put memory cells in columns, the power consumption 
of the digital counter was reduced. However, if the memory cells in columns are 
implemented as transparent standard-cell latches [29], the continuous refreshing of the 
memory cells before the real latching moment will lead to large amount of power waste. 
Secondly, in the conventional CDS scheme [38], during auto-zeroing (AZ), the offset 
and delay information stored for cancellation purpose only corresponds to small pixel 
information levels. Nevertheless, under bright illumination conditions, the input of the 
comparator [38] crosses at a much lower voltage level which corresponds to a different 
offset and delay information. For the aforementioned reasons, the digital CDS 
subtraction introduces a cancelation error, which results in large column fixed pattern 
noise (FPN) left under bright illumination conditions.  
In this paper, passing window (PW) and double auto-zeroing (AZ) techniques are 
presented to address these two issues. The LSB memory cells in the hybrid digital 
counters of SS-ADCs are only connected to the global buses during an auto-generated 
PW. This allows power saving in both LSB memories and global data bus drivers. First 
attempt of employing double AZ concept to solve bright column FPN was made in this 
 36 
design. The AZ process of comparator is employed twice during reset and signal readout 
phase, respectively, such that the comparator operates as a crossing detector only around 
the same common-mode level. With a double AZ scheme, an effective comparator offset 
and delay cancelation with digital CDS can be achieved even under bright illumination. 
Normally dark image column FPN is more visible than bright column FPN. However, as 
when designers lower down the power supply aiming to save power, the bright column 
FPN performance gets worse and becomes more noticeable. 
The paper is organized as follows: Section II describes the overall architecture of the 
image sensor chip. Section III describes the proposed PW and double AZ techniques. 
The circuit-level implementations of the most relevant building blocks of the digitizer 
are detailed in Section IV. The measurement results are discussed in Section V, and 
Section VI concludes this paper. 
3.2. Sensor Architecture 
A typical CMOS image sensor system consists of a pixel array, a digitizer, a row 
decoder and driver, a timing controller, a phase-locked loop, and a data transfer circuit. 
In a conventional 2D implementation, the pixel array is arranged at the center of the chip 
and the peripheral circuits have to be designed using the same process optimized for the 
pixels. This arrangement limits the performance of the peripheral circuits and degrades 
the silicon area efficiency. The first commercial product of a back-illuminated 3D-
stacked image sensor was reported in [19]. The connection between the interconnect 
layers between the top and bottom parts was realized with through-silicon vias (TSVs) 
 37 
outside the pixel array. In [28]–[30], [40], with direct connection of top and bottom 
wafers with Cu-Cu bonding, the pixel array to chip area ratio is improved because the 
connection position can be under the pixel array. 
Fig. 3.1 (a) shows a back-illuminated 3D-stacked CMOS image sensor with direct 
interconnection between top and bottom wafers [28]. While the design techniques are 
eventually targeting at entire multi-megapixel image sensor readout, in this prototype, 
only 96 columns of readout circuits are implemented as shown in Fig. 3.1 (b), in order to 
focus on the characterization of the digitizer design. The source followers, together with 
switches multiplexing between reset voltage and signal voltage, are employed to mimic 
the pixel operation. No column gain amplifiers are used and the source follower outputs 
are directly connected to the column SS-ADCs. The main counting 1.7 GHz clock is 
generated externally and buffered on chip, while both the timing control and the global 
ramp are generated on chip.  
 
 38 
Ramp
Buffer
Ramp
Gen
C
lo
ck
 B
u
ff
e
r Control
Gen
Source
Follower
Comparator
Digital
Counter
I/O
Global
Counter
VA VB VF
DA DB DF
x16 x16 x16
CLKIN
X6 Groups
...
...
...
...
...
...
...
...
...
IMPLEMENTED
(b)(a)
PIXEL ARRAY
R
O
W
 D
EC
O
D
ER
 &
 D
R
IV
ER
s
TI
M
IN
G
 C
O
N
TR
O
LL
ER
R
A
M
P 
G
EN
G
LO
B
A
L 
CO
U
N
TE
R
PL
L
READOUT 
CIRCUITs
READOUT 
CIRCUITs
DATA TRANSFER CIRCUIT
DATA TRANSFER CIRCUIT
 
Fig. 3.1. Block diagram of (a) 3D image sensor chip; (b) 96-column digitizer prototype. 
 
3.3. Proposed Techniques 
The column-parallel digital counters and comparators are critical blocks for 
digitizing the analog pixel information level into digital numbers (DNs). The two 
proposed techniques, passing window (PW) and double auto-zeroing (AZ), are dedicated 
to improving the performance of digital counters and comparators, respectively.  
3.3.1. Passing Window Compatible with CDS 
In the conventional SS-ADC, the ripple counter topology [38] is a popular choice for 
the digital counter. First, the architecture is simple with only a single line of cascaded D-
flip flops resulting in an area efficient solution. Second, no synchronization circuits are 
demanded. Thirdly, a ripple counter capable of doing both up and down counting makes 
 39 
digital CDS subtraction possible in every column. However, a main issue with ripple 
counter topology in high-speed SS-ADCs is that the lower-bit D-flip flops are always 
counting at high frequency until the STOP signal from the comparator is generated.   
To address this issue, a hybrid column counter scheme was proposed first in [39]. 
The digital counter is divided into two parts: 1) a lower 5-bit global counter shared by 
248 column memories and 2) an upper 9-bit column-based ripple counter. In this 
partitioning manner, replacing 5-bit continuous running D-flip flops with memory cells 
sharing one global counter, power can be saved. However, if the memory cells in 
columns are implemented as transparent standard-cell latches [29], before the 
comparator output flips, the high-speed LSB data buses are always refreshing the 
memory cells. As the comparator output can flip at any time, the memory cells in [29] 
needs to be connected to the global LSB counter bus all the time before the latching 
moment. The power needed for charging and discharging the capacitors in the memory 
cells is wasted, as the relevant counting information is only captured at the latching 
instant STOP as shown in Fig. 3.2 (a). Only after the latching signal is generated can the 
memory cells be latched to the data and saved for readout before the next horizontal scan. 
Fig. 3.2 (b) shows the improved latching scheme with the proposed PW technique. 
The PW digital switch disconnects the memory cells from the global data bus during 
non-relevant timing which not only allows power minimization in the column memories, 
but also enables power savings due to the lower capacitance driven by the data bus 
drivers. It is possible to generate a PW based on delaying the comparator output [20] as 
shown in Fig. 3.2 (b). However, with this method, the real latching moment is also 
 40 
delayed which introduces a PVT dependent offset and affects the performance of the SS-
ADC. In [28], a look-ahead (LA) circuit consisting of a dynamic comparator (D-CMP) 
and logic in each column together with a globally shared early ramp generator, was 
proposed to cut unnecessary power consumed by the static comparator (S-CMP) and 
digital counter. This approach, however, requires complex circuitry. Moreover, the 
power saved through gating the high-speed clock until decision of D-CMP is limited.  
 
LSB Global
Counter
D<0:3>
LSBs
COMP
VRAMP VPIXEL
STOPBus Drivers
Q<0:3>
D<0>
Continuously Refreshing (Wasted Power)
VRAMP
VPIXEL
STOP (Comparator Output)
Q<0>
Latching Moment
(a) 
VRAMP
D<0>
VPIXEL
STOP (Comparator Output)
Q<0>
PW (Passing Window)
No Refreshing (Saved Power)
Latching Moment
PASS
(b) 
LSB Global
Counter
D'<0:3>
LSBs
COMP
VRAMP VPIXEL
STOP
Q<0:3>
Bus Drivers PW Gen
D
PW
 
Fig. 3.2. Global counter latching scheme: (a) without passing window; (b) with passing 
window. 
 
In this design, the hold-and-go counting scheme as defined in [41] is adjusted to 
generate the PW, which makes the proposed PW technique fully compatible with the 
digital CDS. Fig. 3.3 (a) and (b) show the proposed CDS timing scheme of the SS-ADC 
within one horizontal scan under bright and dark illumination condition, respectively. 
The column-parallel digital counter contains a FLAG generation circuit, a reset counter 
 41 
and a signal counter composed of LSB memories and a most significant bit (MSB) 
counter. The conventional digital CDS operation demands the following procedures: 1) 
Quantize the reset information corresponding to T1 in the reset readout phase (Φrst); 2) 
Quantize the signal information corresponding to T3 in the signal readout phase (Φsig); 3) 
Subtract the reset information (T1) from the signal information (T3) in the digital 
domain. In the proposed CDS scheme, by splitting T into T1 and T2, T3 into T and T4, 
the real pixel information level can be expressed as: 
 
Tpixel = T3 − T1 = (T4 + T) − (T − T2) = T4 + T2 (3.1) 
 
where T is the full counting time of the reset counter, T2 is the time left for the reset 
counter to count to full starting from T1. T4 is defined as the time difference from the 
moment when the signal counter start counting to the moment when the comparator 
output flips during Φsig. Based on this definition, T4 is always equal to T3 − T and can 
become negative when T3 < T under dark illumination condition as shown in Fig. 3.3 (b). 
Thus instead of quantizing T1 and T3 separately, single capture of T4 + T2 in the signal 
counter is possible following the procedures below: 
1) During Φrst, the reset counter is enabled at the same moment that the ramp starts and 
stops when VRAMP crosses with VPIXELrst. T1 is then saved on the reset counter. 
2) During Φsig, the signal counter and the LSB global counter start counting time of T 
later than the start of the ramp. When VRAMP crosses with VPIXELsig, the reset 
counter resumes counting from saved T1. The PW switches are then closed allowing 
 42 
the LSB data buses to refresh the LSB memory cells in the signal counter. 
3) During Φsig, the state when the reset counter counts to its full range is detected by the 
FLAG generation circuit. The generated DATASTOP signal opens the PW switches 
and disconnects the LSB memory cells from the LSB data buses. As the LSB 
memory cells are latched, the cascaded MSB counter also stops.  
After finishing the procedure above, the targeted digitized pixel information level 
after CDS corresponding to time T4 + T2 is stored in the signal counter. The real 
latching moment is defined by the FLAG generation, and time T2 after the comparator 
output flips. Thus, the PW of T2 is created automatically using the comparator output 
together with the DATASTOP signal without introducing any additional offset or 
complex digital circuitry. The PW technique is proposed to save the power spent in 
driving and refreshing the column memory cells, and thus more effective for bright 
illumination condition when more time of driving and refreshing is needed before the 
latching moment. Under dark illumination, if the demanded counting time is smaller 
than the passing window as the case shown in Fig. 3.3 (b), the power saving benefit of 
the proposed PW technique is less compared to the case of bright illumination. 
 
 43 
COMP
VRAMP VPIXEL
RESUME
D
A
T
A
ST
O
P
RESET
COUNT
ENABLE
D
'<
0
:3
>
P
W
 G
e
n
D
<
0
:3
>M
SB
C
O
U
N
T
ER
Q<3>
To Next Column
9
4
Q<0:12>
LSB Global
Counter
SIGNAL
COUNT
ENABLE
 
LSB
MEMORY
SIGNAL
COUNTER
8
RESET
COUNTER
FLAG GEN
M
C
LK
SIGNAL COUNT
RESET COUNT
VRAMP
VPIXEL
Full
COMPARATOR 
OUTPUT
(RESUME)
DATASTOP
RESET READOUT PHASE 
(Φrst)
SIGNAL READOUT PHASE
(Φsig)
T1 T2
T T
T2T3
-T4
Passing Window
Latch Data Memory
RAMP ENABLE
SIGNAL COUNT 
ENABLE
RESET COUNT 
ENABLE
PIXELrst, Φrst
PIXELsig2, Φsig2 
=
(b)
COMP
VRAMP VPIXEL
RESUME
D
A
T
A
ST
O
P
RESET
COUNT
ENABLE
D
'<
0
:3
>
P
W
 G
e
n
D
<
0
:3
>M
SB
C
O
U
N
T
ER
Q<3>
To Next Column
9
4
Q<0:12>
LSB Global
Counter
SIGNAL
COUNT
ENABLE
 
LSB
MEMORY
SIGNAL
COUNTER
8
RESET
COUNTER
FLAG GEN
M
C
LK
SIGNAL COUNT
RESET COUNT
VRAMP
VPIXEL
Full
COMPARATOR 
OUTPUT 
(RESUME)
DATASTOP
RESET READOUT PHASE 
(Φrst)
SIGNAL READOUT PHASE
(Φsig)
T1 T2
T T
T2T3
T4
Passing Window
Latch Data Memory
RAMP ENABLE
SIGNAL COUNT 
ENABLE
RESET COUNT 
ENABLE
PIXELrst, Φrst
PIXELsig1, Φsig1 
=
(a)
 
Fig. 3.3. CDS timing scheme for one horizontal scan: (a) under bright illumination 
condition; (b) under dark illumination condition. 
 
3.3.2. Double Auto-Zeroing 
In a conventional CDS scheme, the AZ of the comparator is employed only once in 
 44 
Φrst as shown in Fig. 3.4 (a) [38], [42]–[45]. A modified implementation of the 
comparator in Fig. 3.4 (a) with a fully differential first stage to avoid systematic offset is 
shown in Fig. 3.4 (b). One critical drawback of the topologies shown in Fig. 3.4 (a) and 
(b) is that the offset and delay information of the comparator saved in the capacitors for 
cancellation purpose during Φrst only corresponds to dark illumination conditions. 
During Φsig, the comparator makes the critical decision at very different voltage levels. If 
the offset and delay of analog comparators vary much with its input voltage crossing 
level, the conventional CDS scheme is not quite effective, especially under bright 
illumination conditions, when the difference between pixel reset voltage and pixel signal 
voltage can be full-scale. The imperfect digital CDS induced cancellation error results in 
large column fixed pattern noise (FPN) left under bright illumination condition. Even 
though shot noise and pixel FPN dominate, column FPN can be noticeable to human 
eyes, especially when the comparator input common-mode range is very limited under 
low power supply. 
Designing a low-power comparator under low supply voltage with a constant offset 
and constant time delay over the entire pixel information range in the presence of 
process-voltage-temperature (PVT) variations can be challenging and power demanding. 
Instead, an improved topology as shown in Fig. 3.4 (c) [29] was proposed to avoid this 
problem by maintaining the input crossing point of the comparators always at AZ level. 
However, the charge sharing structure demands well matched capacitor ratios of C1 and 
C2 among columns to achieve low column FPN. The charge recombination present at 
node VCP reduces the error signal by 6dB leading to SNR degradation. Finally, even 
 45 
though a dynamic inverter-based comparator in [29] offers the benefit of simplicity and 
almost zero static power, the single-ended circuit is very sensitive to substrate and 
supply noise sources and PVT variations. 
 
(d)
(c)
(b)
(a)
VO
STAGE 1 STAGE 2
+  -
-  +
+  
-  
C1
C2
PHI1
PHI2
PHI2E
PHI2E
VRAMP
VPIXEL
COMPARATOR
VCP
VCN
COMP
-
+
C1
PHI2
VRAMP
VPIXEL
COMPARATOR
VCN VO
VCP
VO
STAGE 1 STAGE 2
+  -
-  +
+  
-  
C1
C2
PHI2
PHI2
VRAMP
VPIXEL
COMPARATOR
VCP
VCN
VO
INV
C1
C2
PHI2
VRAMP
VPIXEL
COMPARATOR
VCP
VPIXEL(Φrst)=VPIXELrst
VPIXEL(Φsig)=VPIXELsig
PHI1
PHI2
VCP
VCN
PHI2E
VRAMP
VPIXEL
Vcm
PHI2
VCP
VCN
VRAMP
VPIXEL
Vbk
Vcm
PHI2
VCP
VCN
VRAMP
VPIXEL
Vbk
Vcm
Vcm
PHI2
VCP
VRAMP
VPIXEL
Vbk
VPIXEL(Φrst)=VPIXELrst
VPIXEL(Φsig)=VPIXELsig
VPIXEL(Φrst)=VPIXELrst
VPIXEL(Φsig)=VPIXELsig
VPIXEL(Φrst)=VPIXELrst
VPIXEL(Φsig)=VPIXELsig
Φrst Φsig
Φrst Φsig
Φrst Φsig
Φrst Φsig
 
Fig. 3.4. Column comparator topologies. 
 
In the proposed design, a double AZ scheme is employed to resolve the comparator 
 46 
input crossing point issue while avoiding the additional issues present in the topology as 
shown in Fig. 3.4 (c). In Fig. 3.4 (d), the AZ process is employed twice, once during Φrst 
and the other during Φsig. During the first AZ phase, the left terminal of C1 is connected 
to VPIXEL, while the right terminal is connected to common-mode voltage Vcm defined 
by the comparator with its first stage in unity gain feedback. The voltage across the 
capacitor, VPIXELrst − Vcm, together with the non-idealities to cancel is saved on C1. 
After the first AZ phase, the left terminal of C1 is switched to connect to VRAMP. VCP 
will follow the ramping down of VRAMP until it crosses with Vcm and then the 
comparator output flips. During the second AZ phase, the procedure is similar to that in 
the first AZ phase. The only difference is that now the voltage across the capacitor, 
VPIXELsig − Vcm, together with the non-idealities is saved on C1. After the second AZ 
phase, the left terminal of C1 is switched to connect to VRAMP, and the comparator 
output flips again when VCP crosses with Vcm. After digital CDS subtraction, the real 
pixel information level is VPIXELrst − VPIXELsig with the system non-idealities well 
cancelled. With this method [46]–[48], the comparator only needs to be optimized 
around the input common-mode Vcm which accommodates low supply voltage design in 
advanced technology nodes. The matching of C1 and C2 is less critical since the 
proposed approach does not rely on charge recombination; however matching is still 
desirable for PSRR considerations. As the un-correlated sampling noise power gets 
doubled at final output after subtraction of the two samples converted during Φrst and 
Φsig, respectively, C1 and C2 still needs to be sized large enough based on KT/C noise 
requirements.  
 47 
3.4. Circuit Implementation 
3.4.1. Pixel Source Follower / Ramp Buffer 
To emulate one column of active pixel sensors (APS) in a multi-mega pixel array, a 
pixel source follower as shown in Fig. 3.5 (a) is built with eight source follower unit 
cells, a column bias current mirror circuit, together with an additional capacitor of 8 x 64 
fF added to mimic the loading from the rest of the source follower unit cells which are 
not implemented in this prototype. The source follower unit cell consists of a source 
follower transistor, Nsf, a row selection transistor, Nrow, controlled by ROW<0:7>, and 
the input selection multiplexer. RST and TX, which are two non-overlapping digital 
signals that control the transmission gate multiplexer selecting VPIXELrst or 
VPIXELsig during Φrst or Φsig clock phases, respectively. The same topology in Fig. 3.5 
(b) enables the global ramp buffer (one for all columns) to track the pixel source 
follower over PVT and therefore reduce the demanded reset counting range and improve 
the INL performance of the digitizer. For the global ramp buffer, the multiplexer always 
passes the RAMP to the gate of Nsf and the same ROW<0:7> controls which source 
follower unit cell to turn on.  
 
 48 
IBIAS
VPIXEL
Nbias
ROW<0:7>
Nsf
Nrow
RSTB
RST
TXB
TX
VPIXELrst
VPIXELsig
X 8
Cp
(a)
RAMP
VPIXELrst
IBIAS
VRAMP
Nbias
ROW<0:7>
Nsf
Nrow
Low
High
High
Low
X 8
Cp
(b)
Nmux
Pmux
W/L
(um/um)
Nsf 2/0.8
Nrow 2.4/0.27
Nbias 3/2
Nmux 0.4/0.4
Pmux 0.4/0.4
MOS
 
Fig. 3.5. (a) Pixel source follower; (b) Ramp buffer. 
 
3.4.2. PW-based Hybrid Digital Counter 
The implementation and timing scheme of the proposed PW-based hybrid digital 
counter introduced in Fig. 3.3 are detailed in Fig. 3.6 (a) and (b), respectively. The 
hybrid digital counter is built with a FLAG generation circuit and two counters 
containing an 8-bit reset counter and a 12-bit signal counter. The signal counter consists 
of a 4-bit LSB global counter combined with in-column LSB memory cells and 9-bit in-
column MSB ripple counters. This segmentation of the signal counter is good enough for 
power saving considering that the 5th-bit LSB of the conventional counter counts at a 
low speed that is 1/32 of its 1st-bit LSB counterpart. 
 
 49 
(a)
LSB COUNT<1>
LSB COUNT<0>
LSB COUNT<2>
LSB COUNT<3>
(b)
(c)
COMPARATOR 
OUTPUT
DATASTOP
RESET READOUT PHASE 
(Φrst)
SIGNAL READOUT PHASE
(Φsig)
SIGNAL COUNT 
ENABLE
MCLK
PW<0:2>
PW<3>
R<0>
FF
D
QB Q
 
 
R<7>
FF
D
QB Q
 
 
FF
D
QB Q
 
 
R<1>
FF
D
QB Q
 
 
R<2>
SIGNAL COUNT ENABLE
LSB Global
Counter
MCLK 
LSB COUNT<0:3>
COMPARATOR OUTPUT
FF
D
QB Q
 
 
D<12>
FF
D
QB Q
 
 
D<4>
D
A
TA
ST
O
P
 M
D Q
 
D<0>
M
D Q
 
D<1>
M
D Q
 
D<2>
M
D Q
 
D<3>
 
 
 
 
 
RESET COUNT SIGNAL COUNT
 
 
 
 
P
W
<0
:2
>
P
W
<3
>
FLAG GEN
R
<7
>
R
<6
>
R
<0
>
          
FF
D
Q
 
G<0>
G<1>
FF
D
Q
 
FF
D
Q
 
FF
D
Q
 
G<2>
G<3>
          
          
          
M
C
LK
SIGNAL COUNT
ENABLE
LSB Global Counter
 
 
 
HALF
ADDER
          
FF
D
Q
MCLK
  
 
DATASTOP
M
D Q
 
D<0>
FF
D
QB Q
 
 
R<0> P
W
<0
> 
&R<1:7>
 
COMPARATOR OUTPUT
LSB COUNT<0:3>
Tclkq
Tclkq
LATCHTd1
Td2
 
Fig. 3.6. Proposed digital counter: (a) architecture; (b) timing scheme; (c) delay paths. 
 
For the global counter, synchronous grey-code and delay-line based counter 
topologies are welcomed for their property wherein only a single bit changes from count 
to count [29], [43]. This feature enables power savings as the frequencies of LSB data 
buses are lowered. However, the proposed PW technique can push this benefit to its 
extreme by avoiding all unnecessary data transfer outside of predetermined small PW as 
shown in Fig. 3.6 (b). In this design, a simple but robust synchronous binary counter 
based on half-adders and D-flip flops from a standard cell library is employed as the 
LSB global counter. As shown in Fig. 3.6 (c), DATASTOP is generated using the LSB 
D-flip flop of the ripple counter which is already synchronized with the master clock 
(MCLK); hence a power-consuming circuit to synchronize the comparator output is 
avoided. However, to ensure a proper latching moment of LSB data into memory cells, 
the delay of the two paths, data signal path in blue and latch signal path in red, still needs 
 50 
to be properly controlled. The delay of buffers for four LSB data buses is made 
adjustable to compensate for the delay of the logic circuits which only exists in the red 
path. In addition, special care has been taken on the 520µm routing of MCLK together 
with the four LSB data buses to maintain their delay to the same column, Td1 and Td2, 
close to each other. Multiple global counters are required for readout of a complete 
multi-mega pixel readout circuits because the delay matching of clock and data buses at 
different frequencies cannot be guaranteed for long distance routing.   
The reset counter is implemented using a ripple counter topology for the benefits 
explained in Section III. To cover the PVT variations on VPIXELrst, the reset counter 
needs resolution of 8 bit which corresponds to 2.55 us in time domain with T1 and T2 
arranged to be equal. Since the reset counting time T is short (256 cycles), the power 
consumption is only a small portion of the entire digital counter, unlike the signal 
counter (4096+256 cycles). Instead of using a latch for its LSB counting [38], the reset 
counter still employs a D flip-flop for two reasons: 1) it offers better delay matching 
between the red and blue paths shown in Fig. 3.6 (c); 2) The latch can result in meta-
stability issues depending on the comparator output flipping moment [29]. The outputs 
of the reset counter are sent to a cascading line of AND gates and DATASTOP signal is 
generated when the counter is at full count.  
As shown in Fig. 3.6 (b), PW<0:2> generated using comparator output together with 
DATASTOP is applied to control the connection of the last 3-bit LSB memory cells in 
the signal counter to their corresponding data buses. PW<3> for controlling the 
connection of the 4th-bit LSB memory cell to its data bus needs to be generated based on 
51 
SIGNAL COUNT ENABLE and DATASTOP. This is because the memory cell needs to 
be all transparent to its input until DATASTOP is achieved so as to trigger and stop the 
column-based MSB ripple counter.  
3.4.3. Double AZ Comparator 
Fig. 3.7 shows the detailed implementation of the proposed comparator with double 
AZ scheme introduced in Section III Part B. The topology in Fig. 3.4 (d) is modified by 
connecting the left terminal of C2 to VPIXEL instead of ground to make the comparator 
fully symmetrical. Non-overlapping PHI1 and PHI2 controlled dummy switches are 
added between C2 and VPIXEL to reduce non-idealities introduced by clock 
feedthrough and charge injection. The first stage of the comparator is implemented using 
a fully differential topology [41]. During the AZ phase, the comparator in unity gain 
feedback operates at a nominal common-mode voltage of 0.93 V, which is well-defined 
by the tail current and diode connected transistors N1-N4. The differential gain of the 
first stage is around 29.4 dB achieved by the cross-couple transistors N1 and N2. The 
high gain in the first stage relaxes the offset and noise requirement of the second stage. 
The first stage consumes 16 µA current from a 2 V analog supply and achieves a 
bandwidth of around 8.46 MHz, which enables: 1) fast tracking and settling during the 
AZ phase and 2) small delay variation at different input crossing levels. Since only the 
first stage with a single pole at output is placed in closed loop during the AZ phase, 
closed loop stability is not an issue. The second stage possesses a lower bandwidth to 
help filter out some high frequency noise. The overall gain and bandwidth of the analog 
 52 
comparator is around 66.7 dB and 3.55 MHz, respectively.  
 
VRAMP
VPIXEL
PHI1
PHI2
PHI2E
VB
VIN VIP VO
P0
P1 P2
P3 P4
N1N3 N4N2N5 N6
STAGE 1VO
STAGE 1 STAGE 2
+  -
-  +
+  
-  
C1
C2
PHI1
PHI2
PHI2
PHI1
PHI2E
PHI2E
VRAMP
VPIXEL
VPIXEL
VPIXEL
COMPARATOR
VPIXEL(Φrst)=VPIXELrst
VPIXEL(Φsig)=VPIXELsig
Φrst Φsig
600 fF
(W/L) N3 = 0.9um/3um
(W/L) N1 = 0.72um/3um
 
Fig. 3.7. Proposed comparator with double AZ scheme. 
 
To validate the concept of the proposed double AZ method, two test-benches 
simulating RCCC-extracted view of the transistor level comparator including all 
components shown in Fig. 3.7 are run under various timing schemes. In the first test-
bench shown in Fig. 3.8, the AZ process is done only once for a single readout without 
CDS. The single AZ happens during Φrst as shown in Fig. 3.8 (a); thus, the VCP/VCN 
does not cross at Vcm when realizing comparing the two signals. While in Fig. 3.8 (b), a 
single AZ is completed during Φsig, and this time, when decision is made, the VCP/VCN 
crossing happens at Vcm. As shown in Fig. 3.8 (c), the equivalent column FPN, caused 
by the comparator delay variations alone under different pixel information levels is 
characterized based on Monte-Carlo simulations with a global variation model. The 
column FPN for Fig. 3.8 (a) goes up to 0.83% (33.91 LSB) and 0.34% (14.11 LSB) 
 53 
under an input signal level of 0.8 V and 0.7 V, respectively. This is caused by the fact 
that when a very low crossing voltage is applied to the comparator input, the differential 
pair consisting of P1 and P2 is too close to the triode region. Sample images under bright 
illumination with 1% pixel FPN, 1.56% shot noise, together with column FPN of 0.83% 
and 0.34% are shown in Fig. 3.8 (d) and the vertical strips are clearly noticeable. 
Nevertheless, the column FPN for Fig. 3.8 (b) stays below 3.86LSB throughout the 
entire input signal range benefiting from the constant VCP/VCN crossing level. The 
second test-bench introduces the effect of CDS to the first test-bench as shown in Fig. 
3.9. In Fig. 3.9 (a), a single AZ is employed in Φrst and the topology becomes the same 
as shown in Fig. 3.4 (b); thus, VCP/VCN crosses at Vcm for the reset readout, while 
remaining at a signal-dependent level for the signal readout. On the contrary, in the 
proposed double AZ scheme as shown in Fig. 3.9 (b), VCP/VCN always crosses at Vcm. 
Fig. 3.9(c) shows that the column FPN is improved from 0.65% (26.78 LSB) to 0.004% 
(0.163 LSB) at the 0.8 V input level using the proposed double AZ method. Sample 
images in Fig. 3.9 (d) further illustrate the effect of the proposed method in cancelling 
out the noticeable strips under bright illumination conditions. With the help of CDS, the 
dark column FPN is improved from around 3.9 LSB to below 0.3 LSB. 
 
 54 
(b)
(a)
0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8
0
12
24
36
INPUT SIGNAL LEVEL (V)
F
P
N
 (
L
S
B
)
 
 
FPN for (a)
0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8
3.78
3.82
3.86
3.9
INPUT SIGNAL LEVEL (V)
F
P
N
 (
L
S
B
)
 
 
FPN for (b)
3.890LSB
33.91LSB
3.853LSB
3.805LSB
(c)
Column FPN of 0.83% (33.91LSB)
Pixel FPN of 1%
Shot noise of 1.56%
Column FPN of 0.34% (14.11LSB)
Pixel FPN of 1%
Shot noise of 1.56%
(d)
VCP
VCN
VRAMP
VPIXEL
Vcm
PHI1
PHI2
PHI2E
PHI1
PHI2
VCP
VCN
PHI2E
VRAMP
VPIXEL
Vcm
VPIXEL(Φrst)=VPIXELrst
VPIXEL(Φsig)=VPIXELsig
VPIXEL(Φsig)=VPIXELsig
Φrst Φsig
Φsig
VO
STAGE 1 STAGE 2
+  -
-  +
+  
-  
C1
C2
PHI1
PHI2
PHI2E
PHI2E
VRAMP
VPIXEL
VCP
VCN
PHI1
PHI2
VPIXEL
VPIXEL RCCC_EXTRACTED
VO
STAGE 1 STAGE 2
+  -
-  +
+  
-  
C1
C2
PHI1
PHI2
PHI2E
PHI2E
VRAMP
VPIXEL
VCP
VCN
PHI1
PHI2
VPIXEL
VPIXEL RCCC_EXTRACTED
 
Fig. 3.8. Column FPN caused by comparator delay variations without CDS: (a) 
VCP/VCN does not cross at comparator common-mode voltage Vcm; (b) VCP/VCN 
crosses at comparator common-mode voltage Vcm; (c) column FPN under various input 
signal level for (a) and (b); (d) sample images with column FPN of 0.83% (33.91 LSB) 
and 0.34% (14.11 LSB) respectively.  
 
 55 
(b)
(a)
VO
STAGE 1 STAGE 2
+  -
-  +
+  
-  
C1
C2
PHI1
PHI2
PHI2E
PHI2E
VRAMP
VPIXEL
VCP
VCN
PHI1
PHI2
VPIXEL
VPIXEL RCCC_EXTRACTED
0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8
0
9
18
27
INPUT SIGNAL LEVEL (V)
F
P
N
 (
L
S
B
)
 
 
FPN for (a)
0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8
0.08
0.12
0.16
0.2
INPUT SIGNAL LEVEL (V)
F
P
N
 (
L
S
B
)
 
 
FPN for (b)
0.234LSB
26.78LSB
0.107LSB
0.163LSB
Column FPN of 0.65% (26.78LSB)
Pixel FPN of 1%
Shot noise of 1.56%
Column FPN of 0.004% (0.163LSB)
Pixel FPN of 1%
Shot noise of 1.56%
(c) (d)
VO
STAGE 1 STAGE 2
+  -
-  +
+  
-  
C1
C2
PHI2E
PHI2E
RCCC_EXTRACTED
VCP
VCN
PHI1
PHI2
PHI1
PHI2
VRAMP
VPIXEL
VPIXEL
VPIXEL
PHI1
PHI2
VCP
VCN
PHI2E
VRAMP
VPIXEL
Vcm
VCP
VCN
VRAMP
VPIXEL
Vbk
Vcm
PHI1
PHI2
PHI2E
High 
Low 
VPIXEL(Φrst)=VPIXELrst
VPIXEL(Φsig)=VPIXELsig
VPIXEL(Φrst)=VPIXELrst
VPIXEL(Φsig)=VPIXELsig
Φrst Φsig
Φrst Φsig
 
Fig. 3.9. Column FPN caused by comparator delay variations with CDS: (a) VCP/VCN 
does not cross at comparator common-mode voltage Vcm; (b) VCP/VCN crosses at 
comparator common-mode voltage Vcm; (c) column FPN under various input signal 
level for (a) and (b); (d) sample images with column FPN of 0.65% (26.78 LSB) and 
0.004% (0.163 LSB) respectively. 
 
 56 
3.4.4. Ramp Generator 
The ramp generator in this design is implemented using a current-charging-capacitor 
topology as shown in Fig. 3.10. A switch controlled by the RAMP ENABLE signal is 
used to pull the RAMP voltage to its top value when necessary. The capacitor is 
implemented using metal-oxide-metal (MOM) topology for its good linearity 
performance at the cost of relatively lower area efficiency. The current source is digitally 
controllable with 4-bit binary-weighted current branches for ± 100% current control to 
compensate for the slope variations with PVT. The ‘High’ voltage controlling the 
differential switches of the current branches is designed to be 1.8 V, so as to achieve 
high output impedance by cascading three transistors in saturation. A distributed RC 
filter is needed in front of the gate of the tail transistor in the current branches to filter 
out the noise from the bias circuitry.  
 
RAMP ENABLE
RAMP ENABLEB
RAMPTOP
RAMP
IRAMP
Cmom
Cg
RAMP BUFFER
VRAMP
SF
IBtIBc
RB<0>RBB<0> RB<3>RBB<3>
IRAMP
NBc0
NBc1
NBc2
X16 X1
NBt0
NBt1
Nt0
Nc0
Nt3
Nc3
Nsl0 Nsr0 Nsl3 Nsr3
 
Fig. 3.10. Proposed ramp generator. 
 
 
 57 
3.5. Measurement Results 
The image sensor digitizer prototype is fabricated in 40 nm low-power CMOS 
process, and Fig. 3.11 shows the microphotograph of the chip. The SS-ADC array of the 
96-column digitizer is implemented within a column pitch of 5.4 µm and a vertical 
length of 325 µm and 290 µm for the comparator and digital counter, respectively. 
 
Source Follower
Comparator
Digital
Counter
Readout
Ramp
Buffer
Ramp
Gen
Control
Gen
Clock
Buffer
3
25
µ
m
2
90
µ
m
1
.4
m
m
1.4mm
 
Fig. 3.11. Microphotograph of image sensor digitizer chip. 
 
By emulating active pixels using external voltage sources together with on-chip pixel 
 58 
source followers and timing control, the entire readout chain including the pixel source 
follower and the digitizer is tested based on a single die in SS corner at room 
temperature. One horizontal scan for the 12-bit digitizer is 6.02 µs, while the proposed 
PW-based hybrid digital counter counts at 1.7 GHz. Even though the double AZ timing 
scheme is fixed and not possible to bypass it in the implemented prototype, by setting 
VPIXELrst higher than VRAMP during the entire phase Φrst, the comparator will not flip 
and thus the reset read out is disabled. Benefiting from this setup, we are able to get 
measured data under no CDS condition for comparison purposes. 
Fig. 3.12 shows the measured random noise performance of the designed image 
sensor readout array with and without the proposed CDS scheme, shown in red and blue, 
respectively. The random noise with CDS scheme is roughly √2 times higher than the 
same condition but without CDS over the entire input signal range, which is attributed to 
its double sampling behavior.  Under a dark illumination condition, the random noise 
voltages through the entire readout chain are 261.5 µV and 202 µV with and without 
CDS, respectively. The dynamic range of the readout chain is calculated to be 71.8 dB 
without any column gain stages under CDS scheme. 
 
 59 
0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8
0.7
0.8
0.9
1
1.1
1.2
1.3
INPUT SIGNAL LEVEL (V)
N
O
IS
E
 W
/ 
C
D
S
 (
L
S
B
)
 
 
0.7
0.8
0.9
1
1.1
1.2
1.3
N
O
IS
E
 W
/O
 C
D
S
 (
L
S
B
)
NOISE W/ CDS
NOISE W/O CDS
0.99LSB
1.20LSB
1.05LSB
0.81LSB
 
Fig. 3.12. Measured random noise performance. 
 
0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8
0
0.2
0.4
0.6
0.8
1
1.2
1.4
1.6
INPUT SIGNAL LEVEL (V)
F
P
N
 W
/ 
C
D
S
 (
L
S
B
)
 
 
6.6
6.7
6.8
6.9
7
7.1
7.2
7.3
7.4
F
P
N
 W
/O
 C
D
S
 (
L
S
B
)
FPN W/ CDS
FPN W/O CDS
0.175%
0.166%
0.028%
0.0024%
 
Fig. 3.13. Measured column FPN performance. 
 60 
Fig. 3.13 shows the measured column FPN performance of the designed image 
sensor readout array with and without the CDS scheme, shown in red and blue, 
respectively. In both cases, the crossing happens at the AZ level, and the variation of 
column FPN with input signal level is very small. Combining the proposed double AZ 
scheme with the CDS scheme, the dark and bright column FPN performance of 0.0024% 
and 0.028% is achieved in this design, respectively. Fig. 3.14 (a) and (b) show the DN 
distribution behavior of the 96 readout chains with a unified input level under dark 
illumination and bright illumination, respectively. By controlling ‘SIGNAL COUNT 
ENABLE’ independently, the design can be improved to tolerate the condition when the 
image sensor has negative output level due to random noise. 
 
(b)(a)
0 10 20 30 40 50 60 70 80 90 100
-0.1
-0.05
0
0.05
0.1
DIGITAL CODE DISTRIBUTION
COLUMNS
C
O
L
U
M
N
 F
P
N
 (
%
)
3922 3924 3926 3928 3930
0
10
20
30
40
50
CHANNEL FPN HISTOGRAM
DIGITAL NUMBER (DN)
H
IT
SIGMA = 1.15 LSB, MEAN = 3925.96
0 10 20 30 40 50 60 70 80 90 100
-0.01
0
0.01
DIGITAL CODE DISTRIBUTION
COLUMNS
C
O
L
U
M
N
 F
P
N
 (
%
)
3 4 5 6 7 8
0
30
60
90
120
150
CHANNEL FPN HISTOGRAM
DIGITAL NUMBER (DN)
H
IT
SIGMA = 0.10 LSB, MEAN = 5.22
 
Fig. 3.14.  Measured column FPN under condition of (a) dark illumination; (b) bright 
illumination. 
 
Fig. 3.15 shows the measured linearity performance of the near-end and far-end 
 61 
column readout chain, shown in red and blue, respectively. The far-end column is 
expected to be the worst case with DNL and INL of +0.32/−0.28 LSB and +4.21/−0.94 
LSB, respectively. The INL performance is believed to be limited by the linearity of the 
ramp. Even though linear MOM caps are used as the main capacitor for current 
charging, the non-linear gate capacitance from the ramp buffer degrades the INL 
performance. 
 
0 1000 2000 3000 4000
-0.4
-0.2
0
0.2
0.4
DIGITAL NUMBER (DN)
D
N
L
 (
L
S
B
)
0 1000 2000 3000 4000
-1
0.5
2
3.5
5
DIGITAL NUMBER (DN)
IN
L
 (
L
S
B
)
Near End
Far End
 
Fig. 3.15. Measured linearity performance. 
 
Fig. 3.16 illustrates the power consumption of the column-parallel digital counter. In 
the conventional hybrid counter topology [29], [39], the most power demanding 
component is the signal counter which consumes 23.48 µW under bright illumination 
 62 
conditions. Nevertheless, with the proposed PW technique, this power can be reduced to 
8.88 µW under the same condition. A power saving of 52.8% in the entire hybrid digital 
counter is obtained using the proposed PW technique without introducing any additional 
circuitry. As the design is not reconfigurable to disable PW, the detailed comparison of 
the power consumption of single-column counter with and without PW shown in Fig. 
3.16 is based on post-layout simulations scaled to match the measured total power. As 
there is no dedicated supply for each block, the power consumption values shown for 
signal counter, reset counter and FLAG GEN are calculated based on their percentage 
attained from post-layout simulations and measuring the total power consumption of 
single column counter. 
 
3.35 3.35 3.34 3.34
8.88
23.48
4.98 6.38
0
5
10
15
20
25
30
BRIGHT W/ PW BRIGHT W/O PW DARK W/ PW DARK W/O PW
SIGNAL COUNTER
RESET COUNTER
FLAG GEN
P
O
W
ER
C
O
N
SU
M
P
TI
O
N
 (
µ
W
)) 52.8% Save
13.05
27.65
9.14
10.54
 
Fig. 3.16. Power consumption of single-column digital counter. 
 
Fig. 3.17 shows the power breakdown of a single-column digitizer with a total power 
 63 
of 66.8 µW. For the global counter, the ramp and ramp buffer, their equivalent single-
column power is calculated by dividing their entire power over the number of columns. 
In this design, the column comparator power dominates as it is a conservative design still 
covering a full-scale input signal level range. 
 
COMPARATOR
32.28 µ W (48.3%)
COLUMN COUNTER
11.09 µ W (16.6%)
GLOBAL COUNTER
13.18 µ W (19.7%)
RAMP + RAMP BUF
10.15 µ W (15.2%)
 
Fig. 3.17. Power breakdown of single-column digitizer. 
 
Table 3.1 summarizes the performance of the implemented image sensor digitizer. 
Single-column-wise comparison with the state-of-the-art implementations for image 
sensors can be made even though this design does not include an entire multi-megapixel 
readout array. For fair comparison purposes, only the ADC power is considered. 
Compared with the cyclic/SAR ADC based implementations in [30], [32], [37], the 
proposed design appears to be slower as additional time is needed for completing the 
digital CDS procedure. Nevertheless, the proposed digitizer achieves high dynamic 
 64 
range, the lowest per-column power consumption, the smallest area, and competitive 
Schreier FOM of 162.75 dB.  
 
Table 3.1: Performance summary and comparison with the state of the art 
  
  This Work [28] [32] [37] [40] 
Processs 40nm CMOS 
65nm CIS/ 
65nm CMOS 
180nm CIS 180nm CMOS 
40nm CIS/ 
65nm CMOS 
Power Supply 2.5V/2V/1.1V 2.5V/1.8V/1.2V 3.3V/1.8V 3.3V/1.8V 2.8V/2.5V/1.2V 
Digitizer Topology 
Column-
Parallel  
SS 
Column-
Parallel  
PGA+SS 
Column-Parallel 
 CDS+2-Stage-
Cyclic 
32-Column-
Shared 
 CMR+SAR 
Column-Parallel 
CDS+Cyclic-
Cyclic-SAR 
1 Horizontal Time 
(T1h) 
6.02µsec 10µsec 1.92µsec 1.85µsec 0.92µsec 
ADC Conversion Time 2.7µsec 5µsec 1.92µsec 55.7nsec 0.92µsec 
ADC Resolution (N) 12bit 11bit 12bit 12bit 12bit 
ADC Area 5.4µm x 610µm N/A 5.6µm x 1770µm N/A 4.4µm x 920µm 
Digital CDS Y Y N N N 
DNL +0.32/-0.28LSB N/A +0.5/-0.7LSB +6/-1.0LSB +0.82/-0.88LSB 
INL +4.21/-0.94LSB N/A +86/0LSB N/A 
+1.04/-
11.75LSB 
Column FPN 
0.0024%(Dark) 
0.037%(Bright) 
N/A 
0.31%(Dark) before 
off-chip calibration 
N/A N/A 
Random Noise (Vn) 
261.5µVrms 
(0dB Gain) 
N/A 
320µVrms 
(0dB Gain) 
614.4µVrms 
(6dB Gain) 
414µVrms 
(0dB Gain) 
Dynamic Range (DR) 71.8dB N/A 62.9dB 62.3dB 62.1dB 
Per 
Column 
Power 
(P) 
Total 66.8µW N/A  161µW  242.1µW N/A  
Breakdown ADC(66.8µW) 
PGA(32.5µW) CDS(59.9µW) CMR(134.3µW) 
ADC(120µW) 
S-CMP(4µW) ADC(101µW) ADC(107.8µW) 
FoM1a [V·pJ] 0.105 N/A 0.099 0.275 N/A 
FoM2b [V·fJ/step] 0.026 N/A 0.024 0.134 N/A 
FoM3c [dB] 162.75 N/A  154.9 152.8 N/A  
  
     a FoM1 =  P·Vn·T1h·10
12 [V·pJ] (Image Sensor FoM1from [40]) 
     b FoM2 =  P·Vn·T1h·Gain·10
15/2N [V·fJ/step] (Image Sensor FoM2 from [40]) 
     c FoM3 =  DR+10log10(1/(2·T1h·P)) [dB] (Schreier ADC FoM3) 
 
 
 65 
3.6. Conclusion 
A 66.8µW-per-column image sensor digitizer based on SS-ADC topology was 
designed and implemented in a 40 nm low-power CMOS process. The proposed PW 
technique enables a power saving of 52.8% in the hybrid digital counter, while 
maintaining the capability of completing digital CDS in column. A dark/bright column 
FPN of 0.0024%/0.028% from the readout circuit is achieved with the proposed double 
AZ technique without any off-chip processing. The proposed design achieves low 
readout noise and a wide dynamic range of 71.8 dB without any column gain stages. The 
proposed techniques are suitable for readout of back-illuminated 3D-stacked CMOS 
image sensor pixel array and exportable to other architectures. 
 
 
 
 
 
 
 
 
 
 
 
 
 66 
IV. A 50-MHZ BW 67.3-DB SNDR MASH 1-1-1 CT ΔΣ MODULATOR WITH FIR 
DAC AND EELU QUANTIZER IN 40-NM CMOS 
 
4.1. Introduction 
In modern LTE-Advanced (LTE-A) receivers, to boost user throughput and increase 
network capacity, low power (P) wide bandwidth (BW) analog-to-digital converters 
(ADCs) are in critical demands [49]. The continuous-time delta-sigma modulator (CT-
ΔΣM) is the popular architecture of choice for its high dynamic range (DR) capability, 
implicit anti-aliasing behavior, and tolerance to out-of-band blockers [50]–[71]. 
State-of-the-art single-loop CT-ΔΣMs are facing design challenges in achieving wide 
bandwidth and good power efficiency [50]–[63]. In a high-order single-loop ΔΣM, there 
is no guarantee that it will return to stable operation after an overload experience. This 
situation can happen when an agile large blocker comes in or near band; detection and 
reset mechanisms are required for overload recovery. On the other hand, MASH CT-
ΔΣMs [64]–[69] offer high-order in-band noise shaping by cascading low-order single-
loop CT-ΔΣMs. They have superior stability and overload recovery capability compared 
to high-order single-loop ΔΣMs. Therefore, MASH ΔΣMs tolerate more out-of-band 
(OOB) gain, provide more aggressive in-band noise shaping and, thus, present a 
potential for wider bandwidth and lower power capability. However, the noise 
cancellation of MASH ΔΣM architecture relies on critical matching between an analog 
noise transfer function (NTF) and a digitally implemented noise cancelling filter (NCF) 
 67 
[66].  
Most modulators employ a multi-bit quantizer, which provides lower quantization 
noise level, better stability and superior clock jitter sensitivity performance compared to 
single-bit quantizer implementation [50], [52], [56]–[71]. However, the power, silicon 
area and input capacitance of the multi-bit quantizer increase exponentially with the 
quantizer’s number of bits. In addition, it becomes difficult for the multi-bit feedback 
digital-to-analog converter (DAC) to satisfy the matching requirements especially at 
high-speed sampling frequencies. The dynamic element matching (DEM) techniques, 
effective for low-speed high resolution DACs, are not applicable in high sampling-rate 
modulators because extra delay is no longer tolerable in high-speed feedback paths. In 
some implementations, higher supply voltage is required to meet the stringent matching 
requirements [50], [52], [56], [66]–[69]. Look-up table based calibration techniques are 
proposed in [59]; however, this scheme demands complex digital hardware 
implementation. 
On the other hand, a single-bit modulator topology simplifies the quantizer, DAC 
and clock generation circuits compared with multi-bit implementations [51], [53]–[55]. 
A single-bit DAC is inherently linear and no DEM is required, which reduces excess 
loop delay (ELD) and enables a higher sampling rate [51]. However, there are three 
critical issues in employing a single-bit DAC in CT-ΔΣM: (1) a single-bit feedback DAC 
is quite sensitive to clock jitter; (2) the full-scale (FS) swing of the feedback signal in the 
single-bit case is hard for the operational amplifier (OA) to handle; (3) the OOB gain 
needs to be restricted for stability consideration. 
 68 
The concept of employing FIR DAC in a single-bit CT-ΔΣM has already been 
proposed to address the three critical issues just mentioned [51], [53]–[55]. Thanks to the 
noise suppression around half the sampling frequency, the FIR logic improves the jitter 
sensitivity performance of the modulator. The feedback FIR DAC generates multi-level 
current pulses like those found in multi-bit DAC, which relaxes the requirement on the 
slew rate of OA in the first integrator. However, directly adding FIR logic introduces 
additional delay in the feedback path, which affects the original NTF of the modulator 
and usually degrades the in-band noise performance [53]–[55]. In [51], a single FIR 
compensation path is provided to the third integrator, whose non-idealities are attenuated 
by the gain of the previous two integrators. The low-frequency gain of the compensation 
path is made small, thereby reducing its impact on the in-band signals. However, the 
poor matching between the FIR logic and the analog-based compensation filter may 
degrade the compensation and limit the noise-shaping and stability performance of the 
modulator. 
Compared to the previously mentioned single-loop implementation, the MASH CT-
ΔΣM topology based on the quantization noise cancellation scheme provides a unique 
convenience in the NTF compensation for FIR DAC in the feedback path of the 
modulator’s first stage. The degradation in the first stage noise transfer function, NTF1, 
can usually be tolerated due to its low-order implementation and superior stability 
compared with single-loop high-order modulators. Simple adjustment in the noise 
cancellation of the first stage, NCF1, still ensures a solid cancellation of the first stage 
quantization noise in the end. 
 69 
In this paper, a low-power wide-bandwidth MASH 1-1-1 CT-ΔΣM is proposed, 
cascading three first order feedback single-loop CT-ΔΣMs. To address the mismatch 
issue of multi-bit DAC, a single-current-source 1.5-bit DAC is employed. An FIR filter 
is added to the DAC in the feedback path to relax the integrator design and improve the 
jitter sensitivity performance of the modulator. A simple adjustment in the first-stage 
NCF by adding the same FIR logic maintains the properties of the overall modulator’s 
NTF. Instead of employing a conventional high-speed direct feedback path consisting of 
an analog summer and a fast DAC, an encoder embedded loop-unrolling (EELU) 1.5-bit 
quantizer based on multiplexing comparator outputs is proposed. The proposed 
techniques improve the robustness and power efficiency of the modulator and make the 
MASH 1-1-1 CT-ΔΣM a potential candidate for the next generation of radio 
applications. 
The paper is organized as follows: Section II describes the overall architecture and 
system-level design of the proposed MASH 1-1-1 CT-ΔΣM. The circuit-level 
implementations of the most relevant modulator’s building blocks are detailed in Section 
III. The measurement results are discussed in Section IV, and Section V concludes this 
paper. 
4.2. Modulator Architecture 
4.2.1. MASH 1-1-1 CT-ΔΣM Architecture 
The architecture of a MASH 1-1-1 CT-ΔΣM including all inter-stage connection 
possibilities is shown in Fig. 4.1. The modulator is built by cascading three stages of 
 70 
first-order feedback single-loop CT-ΔΣMs. Each stage is comprised of an integrator, a 
main feedback DAC, and a loop-unrolling quantizer, aiming to provide first-order noise 
shaping with zeros placed at dc.  
The blocks and connections in blue will be implemented in the digital domain, while 
those in black remain in the analog domain. The fast feedback path and the summer 
around the quantizer set to achieve ELD compensation can actually be implemented in 
the digital domain as part of the proposed EELU quantizer. Therefore, the nodes B1 and 
B2 become internal digital nodes of the proposed EELU quantizer and are available for 
inter-stage connection. The FIR filter is added before the main feedback DAC with 
coefficient k11 in the first stage to improve its jitter sensitivity and relax the requirement 
of the first stage integrator. The FIR logic affects NTF1, and consequently the overall 
NTF, whose effect is compensated by adjusting NCF1 in the digital domain. 
 
 71 
k10 1/sTs
k11
FIR
Ts
Z-1
A1
k12
1/sTs
k21
Ts
Z-1
A2
k22
b12
d12
1/sTs
k31
Ts
Z-1
A3
k32
b23c23
d13
E1
Y1
x1
E2
Y2
x1
E3
Y3
x1
Vin
c12
c13b13
B1
B2
B3
 
Fig. 4.1. Architecture of a MASH 1-1-1 CT-ΔΣM including all inter-stage connection 
possibilities. 
 
4.2.2. MASH Coefficient Synthesis and NCF 
The MASH 1-1-1 modulator can be analyzed quantitatively employing the impulse 
invariant transformation. The impulse response of the feedback path going through NRZ 
DAC, continuous-time filter H(s) and an impulse sampler is equivalent to a discrete-time 
filter H(z) as shown in Fig. 4.2 [65], [66], [70], [71].  By identifying the three different 
paths: feed-in path (FF), feed-back path (LG) and connecting path (CLF), and after 
calculating their equivalent digital equivalent transfer functions, the impulse invariant 
model of the MASH 1-1-1 modulator with its NCFs can be found as shown in Fig. 4.3 
 72 
[70]. The derived digital equivalent paths include: 
a) Feedback paths: 
 
LG1 = 𝑘11𝐹𝐼𝑅 (
𝑧−2
1 − 𝑧−1
) +𝑘12𝑧
−1 (4. 2) 
𝐿𝐺2 = 𝑘21 (
𝑧−2
1 − 𝑧−1
) + 𝑘22𝑧
−1 (4. 3) 
𝐿𝐺3 = 𝑘31 (
𝑧−2
1 − 𝑧−1
) + 𝑘32𝑧
−1 (4. 4) 
 
b) Connecting paths: 
𝐶𝐿𝐹12 = 𝑘11𝑏12𝐹𝐼𝑅 (
𝑧−2(1 + 𝑧−1)
2(1 − 𝑧−1)2
) + (𝑐12 + 𝑑12𝐹𝐼𝑅) (
𝑧−2
1 − 𝑧−1
) (4. 5) 
𝐶𝐿𝐹13 = 𝑘11𝑏12𝑏23𝐹𝐼𝑅 (
𝑧−2(1 + 4𝑧−1 + 𝑧−2)
6(1 − 𝑧−1)3
) 
+𝑏23(𝑐12 + 𝑑12𝐹𝐼𝑅) (
𝑧−2(1 + 𝑧−1)
2(1 − 𝑧−1)2
) 
+𝑘11𝑏13𝐹𝐼𝑅 (
𝑧−2(1 + 𝑧−1)
2(1 − 𝑧−1)2
) 
+(𝑐13 + 𝑑13𝐹𝐼𝑅) (
𝑧−2
1 − 𝑧−1
) 
(4. 6) 
𝐶𝐿𝐹23 = 𝑘21𝑏23 (
𝑧−2(1 + 𝑧−1)
2(1 − 𝑧−1)2
) + 𝑐23 (
𝑧−2
1 − 𝑧−1
) (4. 7) 
 
To achieve the targeted overall NTF of the modulator, NTF = (1 − z−1)3, the third 
 73 
stage noise-cancelling filter is implemented as: 
 
NCF3 = (1 − z−1)2 (4. 8) 
 
The quantization noise of the first two stages needs to be completely cancelled, 
which means: 
 
NTF(E2 → Y2) NCF2 + NTF(E2 → Y3) NCF3 = 0 (4. 9) 
NTF(E1 → Y1) NCF1 + NTF(E1 → Y3) NCF3 = 0 (4. 10) 
 
Thus, the expression of the NCFs can be computed as: 
 
NCF2 = (1 − z−1)[(c23 +
1
2
𝑘21𝑏23) 𝑧
−2 + (−𝑐23 +
1
2
𝑘21𝑏23)𝑧
−3] (4. 11) 
𝑁𝐶𝐹1 = (1 − 𝑧−1)3{𝑘11𝑏12𝑏23𝐹𝐼𝑅
𝑧−2(1 + 4𝑧−1 + 𝑧−2)
6(1 − 𝑧−1)3
 
+[(𝑘11𝑏13 + 𝑑12𝑏23)𝐹𝐼𝑅 + 𝑐12𝑏23]
𝑧−2(1 + 𝑧−1)
2(1 − 𝑧−1)2
 
+(𝑑13𝐹𝐼𝑅 + 𝑐13)
𝑧−2
1 − 𝑧−1
} 
(4. 12) 
 
For this design, the coefficients c12, c13 and b13 are set to zero. Special care is needed 
on the inter-stage gain coefficients considering the output swing of the integrators. The 
other coefficients are computed to simplify the NCFs and provide first-order noise 
 74 
shaping with zeros placed at dc in each individual stage. The updated topology of the 
proposed MASH 1-1-1 CT-ΔΣM is shown in Fig. 4.4 and its coefficients are listed in 
Group I of Table 4.1. Fig. 4.5 shows the schematic of the NCFs of the proposed 
modulator, which is simple and easy to implement in digital domain. The FIR filter is as 
simple as a 4-tap delay averaging logic [53].  
 
Table 4.1: Coefficients for the proposed MASH 1-1-1 CT-ΔΣM 
                              
Coefficients a g k10 k11 k12 k21 k22 k31 k32 b12 b23 c23 d12 d13 
Group I 1 1 1 1 1 1 1 1 1 1/2 3/4 3/8 1/2 1/8 
Group II 1/2 2 1/2 1 1/2 1 1/2 1 1/2 1 3/2 3/8 1/2 0 
 
H(s)
NRZ 
DAC Ts
IN OUT
H(z)
IN OUT
 
Fig. 4.2. Equivalent circuit to compute equivalent discrete loop transfer function using 
impulse invariant transformation. 
 
 75 
1/sTsk10 1/sTsb12 1/sTsb23
b13
Ts Ts Ts
LG1 LG2
CLF12
LG3
CLF23
CLF13
FIR NCF1 NCF2 NCF3
Y1 Y2 Y3
Y
E1 E2 E3
Vin
 
Fig. 4.3. Equivalent model of the CT-ΔΣM shown in Fig 4.1 with NCFs. 
 
k10 a/sTs
k11
FIR
Ts
Z-1
A1
k12
a/sTs
k21
Ts
Z-1
A2
k22
b12
d12
a/sTs
k31
Ts
Z-1
A3
k32
b23c23
d13
E1
Y1
x g
E2
Y2
x g
E3
Y3
x g
Vin B1
B2
B3
 
Fig. 4.4. The proposed MASH 1-1-1 CT-ΔΣM. 
 76 
 
MASH 1-1-1
Y1
Vin Y2
Y3
FIR
NCF2
NCF3
NCF1
Y
FIR = (1 + z-1 + z-2 + z-3)/4
NCF1 = 3z-1/8
NCF2 = (3z-2/4) (1 – z-1)
NCF3 = (1 – z-1)2
 
Fig. 4.5. NCFs of the proposed MASH 1-1-1 CT-ΔΣM. 
 
A shortcoming of the ELD compensation scheme with fast feedback path coefficient 
k12, k22 and k23 equal unity is that the voltage swings at the output of the integrators at 
nodes A1, A2 and A3 are twice the full-scale of the quantizer. To address the signal 
swing issue, the 1.5-bit quantizer topology is employed instead of a single-bit giving the 
freedom to control the gain of the quantizer to two. This is obtained by reducing the full-
scale range Vref of the quantizer with respect to that of the modulator and by doubling 
the capacitor value in the active RC integrator [74]. In addition, the lowered bandwidth 
of the integrator relaxes the unity-gain frequency requirement of the OA. The updated 
coefficients of the proposed MASH 1-1-1 CT-ΔΣM after quantizer gain scaling are listed 
in Group II of Table 4.1. The digital feedforward path from the first stage to the third 
stage is eliminated (d13 = 0) in the circuit implementation to save power with negligible 
impact on the NTF of the modulator. 
Clocked at 3.0 GHz sampling frequency (Fs), the proposed MASH 1-1-1 CT-ΔΣM, 
employing a 1.5-bit FIR DAC, and 1.5-bit loop-unrolling quantizer, achieves a signal to 
quantization noise ratio (SQNR) of 78.7 dB in a BW of 50.5 MHz, with an input signal 
level of -2.5 dBFS. The simulated output spectrum of the proposed MASH 1-1-1 CT-
 77 
ΔΣM is shown in Fig. 4.6 (a). Without the proposed FIR compensation scheme in NCF1, 
the SQNR of the modulator reduces down to 60.7 dB as shown in Fig. 4.6 (b).  
 
10
0
10
1
10
2
10
3
-200
-150
-100
-50
0
Frequency (MHz)
d
B
F
S
 
 
SQNR=78.7dB, HD2=128.2dB, HD3=108.4dB
10
0
10
1
10
2
10
3
-200
-150
-100
-50
0
Frequency (MHz)
d
B
F
S
 
 
SQNR=60.7dB, HD2=89.0dB, HD3=76.0dB
(a) (b)
 
Fig. 4.6. Simulated output spectrum of the proposed MASH 1-1-1 CT-ΔΣM: (a) w/ FIR 
compensation logic in NCF1; (b) w/o FIR compensation logic in NCF1. 
 
4.2.3. Coefficient Variation 
Extensive simulations verified the impact of coefficient variations on the noise floor 
(NF) of the proposed modulator architecture. Fig. 4.7 shows the simulated quantization 
NF of the proposed MASH 1-1-1 CT-ΔΣM with RC time-constant variations. To ensure 
the NF of the modulator below − 70 dBFS, the global RC time-constant variations need 
to stay within − 6.8% and 4.6%. Therefore, RC time-constant calibration is mandatory 
because the expected variations of the absolute values of R and C can be as large as ± 
20%. In addition to the RC time-constant variation, the DAC current also affects the 
 78 
coefficients; thus, its variations must be minimized as well. The inter-stage coefficient 
variations are compensated by adjusting the capacitor value in the active RC integrator 
externally. 
 
-40 -30 -20 -10 0 10 20 30 40
-90
-80
-70
-60
-50
-40
RC Time-Constant Variations (%)
N
o
is
e 
F
lo
o
r 
(d
B
F
S
)
 
Fig. 4.7. Quantization NF as function of RC time-constant variations. 
 
4.2.4. Clock Jitter 
Clock jitter has a critical impact on the performance of CT-ΔΣMs [72]. The DAC k11 
in the first stage of the modulator is the most sensitive block, since in-band signal 
processing strongly depends on the quality of this loop, and its jitter noise is injected into 
the modulator equivalent to a noise source added directly to the modulator’s input. Due 
to the lack of an on-chip PLL, the prototype chip relies on the performance of an 
external clock source. The interface importing the external clock on chip may add more 
jitter and degrade the performance of the modulator if not managed properly. As shown 
 79 
in Fig. 4.8, without the FIR DAC, the jitter-induced modulator’s NF is limited to around 
− 63 dBFS when the external clock shows a white rms jitter of 1 ps (0.3% of the clock 
period Ts). Thus, the proposed FIR filter is mandatory to improve the modulator’s jitter 
sensitivity. Without the FIR filter, the original 1.5-bit DAC only provides three feedback 
current levels; however, with the FIR filter, the number of current levels increases to 
nine, as shown in Fig. 4.8. To achieve a − 70 dBFS NF for the proposed MASH 1-1-1 
CT-ΔΣM with FIR DAC, the clock rms jitter requirement is relaxed to 2.33 ps, which 
corresponds to 0.7% of the clock period. 
 
0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 1.1
-90
-80
-70
-60
-50
-40
Clock Jitter (%Ts)
N
o
is
e 
F
lo
o
r 
(d
B
F
S
)
Use 1.5bit w/o FIR (3 levels)
Use 1.5bit w/ FIR (9 levels)
w/o FIR
w/ FIR
 
Fig. 4.8. Clock jitter effect on the NF with and without FIR DAC. 
 
4.3. Circuit Implementation 
Fig. 4.9 shows the top-level circuit implementation of the proposed MASH 1-1-1 
CT-ΔΣM. The modulator consists of three active-RC integrators, five 1.5-bit current-
 80 
steering DACs and three 1.5-bit EELU quantizers.  The noise cancelling filters and the 
digital summer are off chip in MATLAB. The following sub-sections describe the key 
building blocks of the proposed modulator.  
 
FIR NCF1 NCF2 NCF3
Y1 Y2 Y3
Y
Off Chip
In MATLAB
C1
+  -
-  +
φQ 
FIR 
DAC
1 φDAC 
+  -
-  +
φQ 
FIR
DAC
12 φDAC 
DAC
2 φDAC 
+  -
-  +
φQ 
DAC
23 φDAC 
DAC
3 φDAC 
C1
R1
R1
C2
C2
R2
R2
C3
C3
R3
R3
OA1 OA2 OA3VIN
V1 V2 V3
 
Fig. 4.9. Top-level circuit implementation of the proposed MASH 1-1-1 CT-ΔΣM. 
 
4.3.1. Operational Amplifier (OA) 
For CT-MASH modulators, the OA gain must be high enough in the signal band to 
meet the distortion targets and to ensure sufficient coefficient accuracy. The circuit 
implementation for OA is shown in Fig. 4.10 (a). The four-stage OA is compensated 
using the no capacitor feedforward (NCFF) scheme [52], [62], [66], [67], [69], [73]. The 
fourth-order cascaded path consists of the transconductors Gm1-4, which provide a high 
gain at low frequencies under low power supply voltage. At high frequencies, the first-
order path through the transconductor Gm14 dominates the OA frequency response to 
guarantee closed-loop stability. The second- and third-order paths through the 
 81 
transconductors Gm12 and Gm13 provide a smooth transition for the OA frequency 
response at intermediate frequencies. The bandwidth of each stage is optimized to 
provide a high gain up to the modulator bandwidth of 50 MHz while maintaining an 
adequate phase margin. Additional NMOS capacitors are added to the first- and second- 
stage outputs to achieve the optimal frequency response and low noise performance. 
Their non-linearity requirements are not a big concern due to small signal swings at 
internal nodes. 
Gm1-3 and Gm12-13 in the OA share a similar schematic as shown in Fig. 4.10 (b). 
The input transistors M1 use small channel length transistors, and the cascode transistors 
M2 are added to improve the output resistance. More headroom is allocated for M3 to 
lower their noise contribution without adding cascode transistors. The single-stage 
amplifier is self-biased with R1 and C1 forming a common-mode feedback (CMFB). 
The transconductor Gm4 and Gm14 are formed by the transistors M3 and M2 as shown 
in Fig. 4.10 (c), respectively. A two-stage Miller compensated CMFB loop with a 
nulling resistor is used. The error amplifier (EA) is implemented using a folded-cascode 
topology with NMOS input as shown in Fig. 4.10 (d). 
Fig. 4.11 shows the post-layout simulated Bode plot of OA1. The loop gain is 
obtained by Cadence stb analysis when OA1 is used in closed loop as the first integrator, 
whereas OA gain is obtained when OA1 is used in open loop including the integrator 
feedback network as its loading. The unity gain frequency of the loop response is 1.6 
GHz, slightly higher than Fs/2, and its phase margin is around 51 deg. At 50 MHz BW 
boundary, a gain of 65 dB is provided by the OA1 to satisfy both the linearity and 
 82 
quantization noise leakage specifications. The requirement of gain at Fs/2 is relaxed for 
a lowered integrator bandwidth after the quantizer gain adjustment. 
 
(a)
Gm1 Gm2 Gm3 Gm4
Gm12 Gm13 Gm14
C14 R14
C1 C2 C3
Cl
Cl
VoutP
VoutN
VinP
VinN
EA
Vcm
V
cm
o Vcmi
Vcm
VinP_ac
VinN_ac
Vcmo
VinN_ac
VoutP
V3-
(b) (c) (d)
Vcmfb
VoutN
M3+ M3-
R1+ R1-
C1+ C1-
Vcmi
VinP_ac
M1
M2+ M2-
V3+
C2
R2
VinP VinN
V1- V1+
Vb
M1+ M1-
M2+ M2-
M3+ M3-
R1+ R1-
C1+ C1-
Ib2
Ib1 Ib1
Vcmi Vcm
Vb2
Vb3
Vb1
M2+
M3+
M5+
M4+
M2-
M3-
M5-
M4-
M1
V1 V2 V3
 
Fig. 4.10. Circuit implementation for OA: (a) NCFF topology; (b) transconductor Gm1; 
(c) transconductor Gm4 and Gm14; (d) error amplifier (EA) required for CMFB loop. 
 
 83 
10
6
10
7
10
8
10
9
10
10
-180
-160
-140
-120
-100
-80
-60
-40
-20
0
Frequency (Hz)
P
h
a
se
 (
d
eg
)
 
 
OA Phase
Loop Phase
CMFB Loop Phase
390 MHz, -119
o
1.6 GHz, -128.7
o
10
6
10
7
10
8
10
9
10
10
-20
0
20
40
60
80
100
Frequency (Hz)
G
a
in
 (
d
B
)
 
 
OA Gain
Loop Gain
CMFB Loop Gain
87 dB
50 MHz, 65 dB
1.6 GHz, 0 dB
390 MHz, 0 dB
(a)
(b)
 
Fig. 4.11. Post-layout simulated OA1’s Bode plot: (a) gain; (b) phase. 
 
4.3.2. Digitally Tunable Capacitor 
To compensate for RC time-constant variations, digitally tunable capacitors are 
employed in the integrators. Table 4.2 shows the digitally tunable capacitors for the loop 
filter capacitors. The capacitor in the first integrator is controlled independently, while 
 84 
the capacitors in the last two stages share the same digital control bus. The switch on-
resistance is low enough to minimize the NTF out-of-band peaking; on the other hand, 
the transistor dimensions are limited to maintain small enough parasitic capacitances. 
 
Table 4.2: Digitally tunable capacitors 
  
  
 Capacitor Capacitance (pF) 
C1 2.5 (1.8 to 4.0 tunable) 
C2 0.625 (0.4 to 1.0 tunable) 
C3 0.625 (0.4 to 1.0 tunable) 
 
4.3.3. Encoder-Embedded Loop-Unrolling (EELU) Quantizer 
Fig. 4.12 (a) shows the conventional analog implementation of the ELD 
compensation path, highlighted in the blue block. The analog feedback path is composed 
of a fast feedback DAC and a summing amplifier [71]. The nth-sampled analog voltage 
at the input of the quantizer B(n) is then computed as: 
 
𝐵(𝑛) = 𝐴(𝑛) − 𝐶(𝑛) 𝑅2; (4. 13) 
 
where A(n) is the analog voltage at the output of the integrator at the nth sampling 
instant and C(n) is the nth feedback current level of the fast DAC. The 3-level 1.5-bit 
quantizer then resolves B(n) by comparing it with ± Vref/3 at the rising edge of ΦQ and 
generates the digital output Y(n): 
 
 85 
Y(n) = {
′11′, 𝑖𝑓 𝐵(𝑛) > 𝑉𝑟𝑒𝑓/3;
′01′, 𝑖𝑓 − 𝑉𝑟𝑒𝑓/3 ≤ 𝐵(𝑛) ≤ 𝑉𝑟𝑒𝑓/3;
′00′, 𝑖𝑓 𝐵(𝑛) < −𝑉𝑟𝑒𝑓/3;
 (4. 14) 
 
Since the quantizer resolves 1.5 bits, the fast DAC has 3 different current levels: 
2Vref/(3R2), 0 and − 2Vref/(3R2). The value of DAC current C(n) depends on the 
previous quantizer output: 
 
C(n) = {
2𝑉𝑟𝑒𝑓/(3𝑅2), 𝑖𝑓 𝑌(𝑛 − 1) = ′11′;
0, 𝑖𝑓 𝑌(𝑛 − 1) = ′01′;
−2𝑉𝑟𝑒𝑓/(3𝑅2), 𝑖𝑓 𝑌(𝑛 − 1) = ′00′;
 (4. 15) 
 
To achieve effective ELD compensation, the operation of the entire feedback path 
needs to be completed within clock period Ts. However, it is power and area demanding 
to meet the high gain-bandwidth product (GBW) requirement of the OA in the summing 
amplifier. 
To address the issue, a digital domain loop-unrolling concept inspired on decision 
feedback equalization (DFE) in wireline communication systems [75] is employed. The 
proposed realization circuit multiplexes among the output of a group of sub-quantizers 
with pre-determined comparison references based on the previous comparison results 
[76], [77]. As shown in Fig. 4.12 (b), the 1.5-bit loop-unrolling quantizer is composed of 
three sub-quantizers, a digital MUX, and a unity delay element. The digital outputs of 
the sub-quantizers are computed based on the following rules: 
 
 86 
𝑌11(n) = {
′11′, 𝑖𝑓 𝐴(𝑛) >  𝑉𝑟𝑒𝑓;
′01′, 𝑖𝑓 𝑉𝑟𝑒𝑓/3 ≤ 𝐴(𝑛) ≤ 𝑉𝑟𝑒𝑓;
′00′, 𝑖𝑓 𝐴(𝑛) <  𝑉𝑟𝑒𝑓/3;
 (4. 16) 
𝑌01(n) = {
′11′, 𝑖𝑓 𝐴(𝑛) >  𝑉𝑟𝑒𝑓/3;
′01′, 𝑖𝑓 − 𝑉𝑟𝑒𝑓/3 ≤ 𝐴(𝑛) ≤ 𝑉𝑟𝑒𝑓/3;
′00′, 𝑖𝑓 𝐴(𝑛) <  −𝑉𝑟𝑒𝑓/3;
 (4. 17) 
𝑌00(n) = {
′11′, 𝑖𝑓 𝐴(𝑛) >  −𝑉𝑟𝑒𝑓/3;
′01′, 𝑖𝑓 − 𝑉𝑟𝑒𝑓 ≤ 𝐴(𝑛) ≤ −𝑉𝑟𝑒𝑓/3;
′00′, 𝑖𝑓 𝐴(𝑛) <  −𝑉𝑟𝑒𝑓;
 (4. 18) 
 
These rules show that the references of the sub-quantizer with digital output Y11(n) 
are shifted up by 2Vref/3 compared with the conventional values ± Vref/3. Similarly, the 
comparison reference of the sub-quantizer with digital outputs, Y01(n) and Y00(n), are 
shifted by 0 and − 2Vref/3, respectively. The final digital output of the loop-unrolling 
quantizer with embedded digital ELD compensation path is achieved by multiplexing 
among the digital outputs of the sub-quantizers. All three potential quantizations are 
realized and the correct one is selected by the bits associated with the previous output 
result. If the previous comparison result is ‘11’, ‘01’ or ‘00’, the final digital output is 
chosen to be equal to Y11, Y01, or Y00, respectively, which means: 
 
𝑌(n) = {
𝑌11(n), 𝑖𝑓 𝑌(n − 1) = ′11′;
𝑌01(n), 𝑖𝑓 𝑌(𝑛 − 1) = ′01′;
𝑌00(n), 𝑖𝑓 𝑌(n − 1) = ′00′;
 (4. 19) 
 
The operation flow of a 1.5-bit loop-unrolling quantizer is detailed in Fig. 4.12 (c). 
 87 
During the succeeding clock cycles, the integrator output voltage A is assumed to be 
constant for cycle N and N − 1. The two succeeding cycles N − 1 and N operate as 
follows: 1) During cycle N − 2, assuming the previous comparison result is ‘01’, the 
reference levels for 1.5-bit quantization are ± Vref/3. 2) If A(n − 1) is higher than 
Vref/3, the output during cycle N − 1 should be ‘11’. 3) Thus, the fast DAC will 
feedback 2Vref/3, and in cycle N, the comparison should be between A(n) − 2Vref/3 
and the default reference levels ± Vref/3. This is equivalent to compare A(n) with shifted 
up references Vref/3 and Vref. 4) If the input voltage A(n) is higher than Vref/3, but 
lower than Vref, the output during cycle N should be ‘01’. The 1.5-bit loop-unrolling 
quantizer achieves the same functionality as the conventional analog implementation 
counterpart. 
 
1.5-bit Quantizer
(2 Comparators)
(b)
C1
+  -
-  +
φQ 
FIR 
DAC
1 φDAC 
C1
R1
R1
Y(n)
VIN
Z-1
MUX
Y11 Y01 Y00
Z-1
A(n)
Y(n-1)
(c)
Vref
Vref/3
-Vref/3
-Vref
2Vref/3
-2Vref/3
0
11
01
00
Quantizer
Reference
Quantizer
Code
Fast
DAC
Cycle N-1
A(n-1) Vref
Vref/3
-Vref/3
-Vref
2Vref/3
-2Vref/3
0
11
01
00
Quantizer
Reference
Quantizer
Code
Fast
DAC
Cycle N
A(n)
(a)
VIN
C1
+  -
-  +
φQ 
FIR 
DAC
1 φDAC 
C1
R1
R1
Y(n)
FAST
DAC
φDAC 
+  -
-  +
R2
R2
R2
R2
A(n) B(n)
Z-1
C(n)
Y(n-1)
LOOP UNROLLING QUANTIZER
+ DIGITAL ELD COMPENSATION
Sub-Quantizers
 
Fig. 4.12. ELD compensation path: (a) with conventional analog implementation; (b) 
with loop-unrolling digital implementation; (c) operation flow example. 
 88 
To provide the 3-bit digital control signal for the 1.5-bit DAC, a simple digital 
encoder built with NAND gates is required in the feedback path. The encoder can be 
arranged after the MUX or before the MUX, which results in two different topologies as 
shown in Fig. 4.13 (a) MUX-ENCODER topology and (b) ENCODER-MUX, 
respectively. The local cycling loop of the ENCODER-MUX topology operates faster, as 
the encoder block is no longer a part of the loop, as in the MUX-ENCODER topology. 
Thus, in the ENCODER-MUX topology, the final quantizer output data P gets ready 
earlier and holds valid for a larger percentage of the clock cycle for the DAC DFFs to 
capture, compared with the MUX-ENCODER topology. It is then beneficial to allocate 
the encoder block before the MUX for its higher speed operation potential. The complete 
top-level schematic of the proposed 1.5-bit EELU quantizer based on the ENCODER-
MUX topology is shown in Fig. 4.14. On the left part of the figure, there are three sets of 
reference-level-shifted sub-quantizers, each of which consists of two StrongARM 
comparators. However, only four comparators in total are realized, as the other two are 
redundant. The tri-stage gates encompass the encoder as part of the MUX to reduce the 
delay in the signal path. The inverter-chain buffers ensure driving capability of the long-
distance routing interconnections between the EELU quantizers and DACs.  
 
 89 
(a)
(b)
M
U
X
DFFs (z-1) 
SR
Latch
P
E
D B
A DFFs ENCODER
φQ 
φQd 
φDAC 
DACs
SR
Latch
P
E
D B
DFFs (z-1) 
ENCODER
M
U
X
DFFs 
φDAC 
DACs
A
φQ 
φQd 
 
Fig. 4.13. Loop-unrolling quantizer employing: (a) MUX-ENCODER topology; (b) 
ENCODER-MUX topology. 
 
 90 
SR
Latch
SR
Latch
SR
Latch
SR
Latch
SR
Latch
SR
Latch
VIN
-Vref/3
-Vref
Vref/3
-Vref/3
Vref
Vref/3
Vref
-Vref
Q
QB
Q
QB
Q
QB
Q
QB
Q
QB
Q
QB
φQ 
φQ 
φQ 
φQ 
φQ 
φQ 
-2Vef/3
-Vref/3
Vref/3
2Vref/3
0
SUB-QUANTIZERS
DFFs (z-1) 
P<0>
  
  
P<2>
P<1>
EN<0:2> DP<0:2>
00
01
11
N
A
N
D
1
0
1
0
1
0
00
01
11
N
A
N
D
1
0
1
0
1
0
11
01
00
N
A
N
D
1
0
1
0
1
0
ENCODER-MUX
φQd 
 
Fig. 4.14. 1.5-bit encoder-embedded loop-unrolling (EELU) quantizer topology. 
 
4.3.4. FIR DAC 
The detailed implementation of the 1.5-bit FIR current-steering DAC is shown in 
Fig. 4.15. The total DAC current IDAC is segmented into four equal sections with one 
clock cycle delayed digital control between each [53]. Analog drivers (NDR and PDR) 
placed after the DFFs adjust the on/off voltage limits and the crossing voltage levels to 
improve the switching performance of the DAC segments. A complementary N-P DAC 
topology under low voltage supply of 1.2 V is employed for its good power efficiency 
 91 
compared with N or P alone DAC topologies [56], [67]–[69]. Different from a multi-bit 
DAC that demands stringent matching between segments, each segment of the FIR DAC 
is inherently linear for its single current source topology. On the other hand, a 1% 
current matching among the DAC segments is desirable for the FIR coefficient accuracy 
and quantization noise cancellation, based on a 500-run Monte-Carlo simulation as 
shown in Fig. 4.16. The DAC current IDAC/4 is mirrored from a biasing current generated 
by a reference voltage over resistor. External 10 μF ceramic capacitors are used to 
decouple the common-mode noise from the biasing circuitry. The biasing resistor is 
placed close to the integrator resistors in layout to improve matching [66]. A 90-dB dc 
gain telescope amplifier is employed to achieve decent current resistance product 
accuracy. The 1.5-bit DAC current is routed towards VCM when its digital control code 
is ‘01’. Thus, the noise performance of the 1.5-bit DAC is better compared with the 
conventional 1-bit design in which the DAC noise is always injected into the modulator. 
 
DFF
IDAC/4
PDR
Pd0<0:2>
DFF
IDAC/4
PDR
Pd1<0:2>
DFF
IDAC/4
PDR
Pd2<0:2>
DFF
IDAC/4
PDR
Pd3<0:2>
DFF
IDAC/4
NDR
Nd0<0:2>
DFF
IDAC/4
NDR
Nd1<0:2>
DFF
IDAC/4
NDR
Nd2<0:2>
DFF
IDAC/4
NDR
Nd3<0:2>
IN
IP
VCM
N<2>
P<2>
BPT
BPC
BNC
BNT
N<1> N<0>
P<1> P<0>
VCM
IN
IP
IDAC/4
IDAC/4
 
Fig. 4.15. 1.5-bit FIR DAC implementation. 
 92 
 
74 75 76 77 78 79 80
0
50
100
150
200
250
300
SQNR (dB)
N
u
m
b
er
 o
f 
H
it
s
SIGMA = 0.82 dB, MEAN = 77.88 dB
 
Fig. 4.16. 500-run Monte-Carlo simulated distribution of SQNR with 1% current 
mismatch among segments of FIR DAC. 
 
4.4. Measurement Results 
The prototype MASH 1-1-1 CT-ΔΣM was fabricated in a 40-nm low-power CMOS 
process; Fig. 4.17 shows the microphotograph of the chip. The total area of the prototype 
modulator is 0.177 mm
2 
in which the modulator core occupies 0.127 mm
2
. The 
modulator is clocked at 3 GHz generated using an external clock source from Agilent 
N5171B and on-chip clock buffers. The 3 GHz digital data of the modulator are sent out 
of chip using pulse amplitude modulation (PAM-4) and captured using high-speed 
oscilloscope Agilent DSA91304A.  
 93 
 
700 um
285 um
 
Fig. 4.17. Microphotograph of the prototype chip. 
 
Fig. 4.18 shows the measured fast Fourier transform (FFT) spectrum of the prototype 
MASH 1-1-1 CT-ΔΣM output for a 10 MHz single-tone sinusoidal input signal provided 
by Agilent E8267D and filtered by a KR Electronics 2796-SMA bandpass filter. The 
measured peak signal-to-noise and distortion ratio (SNDR) and peak signal-to-noise 
ratio (SNR) are 67.3 dB and 68 dB, for an input amplitude of − 2.8 dBFS and − 1.6 
dBFS, respectively. Under peak SNDR condition, the second and third order harmonic 
distortion components HD2 and HD3 are at − 80 dB and − 80.1 dB levels, respectively. 
Since the bandwidth of the modulator is set at 50.5 MHz, the SNDR integration includes 
up to a fifth-order distortion harmonic component. Fig. 4.19 shows the measured SNR 
 94 
and SNDR vs single-tone sinusoidal input signal amplitude at a frequency of 10 MHz. 
The measured dynamic range of the modulator is 68.2 dB.  
 
10
0
10
1
10
2
10
3
-100
-80
-60
-40
-20
0
Frequency (MHz)
d
B
F
S
 
 
-2.8 dBFS
8
0
.0
 d
B
8
0
.1
 d
B
 
Fig. 4.18. Measured single-tone FFT spectrum. 
 
 95 
-70 -60 -50 -40 -30 -20 -10 0
0
10
20
30
40
50
60
70
Input Signal (dBFS)
M
a
g
a
n
it
u
d
e 
(d
B
)
 
 
SNDR
SNR
-5 -4 -3 -2 -1 0
63
64
65
66
67
68
69
Dynamic Range (DR) = 68.2 dB
 
Fig. 4.19. Measured SNR and SNDR versus the modulator’s input signal level. 
 
Fig. 4.20 shows the measured FFT spectrum for two-tone sinusoidal input signals at 
frequencies of 38 MHz and 42 MHz; the amplitude of each tone corresponds to − 8.8 
dBFS. The two-tone input signals are combined by a power combiner and filtered by the 
KR Electronics 2510-SMA bandpass filter. The measured third-order intermodulation is 
78.8/78.4 dB. The harmonic distortion components are very close to the noise level of 
the setup, demonstrating the outstanding linearity of the architecture. 
 
 96 
30 34 38 42 46 50
-100
-80
-60
-40
-20
0
Frequency (MHz)
d
B
F
S
 
 
7
8
.8
 d
B
-8.8 dBFS
7
8
.4
 d
B
 
Fig. 4.20. Measured two-tone FFT spectrum. 
 
Fig. 4.21 shows the measured noise floor versus the RC time-constant control code 
of the first stage of the prototype modulator. The control code adjusts the digitally 
tunable capacitor in the active RC integrator. The digital code for the modulator’s first 
stage was swept over its entire range while keeping the last two stages at their nominal 
value.  
 
 97 
0 20 40 60 80 100 120
-80
-70
-60
-50
-40
-30
-20
-10
0
Stage #1 RC Time-Constant Control Code
N
o
is
e 
F
lo
o
r 
(d
B
F
S
)
 
 
Noise Floor
Noise Floor Stage #1
 
Fig. 4.21. Measured NF versus the first stage RC time-constant control code. 
 
Fig. 4.22 illustrates the measured power consumption of the prototype modulator. 
The total power of the modulator is 18.98 mW. The modulator’s core composed of the 
integrator OAs, the FIR DACs and the EELU quantizer is powered with a 1.2 V voltage 
supply. Only the biasing circuitry uses a 2.5 V supply to provide a biasing current with 
good accuracy. 
 
 98 
OPAMPs
9.02 mW (47.5%)
QUANTIZERs + DIGITAL
4.93 mW (26.0%)
DACs
3.95 mW (20.8%)
BIAS
1.08 mW (5.7%)
 
Fig. 4.22. Measured power consumption breakdown. 
 
4.5. Conclusion 
This work demonstrates a MASH 1-1-1 CT-ΔΣM employing FIR DACs and EELU 
quantizers. An FIR filter in an inherently linear 1.5-bit DAC improves the modulator’s 
jitter sensitivity performance. The MASH topology compensates the FIR’s effect on the 
modulator’s NTF. A 1.5-bit EELU quantizer topology based on multiplexing comparator 
outputs achieves high-speed encoding and ELD compensation, eliminating the need of a 
fast analog summing block. The modulator’s prototype fabricated in a 40-nm CMOS 
technology, achieves 67.3 dB of SNDR, 68 dB of SNR, and 68.2 dB of DR in 50.5 MHz 
BW with a total power consumption of 19 mW. The proposed modulator features a 
161.5 dB of figure-of-merit (FOM), defined as FOM = SNDR + 10 log10 (BW/P). Table 
4.3 summarizes the performance of the modulator and shows that this design is 
competitive compared to state-of-the-art CT-ΔΣMs.  
 99 
Table 4.3: Performance summary and comparison with  state-of-the-art CT-ΔΣMs 
  
    
  
This 
Work 
[69] [68] [67] [66] [61] [60] [57] [56] [51] 
Technology 
(nm) 
40 28 28 28 40 16 65 65 20 90 
MASH Yes Yes Yes Yes Yes No No No No No 
Fs (GHz) 3 6 1.8 3.2 1 2.88 1.2 1.28 2.184 3.6 
BW (MHz) 50.5 350 50 53.3 50.3 160 50 50 80 36 
Peak SNR (dB) 68 66.8 76.8 83.1 75.8 68.1 71.7 71 70 76.4 
Peak SNDR 
(dB) 
67.3 64.8 74.9 71.4 74.4 65.3 71.5 64 67.5 70.9 
DR (dB) 68.2 72.8 85 88 76.8 72.1 72 75 73 76.4 
Power Supply 
(V) 
1.2 -1 1.3 -1 1.1 0.8 N/A 1.2 1 1.2 
  
2.5 1 1.5 0.9 1.15 1.4   1.5 1.2   
  1.8   1.8 2.5 1.5     1.5   
Area (mm2) 0.177 1.4 0.337 0.9 0.265 0.155 0.5 0.49 0.1 0.12 
Power (P) (mW) 18.98 756 80.4 235 43 40 54 38 23 15 
FOMa(dB) 161.5 151.5 162.8 154.9 165.1 161.3 161.2 155.2 162.9 164.7 
FOMS
b (dB) 162.4 159.5 172.9 171.5 167.5 168.1 161.7 166.2 168.4 176.8 
FOMW
c (fJ/c-s) 99.2 761.1 177.1 730.8 99.8 82.8 176 293.6 74.2 72.7 
  
  
       a FOM=SNDR+10×log10 (BW/P) 
         b FOMS=DR+10×log10 (BW/P) 
         c FOMW=P/(2×BW×2^((SNDR-1.76)/6.02)) 
    
 
 
 
 100 
V. CONCLUSION 
 
In this dissertation, two prototype chips with techniques targeting lower power ADC 
design in advanced CMOS technology nodes are presented. 
A 66.8µW-per-column image sensor digitizer based on SS-ADC topology was 
designed and implemented in a 40 nm low-power CMOS process. The proposed PW 
technique enables a power saving of 52.8% in the hybrid digital counter, while 
maintaining the capability of completing digital CDS in column. A dark/bright column 
FPN of 0.0024%/0.028% from the readout circuit is achieved with the proposed double 
AZ technique without any off-chip processing. The proposed design achieves low 
readout noise and a wide dynamic range of 71.8 dB without any column gain stages. The 
proposed techniques are suitable for readout of back-illuminated 3D-stacked CMOS 
image sensor pixel array and exportable to other architectures. 
A MASH 1-1-1 CT-ΔΣM topology employing FIR DACs and EELU quantizers is 
proposed for wide-band mobile/wireless communication circuits. An FIR filter in an 
inherently linear 1.5-bit DAC improves the modulator’s jitter sensitivity performance. 
The MASH topology compensates the FIR’s effect on the modulator’s NTF. A 1.5-bit 
EELU quantizer topology based on multiplexing comparator outputs achieves high-
speed encoding and ELD compensation, eliminating the need of a fast analog summing 
block. The modulator’s prototype fabricated in a 40-nm CMOS technology, achieves 
67.3 dB of SNDR, 68 dB of SNR, and 68.2 dB of DR in 50.5 MHz BW with a total 
power consumption of 19 mW. 
 101 
REFERENCES 
 
[1] S. Naraghi, M. Courcy, and M. P. Flynn, “A 9-bit 14 µW and 0.06 mm2 Pulse 
Position Modulation ADC in 90 nm Digital CMOS”,  IEEE J. Solid-State Circuits 
(JSSC), vol. 45, no. 9, pp. 1870-1880, Sep 2010.  
[2] M. F. Snoeji, A. J. P. Theuwissen, K. A. A. Makinwa, and J. H. Huijsing, “Multiple-
Ramp Column-Parallel ADC Architectures for CMOS Image Sensors”, IEEE J. 
Solid-State Circuits (JSSC), vol. 42, no. 12, pp. 2968-2977, Dec 2007. 
[3] S.-W. M. Chen, and R. W. Broderson, “A 6-bit 600-MS/s 5.3-mW Asynchronous 
ADC in 0.13-µm CMOS”, IEEE J. Solid-State Circuits (JSSC), vol. 41, no. 12, pp. 
2669-2680, Dec 2006. 
[4] C.-C. Liu, S.-J. Chang, G.-Y. Huang, and Y.-Z. Lin, “A 10-bit 50-MS/s SAR ADC 
With a Monotonic Capacitor Switching Procedure”, IEEE J. Solid-State Circuits 
(JSSC), vol. 45, no. 4, pp. 731-740, Apr 2010. 
[5] C.-Y. Lin, Y.-H. Wei, and T.-C. Lee, “A 10b 2.6GS/s Time-Interleaved SAR ADC 
with Background Timing-Skew Calibration”, in IEEE Int. Solid-State Circuits Conf. 
(ISSCC) Dig. Tech. Papers, 2016, pp. 468-469. 
[6] T. B. Cho, and P. R. Gray, “A 10 b, 20 Msample/s, 35 mW Pipeline A/D Converter”, 
IEEE J. Solid-State Circuits (JSSC), vol. 30, no. 3, pp. 166-172, Mar 1995.  
[7] B. Murmann, and B. E. Boser, “A 12-bit 75-Msample/s ADC Using Open-Loop 
Residue Amplification”, IEEE J. Solid-State Circuits (JSSC), vol. 38, no. 12, pp. 
2040-2050, Dec 2003. 
 102 
[8] J. Wu, A. Chou, C.-H. Yang, Y. Ding, Y.-J. Ko, S.-T. Lin, W. Liu, C.-M. Hsiao, M.-
H. Hsieh, C.-C. Huang, J.-J. Hung, K. Y. Kim, M. Le, T. Li, W.-T. Shih, A. 
Shrivastava, Y.-C. Yang, C.-Y. Chen, and H.-S. Huang, “A 5.4GS/s 12b 500mW 
Pipeline ADC in 28nm CMOS”, in Proc. IEEE VLSI Circuits (VLSIC) Symp., 2014, 
pp. 1-2. 
[9] B. E. Boser, and B. A. Wooley, “The Design of Sigma-Delta Modulation Analog-to-
Digital Converters”, IEEE J. Solid-State Circuits (JSSC), vol. 23, no. 6, pp. 1298-
1308, Dec 1988. 
[10] M. Ortmanns and F. Gerfers, Continuous-Time Sigma-Delta A/D Convertion: 
Fundamentals, Performance Limits and Robust Implementations, Springer-Verlag 
Berlin Heidelberg, 2006. 
[11] Y. Lim, and M. P. Flynn, “A 1 mW 71.5 dB SNDR 50 MS/s 13 bit Fully 
Differential Ring Amplifier Based SAR-Assisted Pipeline ADC”, IEEE J. Solid-
State Circuits (JSSC), vol. 50, no. 12, pp. 2901-2911, Dec 2015. 
[12] J. A. Fredenburg, and M. P. Flynn, “A 90-MS/s 11-MHz-Bandwidth 62-dB SNDR 
Noise-Shaping SAR ADC”, IEEE J. Solid-State Circuits (JSSC), vol. 47, no. 12, pp. 
2898-2904, Dec 2012. 
[13] Y.-S. Shu, L.-T. Kuo, and T.-Y. Lo, “An Oversampling SAR ADC With DAC 
Mismatch Error Shaping Achieving 105 dB SFDR and 101 dB SNDR Over 1 kHz 
BW in 55 nm CMOS”, IEEE J. Solid-State Circuits (JSSC), vol. 51, no. 12, pp. 
2928-2940, Dec 2016. 
[14] H.-C. Tsai, C.-L. Lo, C.-Y. Ho, and Y.-H. Lin, “A 64fJ/Conv.-Step Continuous-
 103 
Time ΣΔ Modulator in 40-nm CMOS Using Asynchronous SAR Quantizer and 
Digital ΣΔ Truncator”, IEEE J. Solid-State Circuits (JSSC), vol. 48, no. 11, pp. 
2637-2648, Nov 2013. 
[15] B. Wu, S. Zhu, B. Xu, and Y. Chiu, “A 24.7mW 45MHz-BW 75.3dB-SNDR SAR-
Assisted CT ΔΣ Modulator with 2nd-Order Noise Coupling in 65nm CMOS”, in 
IEEE Int. Solid-State Circuits Conf. (ISSCC) Dig. Tech. Papers, 2016, pp. 270-271. 
[16] O. Rajaee, T. Musah, N. Maghari, S. Takeuchi, M. Aniya, and U.-K. Moon, “A 
24.7mW 45MHz-BW 75.3dB-SNDR SAR-Assisted CT ΔΣ Modulator with 2nd-
Order Noise Coupling in 65nm CMOS”, IEEE J. Solid-State Circuits (JSSC), vol. 
45, no. 4, pp. 719-730, Apr 2010. 
[17] A. Gharbiya, and D. A. Johns, “A 12-bit 3.125MHz Bandwidth 0-3 MASH Delta-
Sigma Modulator”, IEEE J. Solid-State Circuits (JSSC), vol. 44, no. 7, pp. 2010-
2018, Jul 2009. 
[18] S.-C. Lee, and Y. Chiu, “A 15-MHz Bandwidth 1-0 MASH ΣΔ ADC With 
Nonlinear Memory Error Calibration Achieving 85-dBc SFDR”, IEEE J. Solid-State 
Circuits (JSSC), vol. 49, no. 3, pp. 695-707, Mar 2014. 
[19] S. Sukegawa, T. Umebayashi, T. Nakajima, H. Kawanobe, K. Koseki, I. Hirota, T. 
Haruta, M. Kasai, K. Fukumoto, T. Wakano, K. Inoue, T. Hirayama, and N. 
Fukushima, “ A1/4-inch 8Mpixel Back-Illuminated Stacked CMOS Image Sensor”, 
in IEEE Int. Solid-State Circuits Conf. (ISSCC) Dig. Tech. Papers, 2013, pp. 484-
485.  
[20] M. Ikebe, D. Uchida, Y. Take, M. Someya, S. Chikuda, K. Matsuyama, T. Asai, T. 
 104 
Kuroda, and M. Motomura, “Image Sensor/Digital Logic 3D Stacked Module 
featuring Inductive Coupling Channels for High-Speed/Low-Noise Image Transfer”, 
in Proc. IEEE VLSI Circuits (VLSIC) Symp., 2015, pp. C82-C83. 
[21] A. Suzuki, N. Shimamura, T. Kainuma, N. Kawazu, C. Okada, T. Oka, K. Koiso, T. 
Ichikawa, M. Mizuno, T. Sugioka, T. Morikawa, Y. Inada, H. Wakabayashi, “A 
1/1.7-inch 20Mpixel Back-Illuminated Stacked CMOS Image Sensor for New 
Imaging Applications”, in IEEE Int. Solid-State Circuits Conf. (ISSCC) Dig. Tech. 
Papers, 2015, pp. 110-111. 
[22] S.-F. Yeh, C.-C. Hsieh, and K.-Y. Yeh, “A 3 Megapixel 100 Fps 2.8 µm Pixel Pitch 
CMOS Image Sensor Layer with Built-in Self-Test for 3D Integrated Imagers”, 
IEEE J. Solid-State Circuits (JSSC), vol. 48, no. 3, pp. 839-849, Mar. 2013. 
[23] S-F. Yeh, K.-Y. Chou, H.-Y. Tu, C. Y.-P. Chao, and F.-L. Hsueh, “A 0.66e-rms 
Temporal-Readout-Noise 3D-Stacked CMOS Image Sensor with Conditional 
Correlated Multiple Sampling (CCMS) Technique, in Proc. IEEE VLSI Circuits 
(VLSIC) Symp., 2015, pp. C84-C85. 
[24] T. Kondo, Y. Takemoto, K. Kobayashi, M. Tsukimura, N. Takazawa, H. Kato, S. 
Suzuki, J. Aoki, H. Saito, Y. Gomi, S. Matsuda, and Y. Tadaki, “A 3D stacked 
CMOS image sensor with 16Mpixel global-shutter mode and 2Mpixel 10000fps 
mode using 4 million interconnections”, in Proc. IEEE VLSI Circuits (VLSIC) Symp., 
2015, pp. C90-C91. 
[25] V. Suntharalingam, R. Berger, S. Clark, J. Knecht, A. Messier, K. Newcomb, D. 
Rathman, R. Slattery, A. Soares, C. Stevenson, K. Warner, D. Young, L. Ag, B. 
 105 
Mansoorian, D. Shaver, “A 4-Side Tileable Back Illuminated 3D-Integrated Mpixel 
CMOS Image Sensor”, in IEEE Int. Solid-State Circuits Conf. (ISSCC) Dig. Tech. 
Papers, 2009, pp. 38-39. 
[26] J. Ahn, K. Lee, Y. Kim, H. Jeong, B. Kim, H. Kim, J. Park, T. Jung, W. Park, T. 
Lee, E. Park, S. Choi, Y. Kim, Y. J. Jung, D. Park, Se. Nah, Y. Oh, M. Kim, Y. Lee, 
Y. Chung, I. Hisanori, J. Im, D.-J. Lee, B. Yim, Gi. Lee, H. Kown, S. Choi, J. Lee, 
D. Jang, Y. Kim, T. C. Kim, G. Hiroshige, C.-Y. Choi, D. Lee, and G. Han, “A 1/4-
inch 8Mpixel CMOS Image Sensor with 3D Backside-Illuminated 1.12µm Pixel 
with Front-Side Deep-Trench Isolation and Vertical Transfer Gate”, in IEEE Int. 
Solid-State Circuits Conf. (ISSCC) Dig. Tech. Papers, 2014, pp. 124-125. 
[27] J. Aoki, Y. Takemoto, K. Kobayashi, N. Sakaguchi, M. Tsukimura, N. Takazawa, 
H. Kato, T. Kondo, H. Saito, Y. Gomi, and Y. Tadaki, “A Rolling-Shutter 
Distortion-Free 3D Stacked Image Sensor with -160dB Parasitic Light Sensitivity In-
Pixel Storage Node”, in IEEE Int. Solid-State Circuits Conf. (ISSCC) Dig. Tech. 
Papers, 2013, pp. 482-483. 
[28] K. Shiraishi, Y. Shinozuka, T. Yamashita, K. Sugiura, N. Watanabe, R. Okamoto, 
T. Ashitani, M. Furuta, and T. Itakura, “A 1.2e-rms Temporal Noise 3D-Stacked 
CMOS Image Sensor with Comparator-Based Multiple-Sampling PGA”, in IEEE 
Int. Solid-State Circuits Conf. (ISSCC) Dig. Tech. Papers, 2016, pp. 122-123. 
[29] C. C.-M. Liu, M. M. Mhala, C.-H. Chang, H. Tu, P.-S. Chou, C. Chao, F.-L. Hsueh, 
“ A 1.5V 33Mpixel 3D-Stacked CMOS Image Sensor with Negative Substrate Bias” 
, in IEEE Int. Solid-State Circuits Conf. (ISSCC) Dig. Tech. Papers, 2016, pp. 124-
 106 
125.  
[30] T. Arai, T. Yasue, K. Kitamura, H. Shimamoto, T. Kosugi, S. Jun, S. Aoyama, M.-
C. Hsu, Y. Yamashita, H. Sumi, and S. Kawahito, “A 1.1µm 33Mpixel 240fps 3D-
Stacked CMOS Image Sensor with 3-Stage Cyclic-Based Analog-to-Digital 
Converters”, in IEEE Int. Solid-State Circuits Conf. (ISSCC) Dig. Tech. Papers, 
2016, pp. 126-127. 
[31] J.-H. Park, S. Aoyama, T. Watanabe, T. Akahori, T. Kosugi, K. Isobe, Y. Kaneko, 
Z. Liu, K. Muramatsu, T. Matsuyama, and S. Kawahito, “A 0.1e- Vertical FPN 4.7e- 
Read Noise 71dB DR CMOS Image Sensor with 13b Column-Parallel Single-Ended 
Cyclic ADCs”, in IEEE Int. Solid-State Circuits Conf. (ISSCC) Dig. Tech. Papers, 
2009, pp. 268-269. 
[32] T. Watabe, K. Kitamura, T. Sawamoto, T. Kosugi, T. Akahori, T. Iida, K. Isobe, T. 
Watanabe, H. Shimamoto, H. Ohtake, S. Aoyama, S. Kawahito, and N. Egami, “A 
33Mpixel 120fps CMOS Image Sensor Using 12b Column-Parallel Pipelined Cyclic 
ADCs”, in IEEE Int. Solid-State Circuits Conf. (ISSCC) Dig. Tech. Papers, 2012, pp. 
388-389. 
[33] Y. Chae, J. Cheon, S. Lim, D. Lee, M. Kwon, K. Yoo, W. Jung, D.-H. Lee, S. Ham, 
and G. Han, “A 2.1Mpixel 120frame/s CMOS Image Sensor with Column-Parallel 
ΔΣ ADC Architecture”, in IEEE Int. Solid-State Circuits Conf. (ISSCC) Dig. Tech. 
Papers, 2010, pp. 394-395. 
[34] Y. Oike, and A. E. Gamal, “A 256x256 CMOS Image Sensor with ΔΣ-Based 
Single-Shot Compressed Sensing”, in IEEE Int. Solid-State Circuits Conf. (ISSCC) 
 107 
Dig. Tech. Papers, 2012, pp. 386-387. 
[35] S. Matsuo, T. Bales, M. Shoda, S. Osawa, B. Almond, Y. Mo, J. Gleason, T. Chow, 
and J. Takayanagi, “A Very Low-Column FPN and Row Temporal Noise 8.9 M-
Pixel, 60 fps CMOS Image Sensor with 14bit Column Parallel SA-ADC”, in Proc. 
IEEE VLSI Circuits (VLSIC) Symp., 2008, pp. 138-139. 
[36] J. Deguchi, F. Tachibana, M. Morimoto, M. Chiba, T. Miyaba, H. Tanaka, K. 
Takenaka, S. Funayama, K. Amano, K. Sugiura, R. Okamoto, and S. Kousai, “A 
187.5µVrms-Read-Noise 51mW 1.4Mpixel CMOS Image Sensor with PMOSCAP 
Column CDS and 10b Self-Differential Offset-Cancelled Pipeline SAR-ADC”, in 
IEEE Int. Solid-State Circuits Conf. (ISSCC) Dig. Tech. Papers, 2013, pp. 494-495. 
[37] R. Funatsu, S. Huang, T. Yamashita, K. Stevulak, J. Rysinski, D. Estrada, S. Yan, 
T. Soeno, T. Nakamura, T. Hayashida, H. Shimamoto, and B. Mansoorian, 
“133Mpixel 60fps CMOS Image Sensor with 32-Column Shared High-Speed 
Column-Parallel SAR ADCs”, in IEEE Int. Solid-State Circuits Conf. (ISSCC) Dig. 
Tech. Papers, 2015, pp. 112-113. 
[38] Y. Nitta, Y. Muramatsu, K. Amano, T. Toyama, J. Yamamoto, K. Mishina, A. 
Suzuki, T. Taura, A. Kato, M. Kikuchi, Y. Yasui, H. Nomura, and N. Fukushima, “ 
High-Speed Digital Double Sampling with Analog CDS on Column Parallel ADC 
Architecture for Low-Noise Active Pixel Sensor”, in IEEE Int. Solid-State Circuits 
Conf. (ISSCC) Dig. Tech. Papers, 2006, pp. 2024-2031. 
[39] T. Toyama, K. Mishina, H. Tsuchiya, T. Ichikawa, H. Iwaki, Y. Gendai, H. 
Murakami, K. Takamiya, H. Shiroshita, Y. Muramatsu, and T. Furusawa, “ A 
 108 
17.7Mpixel 120fps CMOS Image Sensor with 34.8Gb/s Readout”, in IEEE Int. 
Solid-State Circuits Conf. (ISSCC) Dig. Tech. Papers, 2011, pp. 420-421. 
[40] C. C.-M. Liu, C.-H. Chang, H.-Y. Tu, C. Y.-P. Chao, F.-L. Hsueh, S.-Y. Chen, V. 
Hsu, J.-C. Liu, D.-N. Yaung, and S.-G. Wuu, “A peripheral switchable 3D stacked 
CMOS image sensor”, in Proc. IEEE VLSI Circuits (VLSIC) Symp., 2014, pp. 1-2. 
[41] C. Baek, C. Lim, D. Kim and M. Song, “Design of a 10-bit CMOS Image Sensor 
Based on an 8-bit Configurable Hold-and-Go Counter”, in Proc. IEEE Eur. Solid-
State Circuits Conf. (ESSCIRC), 2012, pp. 393-397. 
[42] S. Yoshihara, Y. Nitta, M. Kikuchi, K. Koseki, Y. Ito, Y. Inada, S. Kuramochi, H. 
Wakabayashi, M. Okano, H. Kuriyama, J. Inutsuka, A. Tajima, T. Nakajima, Y. 
Kudoh, F. Koga, Y. Kasagi, S. Watanabe, and T. Nomoto, “A 1/1.8-inch 6.4 MPixel 
60 frames/s CMOS Image Sensor With Seamless Mode Change”, IEEE J. Solid-
State Circuits (JSSC), vol. 41, no. 12, pp. 2998-3006, Dec. 2006. 
[43] S. Okura, O. Nishikido, Y. Sadanaga, Y. Kosaka, N. Araki, K. Ueda, and F. 
Morishita, “A 3.7M-Pixel 1300-fps CMOS Image Sensor With 5.0 G-Pixel/s High-
Speed Readout Circuit”, IEEE J. Solid-State Circuits (JSSC), vol. 50, no. 4, pp. 
1016-1024, Apr. 2015. 
[44] Y. Lim, K. Koh, K. Kim, H. Yang, J. Kim, Y. Jeong, S. Lee, H. Lee, S.-H. Lim, Y. 
Han, J. Kim, J. Yun, S. Ham, and Y.-T. Lee, “A 1.1e- Temporal Noise 1/3.2-inch 
8Mpixel CMOS Image Sensor using Pseudo-Multiple Sampling”, in IEEE Int. Solid-
State Circuits Conf. (ISSCC) Dig. Tech. Papers, 2010, pp. 396-397. 
[45] Y. Chen, Y. Xu, Y. Chae, A. Mierop, X. Wang, and A. Theuwissen, “A 0.66e-rms 
 109 
Temporal-Readout-Noise CMOS Image Sensor for Low-Light-Level Imaging”, in 
IEEE Int. Solid-State Circuits Conf. (ISSCC) Dig. Tech. Papers, 2012, pp. 384-385. 
[46] H. -S. Lee, L. Brooks, and C.G. Sodini, “Zero-Crossing-Based Ultra-Low-Power 
A/D Converters.” Proceedings of the IEEE, vol. 98, no. 2, pp. 315–332, Feb. 2010.  
[47] L. Brooks, H. –S. Lee, “ A Zero-Crossing-Based 8-bit 200 MS/s Pipelined ADC”, 
IEEE J. Solid-State Circuits (JSSC), vol. 42, no. 12, pp. 2677-2687, Dec 2007.  
[48] S. Lee, A. P. Chandrakasan, H. -S. Lee, “ A 12 b 5-to-50 MS/s 0.5V-to-1 V Voltage 
Scalable Zero-Crossing Based Pipelined ADC”, IEEE J. Solid-State Circuits (JSSC), 
vol. 47, no. 7, pp. 1603-1614, Jul 2012. 
[49] N. Klemmer, S. Akhtar, V. Srinivasan, P. Litmanen, H. Arora, S. Uppathil, S. 
Kaylor, A. Akour, V. Wang, M. Mares, F. Dulger, A. Frank, D. Ghosh, S. 
Madhavapeddi, H. Safiri, J. Mehta, A. Jain, H. Choo, E. Zhang, C. Sestok, C. 
Fernando, K. Rajagopal, S. Ramakhrisnan, V. Sinari, and V. Baireddy “A 45nm 
CMOS RF-to-bits LTE/WCDMA FDD/TDD 2x2 MIMO base-station transceiver 
SoC with 200 MHz RF bandwidth,” in IEEE Int. Solid-State Circuits Conf. (ISSCC) 
Dig. Tech. Papers, 2016, pp. 164–165. 
[50] M. Bolatkale, L. J. Breems, R. Rutten, and K. A. A. Makinwa, “A 4 Ghz 
continuous-time ΔΣ ADC with 70 dB DR and -74 dBFS THD in 125 MHz BW,” 
IEEE J. Solid-State Circuits (JSSC), vol. 46, no. 12 , pp. 2857–2867, Dec. 2011. 
[51] P. Shettigar and S. Pavan, “Design techniques for wideband single-bit continuous-
time ΔΣ modulators with FIR feedback DACs,” IEEE J. Solid-State Circuits (JSSC), 
vol. 47, no. 12, pp. 2865–2879, Dec. 2012. 
 110 
[52] H. Shibata, R. Schreier, W. Yang, A. Shaikh, D. Paterson, and T.C. Caldwell, D. 
Alldred, and P. W. Lai, “A DC-to-1 GHz tunable RF ΔΣ ADC achieving DR= 74 dB 
and BW= 150 MHz at f0 = 450 MHz using 550 mW,” IEEE J. Solid-State Circuits 
(JSSC), vol. 47, no. 12, pp. 2888–2897, Dec. 2012. 
[53] V. Srinivasan, V. Wang, P. Satarzadeh, B. Haroun, and M. Corsi, “A 20mW 61dB 
SNDR (60 MHz BW) 1b 3rd-order continuous-time delta-sigma modulator clocked 
at 6 GHz in 45 nm CMOS,” in IEEE Int. Solid-State Circuits Conf. (ISSCC) Dig. 
Tech. Papers, 2012, pp. 158–159. 
[54] O. Oliaei, “Sigma-delta modulator with spectrally shaped feedback”, IEEE Trans. 
on Circuits and Systems II (TCASII), vol. 50, no. 9, pp. 518-530, Sep, 2003. 
[55] B. Putter, “ΣΔ ADC with finite impulse response feedback DAC”, in IEEE Int. 
Solid-State Circuits Conf. (ISSCC) Dig. Tech. Papers, 2004, pp. 76–77. 
[56] S. Ho, C.-L. Lo, J. Ru, and J. Zhao, “A 23 mW, 73 dB dynamic range, 80 MHz BW 
continuous-time delta-sigma modulator in 20 nm CMOS,” IEEE J. Solid-State 
Circuits (JSSC), vol. 50, no. 4, pp. 908–919, Apr. 2015. 
[57] B. Young, K. Reddy, S. Rao, A. Elshazly, T. Anand, and P.K. Hanumolu, “A 75 dB 
DR 50 MHz BW 3rd order CT-ΔΣ modulator using VCO-based integrators,” in 
Proc. IEEE VLSI Circuits (VLSIC) Symp., 2014, pp. 1–2. 
[58] C. Briseno-Vidrios, A. Edward, A. Shafik, S. Palermo, and J. Silva-Martinez, “A 75 
MHz BW 68dB DR CT-ΣΔ with single amplifier biquad filter and a broadband low-
power common-gate summing technique,” in Proc. IEEE VLSI Circuits (VLSIC) 
Symp., 2015, pp. C254–C255. 
 111 
[59] J. G. Kauffman, P. Witte, M. Lehmann, J. Becker, Y. Manoli, and M. Ortmanns, “A 
72 dB DR, ΔΣ modulator using digitally estimated, auxiliary DAC linearization 
achieving 88 fJ/conv-step in 25 MHz BW,” IEEE J. Solid-State Circuits (JSSC), vol. 
49, no. 2, pp. 392–404, Feb. 2014. 
[60] K. Reddy, S. Dey, S. Rao, B. Young, P. Prabha, and P. K. Hanumolu, “A 54mW 
1.2GS/s 71.5dB SNDR 50MHz BW VCO-based CT ΔΣ ADC using dual 
phase/frequency feedback in 65nm CMOS,” in Proc. IEEE VLSI Circuits (VLSIC) 
Symp., 2015, pp. C256–C257. 
[61] S.-H. Wu, T.-K. Kao, Z.-M. Lee, and J.-Y. Tsai, “A 160 MHz-BW 72 dB-DR 
40mW continuous-time ΔΣ modulator in 16nm CMOS with analog ISI-reduction 
technique,” in IEEE Int. Solid-State Circuits Conf. (ISSCC) Dig. Tech. Papers, 2016, 
pp. 280–281. 
[62] T. Caldwell, D. Alldred, and Z. Lai, “A reconfigurable ΔΣ ADC with up to 100 
MHz bandwidth using flash reference shuffling,” IEEE Trans. on Circuits and 
Systems I (TCASI), vol. 61, no. 8, pp. 2263–2271, Aug. 2014. 
[63] G. Mitteregger, C. Ebner, S. Mechnig, T. Blon, C. Holuigue, and E. Romani, “A 20-
mW 640-MHz CMOS continuous-time ΣΔ ADC with 20-MHz signal bandwidth, 80-
dB dynamic range and 12-bit ENOB ” IEEE J. Solid-State Circuits (JSSC), vol. 41, 
no. 12, pp. 2641–2649, Dec. 2006. 
[64] L. J. Breems, R. Rutten, and G. Wetzker “A cascaded continuous-time ΣΔ 
Modulator with 67-dB dynamic range in 10-MHz bandwidth,” IEEE J. Solid-State 
Circuits (JSSC), vol. 39, no. 12, pp. 2152–2160, Dec. 2004. 
 112 
[65] Y.-S. Shu, J. Kamiishi, K. Tomioka, K. Hamashita, and B-S. Song, “LMS-based 
noise leakage calibration of cascaded continuous-time ΔΣ modulators,” IEEE J. 
Solid-State Circuits (JSSC), vol. 45, no. 2, pp. 368–379, Feb. 2010. 
[66] A. Edward, Q. Liu, C. Briseno-Vidrios, M. Kinyua, E. G. Soenen, A. I. Karsilayan, 
J. Silva-Martinez, “A 43-mW MASH 2-2 CT ΣΔ modulator attaining 74.4/75.8/76.8 
dB of SNDR/SNR/DR and 50 MHz of BW in 40-nm CMOS”, IEEE J. Solid-State 
Circuits (JSSC), early access. 
[67] Y. Dong, W. Yang, R. Schreier, A. Sheikholeslami, and S. Korrapati, “A 
continuous-time 0-3 MASH ADC achieving 88 dB DR with 53 MHz BW in 28 nm 
CMOS,” IEEE J. Solid-State Circuits (JSSC), vol. 49, no. 12, pp. 2868–2877, Dec. 
2014. 
[68] D.-Y. Yoon, S. Ho, and H.-S. Lee, “A continuous-time sturdy-MASH ΔΣ 
modulator in 28 nm CMOS,” IEEE J. Solid-State Circuits (JSSC), vol. 50, no. 12, 
pp. 2880–2890, Dec. 2015. 
[69] Y. Dong, J. Zhao, W. Yang, T. Caldwell, H. Shibata, R. Schreier, Q. Meng, J. Silva, 
D. Paterson, and J. Gealow, “A 930mW 69dB-DR 465MHz-BW CT 1-2 MASH 
ADC in 28nm CMOS,” in IEEE Int. Solid-State Circuits Conf. (ISSCC) Dig. Tech. 
Papers, 2016, pp. 278–279. 
[70] M. Keller, A. Buhmann, F. Gerfers, M. Ortmanns, and Y. Manoli, “On the implicit 
anti-aliasing feature of continuous-time cascaded sigma-delta modulators,” IEEE 
Trans. on Circuits and Systems I (TCASI),  Reg. Papers, vol. 54, no. 12, pp. 2639–
2645, Dec. 2007. 
 113 
[71] S. Yan and E. Sanchez-Sinencio, “A continuous-time ΣΔ modulator with 88-dB 
dynamic range and 1.1-MHz signal bandwidth,” IEEE J. Solid-State Circuits (JSSC), 
vol. 39, no. 1, pp. 75–86, Jan. 2004. 
[72] A. Edward and J. Silva-Martinez, “General analysis of feedback DAC’s clock jitter 
in continuous-time sigma-delta modulators,” IEEE Trans. on Circuits and Systems II 
(TCASII), Exp. Briefs, vol. 61, no. 7, pp. 506–510, Jul. 2014. 
[73] B. K. Thandri and J. Silva-Martinez, “A robust feedforward compensation scheme 
for multistage operational transconductance amplifiers with no miller capacitors,” 
IEEE J. Solid-State Circuits (JSSC), vol. 38, no. 2, pp. 237–243, Feb. 2003. 
[74] S.D. Kulchycki, R. Trofin, K. Vleugels, and B. A. Wooley, “A 77-dB dynamic 
range, 7.5-MHz hybrid continuous-time/discrete-time cascaded ΣΔ modulator,” 
IEEE J. Solid-State Circuits (JSSC), vol. 43, no. 4, pp. 796–804, Apr. 2008. 
[75] J. Winters, S. Kasturia, “Adaptive nonlinear cancellation for high-speed fiber-optic 
systems”, IEEE J. Lightwave Technology, vol. 10, no.7, pp. 971-977, Aug 1992. 
[76] P. Fontaine, A. N. Mohieldin, and A. Bellaouar, “A Low-Noise Low-Voltage CT 
Modulator with Digital Compensation of Excess Loop Delay”, in IEEE Int. Solid-
State Circuits Conf. (ISSCC) Dig. Tech. Papers, 2005, pp. 498–499. 
[77] Y.-S. Shu, J.-Y. Tsai, P. Chen, T.-Y. Lo, P.-C. Chiu, “A 28fJ/conv-step CT ΔΣ 
Modulator with 78dB DR and 18MHz BW in 28nm CMOS Using a Highly Digital 
Multibit Quantizer”, in IEEE Int. Solid-State Circuits Conf. (ISSCC) Dig. Tech. 
Papers, 2013, pp. 268–269. 
 
