# DESIGN TECHNIQUES FOR HIGH SPEED LOW VOLTAGE AND LOW POWER NON-CALIBRATED PIPELINE ANALOG TO DIGITAL CONVERTERS 

A Dissertation<br>by<br>RIDA SHAWKY ASSAAD<br>Submitted to the Office of Graduate Studies of Texas A\&M University in partial fulfillment of the requirements for the degree of DOCTOR OF PHILOSOPHY

December 2009

Major Subject: Electrical Engineering

# DESIGN TECHNIQUES FOR HIGH SPEED LOW VOLTAGE AND LOW POWER 

 NON-CALIBRATED PIPELINE ANALOG TO DIGITAL CONVERTERSA Dissertation<br>by<br>RIDA SHAWKY ASSAAD<br>Submitted to the Office of Graduate Studies of<br>Texas A\&M University in partial fulfillment of the requirements for the degree of DOCTOR OF PHILOSOPHY

Approved by:
Chair of Committee, Jose Silva-Martinez
Committee Members, Edgar Sanchez-Sinencio Frederick Strieter
Mahmoud El-Halwagi
Head of Department, Costas N. Georghiades

December 2009

Major Subject: Electrical Engineering

ABSTRACT<br>Design Techniques for High Speed Low Voltage and Low Power Non-Calibrated<br>Pipeline Analog to Digital Converters. (December 2009)<br>Rida Shawky Assaad, B.S., Texas A\&M University<br>Chair of Advisory Committee: Dr. Jose Silva-Martinez

The profound digitization of modern microelectronic modules made Analog-toDigital converters (ADC) key components in many systems. With resolutions up to 14bits and sampling rates in the 100 s of MHz , the pipeline ADC is a prime candidate for a wide range of applications such as instrumentation, communications and consumer electronics. However, while past work focused on enhancing the performance of the pipeline ADC from an architectural standpoint, little has been done to individually address its fundamental building blocks. This work aims to achieve the latter by proposing design techniques to improve the performance of these blocks with minimal power consumption in low voltage environments, such that collectively high performance is achieved in the pipeline ADC.

Towards this goal, a Recycling Folded Cascode (RFC) amplifier is proposed as an enhancement to the general performance of the conventional folded cascode. Tested in Taiwan Semiconductor Manufacturing Company (TSMC) $0.18 \mu \mathrm{~m}$ Complementary Metal Oxide Semiconductor (CMOS) technology, the RFC provides twice the bandwidth, $8-10 \mathrm{~dB}$ additional gain, more than twice the slew rate and improved noise
performance over the conventional folded cascode-all at no additional power or silicon area. The direct auto-zeroing offset cancellation scheme is optimized for low voltage environments using a dual level common mode feedback (CMFB) circuit, and amplifier differential offsets up to 50 mV are effectively cancelled. Together with the RFC, the dual level CMFB was used to implement a sample and hold amplifier driving a singleended load of 1.4 pF and using only 2.6 mA ; at $200 \mathrm{MS} / \mathrm{s}$ better than 9 bit linearity is achieved. Finally a power conscious technique is proposed to reduce the kickback noise of dynamic comparators without resorting to the use of pre-amplifiers. When all techniques are collectively used to implement a 1 Vpp 10bit 160MS/s pipeline ADC in Semiconductor Manufacturing International Corporation (SMIC) $0.18 \mu \mathrm{~m}$ CMOS, 9.2 effective number of bits (ENOB) is achieved with a near Nyquist-rate full scale signal. The ADC uses an area of $1.1 \mathrm{~mm}^{2}$ and consumes 42 mW in its analog core. Compared to recent state-of-the-art implementations in the $100-200 \mathrm{MS} / \mathrm{s}$ range, the presented pipeline ADC uses the least power per conversion rated at $0.45 \mathrm{pJ} /$ conversion-step.

To my dear parents, my cherished sisters, and my beloved wife.

## ACKNOWLEDGEMENTS

I have been privileged to be a graduate student in the Analog and Mixed Signal Center at Texas A\&M University for the past few years leading to this dissertation. My experience was filled with learning opportunities from a unique faculty and student body. I would like to begin by thanking my advisor, Dr. Jose Silva-Martinez, for his invaluable insights into circuit design that brought about this work, endless patience with a demanding student, and support and guidance in matters both academic and personal. I also would like to thank Dr. Edgar Sanchez-Sinencio for serving on my committee and the different perspectives by which he approached and analyzed technical and nontechnical matters alike. I owe a debt of gratitude to Dr. Frederick Strieter for steering me to analog circuit design from solid-state device physics and for working with him for many years as a teaching assistant. I am also grateful to Dr. Mahmoud El-Halwagi for his open door, listening ears and encouraging words when the going got rough.

Many friendships were made in the midst of circuit design chaos that made this experience all the more memorable. Chinamaya, David, Alberto, Artur, Wennie, Julio, Faisal, Nozahi, John, Raghavendra, Felix, and many more, thank you for an amazing journey; you have taught me many things through your character and example. Special thanks go to Ella, Tammy and Jeanie for bearing with my relentless questions and always having the answer. Finally, thanks to my parents for shaping me into the man I am, my sisters for their encouragement, and my wife for her endurance, kindness, faith and love.

## TABLE OF CONTENTS

CHAPTER Page
I INTRODUCTION ..... 1
A. Motivation ..... 2
B. Research Contribution ..... 4
C. Dissertation Organization ..... 5
II OVERVIEW OF ADCS ..... 7
A. Sampling ..... 8
B. Quantization ..... 11
C. Static ADC Metrics ..... 12

1. Gain ..... 13
2. Offset ..... 14
3. Differential Nonlinearity (DNL) ..... 14
4. Integral Nonlinearity (INL) ..... 14
D. Dynamic ADC Metrics ..... 15
5. Signal to Noise Ratio (SNR) ..... 15
6. Signal to Noise and Distortion Ratio (SNDR) ..... 16
7. Spurious Free Dynamic Range (SFDR) ..... 16
8. Effective Number of Bits (ENOB) ..... 17
E. ADC Architectures ..... 17
III PIPELINE ADCS ..... 19
A. Pipeline ADC Architecture ..... 19
9. 1.5bits/Stage Pipeline Cell ..... 22
10. Higher Resolutions/Stage ..... 25
B. Implementation of Pipeline Stages ..... 27
11. Front-End S/H ..... 27
12. Sub-ADC ..... 29
13. MDAC ..... 30
14. Switches ..... 33
C. Performance Limitations ..... 37
15. Capacitor Mismatch ..... 38
16. Finite Amplifier Gain ..... 40
17. Finite Amplifier GBW ..... 42
18. Amplifier Offset ..... 43
19. Amplifier Distortion ..... 44
20. Finite Switch Resistance ..... 47

## CHAPTER

Page
D. DNL and INL ..... 52
E. Noise ..... 54

1. Switches ..... 55
2. Amplifiers ..... 57
F. Conclusions ..... 58
IV AMPLIFIERS ..... 60
A. The Differential Pair Amplifier ..... 60
3. Bandwidth ..... 62
4. Gain ..... 64
5. Phase Margin ..... 66
6. Slew Rate ..... 70
7. Distortion ..... 73
8. Noise ..... 76
9. Process Variations ..... 81
B. Amplifiers for Pipeline ADCs ..... 85
C. Amplifier Enhancement ..... 88
10. Multi-Stage Amplifiers ..... 89
11. Gain Boosting ..... 90
D. Conclusions ..... 92
V RECYCLING FOLDED CASCODE. ..... 93
A. Proposed Folded Cascode Amplifier ..... 93
B. Recycling Folded Cascode Characteristics ..... 95
12. Small Signal Transconductance ..... 96
13. Low Frequency Gain ..... 96
14. Phase Margin ..... 97
15. Slew Rate. ..... 98
16. Noise ..... 99
17. Input Offset ..... 102
18. Efficiency ..... 104
19. Area and Power ..... 105
C. Characterization ..... 106
D. The Parameter K ..... 112
E. Conclusions ..... 113
VI COMMON MODE FEEDBACK ..... 114
A. Continuous Time vs. Switched Capacitor CMFB ..... 116
B. Switched Capacitor CMFB ..... 116
CHAPTER Page
C. Dual Level CMFB ..... 118
D. Design Concerns of Dual Level CMFB ..... 123
20. Static Performance ..... 123
21. Dynamic Performance ..... 124
E. Conclusions ..... 129
VII COMPARATORS ..... 130
A. Comparator Architectures ..... 130
22. Amplifier-Type Comparators ..... 130
23. Latch-Type Comparators ..... 131
B. Static Comparators ..... 133
24. Class A Output ..... 134
25. Class AB Output ..... 135
C. Dynamic Comparators ..... 135
26. Resistive Divider Input ..... 136
27. Differential Pair Input ..... 137
D. Kickback Noise ..... 138
28. Kickback Noise Reduction in Static Comparators ..... 142
29. Kickback Noise Reduction in Dynamic Comparators ..... 146
E. Comparator Implementation and Simulation Results ..... 154
VIII SIMULATION AND EXPERIMENTAL RESULTS ..... 159
A. Recycling Folded Cascode ..... 161
30. Gain Bandwidth. ..... 162
31. Slew Rate ..... 166
32. Distortion. ..... 167
33. Summary ..... 168
B. Sample and Hold ..... 171
C. Pipeline ADC ..... 180
IX CONCLUSIONS ..... 187
REFERENCES ..... 189
APPENDIX A ..... 202
APPENDIX B ..... 209
APPENDIX C ..... 217

VITA ............................................................................................................................ 221

## LIST OF FIGURES

FIGUREPage
1 Consumer digital photography, (a) before and (b) after the development of the CMOS sensor arrays ..... 1
2 Survey of recent literature; oversampling, $\boldsymbol{\square}$ single-step and $\boldsymbol{\Delta}$ multi- step conversion ADCs ..... 3
3 A continuous analog signal $x(t)$ before (left) and after (right) conversion. ..... 7
4 A general discrete-time ADC block diagram ..... 8
5 Signal spectra, (a) before sampling, (b) after sampling with $f_{S}>2 f_{B}$, (c) after sampling with $f_{S}<2 f_{B}$ ..... 10
6 Samples (dots) of original signal (solid) producing aliased signal (dashed). ..... 10
7 Ideal characteristic of a 3bit ADC; (a) mapping function, (b) quantization noise ..... 11
8 Static errors in a 4bit ADC ..... 13
9 A spectrum of a non-ideal 10bit ADC ..... 15
10 The basic architecture of a pipeline ADC ..... 20
11 Staggered signal processing scheme of pipeline ADCs ..... 21
12 Pipeline cell implementation and transfer function, (a) 1bit/stage and (b) 1.5bits/stage ..... 23
13 Sub-ADC offset effects on (a) 1bit/stage and (b) 1.5bits/stage pipeline cells ..... 24
14 Summation of stage bits to make output code of a 4bit ADC;
(a) 1bit/stage and (b) 1.5bits/stage ..... 25
15 A 2.5bits/stage pipeline cell implementation and transfer function ..... 26
FIGURE Page
16 The flip-around $\mathrm{S} / \mathrm{H}$; (a) circuit realization and (b) non-overlapped clocking scheme ..... 28
17 A 1.5bits/stage sub-ADC, (a) circuit realization and (b) an alternative thermometer-to-binary decoder ..... 29
18 A 1.5bits/stage DAC ..... 30
19 SC amplifier used in 1.5bits/stage MDAC; (a) overall SC amplifier, (b) S/H function in $\Phi_{1}$, (c) gain stage by charge redistribution in $\Phi_{2}$ and (d) inversion of $V_{D A C}$ and addition to $V_{I N}$ when super-positioned on (c) ..... 31
20 Switch implementation using a single MOS device. ..... 33
21 Low voltage limitations on single MOS switches ..... 34
22 Bootstrapped switch; (a) conceptual implementation, (b) $O N$ operation in $\Phi_{1}$ and (c) $O F F$ operation in $\Phi_{2}$ ..... 35
23 Other MOS switch implementations using (a) an I/O device, (b) a regulated supply and (c) a low $V_{T}$ device ..... 36
24 Effect of positive capacitor mismatch on (a) 1bit/stage and (b) 1.5bits/stage ..... 39
25 A 1.5bits/stage loaded MDAC with amplifier non-idealities ..... 40
26 Effect of finite amplifier gain on (a) 1bit/stage and (b) 1.5bits/stage ..... 41
27 Effect of amplifier offset on (a) 1bit/stage and (b) 1.5bits/stage ..... 44
28 Effect of amplifier distortion on (a) 1bit/stage and (b) 1.5bits/stage ..... 45
29 Effect of amplifier $S R$ on $G B W$ requirement ..... 47
30 A 1.5bits/stage MDAC in (a) sampling phase, $\Phi_{1}$, and (b) multiplying and holding phase, $\Phi_{2}$ ..... 48
31 Four terminal NMOS switch; (a) cross-sectional view and (b) schematic view ..... 50
FIGURE ..... Page
32 Effect of variable switch resistance on (a) 1bit/stage and (b) $1.5 \mathrm{bits} /$ stage ..... 51
33 Modified bootstrapped switch with bulk connections. ..... 51
34 Switch resistance noise model. ..... 56
35 Noise sources present in an M bit/stage MDAC for (a) sampling phase and (b) holding phase ..... 57
36 The differential pair amplifier; (a) single-ended and (b) fully-differential ..... 61
37 Small signal macro-model of Fig. 36(a) assuming a load $R_{L} \| C_{L}$ ..... 63
38 Channel length modulation phenomenon in MOS devices ..... 64
39 An example of a closed loop amplifier circuit ..... 66
40 Improved small signal macro-model of Fig. 36(a) ..... 67
41 Closed loop step response of the amplifier in Fig. 36(a) for different phase margin values ..... 69
42 Small signal macro-model of Fig. 36(b) assuming a load $R_{L} \| C_{L}$ ..... 70
43 Amplifier gain degradation for large output swings ..... 74
44 Slew rate induced distortion for fixed settling time, $T_{S} / 2$; thick solid: Vin, thin solid: Vout (linear settling) and dashed: Vout (slew rate limited settling) ..... 76
45 Noise sources modeled for the amplifier in Fig. 36(a) ..... 78
46 Fundamental OTA architectures; (a) telescopic, (b) current mirror and (c) folded cascode ..... 86
47 Miller compensated multi-stage amplifier ..... 89
48 Regulated cascode gain boosting applied to a telescopic OTA ..... 91
49 The conventional folded cascode amplifier ..... 94
FIGURE Page
50 The recycling folded cascode (RFC) amplifier ..... 95
51 Pole-zero locations of the RFC in the s-domain ..... 97
52 Amplifier characterization setup; (a) AC response and noise and (b) transient response and input offset ..... 107
53 Amplifiers AC response; (a) magnitude and (b) phase ..... 107
54 Amplifiers transient response; (a) output voltage and (b) total output current ..... 109
55 Input offset distributions; (a) conventional folded cascode, (b) recycling folded cascode 1 and (c) recycling folded cascode 2 ..... 110
56 Input referred noise power spectral density ..... 111
57 The differential pair amplifier; (a) single-ended and (b) fully-differential ..... 114
58 Fully differential amplifier with CMFB ..... 115
59 Classical switched capacitor CMFB circuit ..... 117
60 Direct auto-zeroing offset cancellation in (a) flip-around S/H and (b) pipeline stage MDAC ..... 119
61 Different input and output CM in a low voltage folded cascode amplifier ..... 120
62 Switched capacitor CMFB circuits; (a) improved classical circuit and (b) dual level CMFB ..... 121
63 Dynamic change in output common mode using a dual level CMFB circuit; (a) $V_{C M F B}$ and (b) output CM. ..... 122
64 Sampled signal in a flip-around SHA showing 50 mV sampled offset and offset free held output. ..... 123
65 Fully-differential folded cascode amplifier with CMFB connections ..... 125
66 Half-circuit small signal model of CMFB loop; (a) classical CMFB, and (b) dual level CMFB ..... 126
FIGURE ..... Page
67 Effects of supply noise components on the PSR of the dual level CMFB. ..... 128
68 Latch-type comparator; (a) conceptual schematic and (b) time-domain waveforms ..... 132
69 Static latch-type comparators; (a) class A and (b) class AB output ..... 134
70 Dynamic latch-type comparators; (a) resistive and (b) differential pair input ..... 136
71 Kickback noise modeling in a pipeline cell ..... 139
72 Kickback noise modeling in a pipeline cell with a preamp driven comparator ..... 142
73 Passive cross-coupled capacitive neutralization technique ..... 144
74 Kickback noise isolation using cascode devices ..... 145
75 Kickback noise coupling in dynamic comparators; (a) through Cgd and (b) through Cgs ..... 147
76 Kickback noise reduction using a charge redistribution capacitive input circuit ..... 149
77 A pipeline cell and its corresponding timing and delay scheme ..... 150
78 Proposed timing scheme for kickback noise reduction ..... 151
79 Proposed kickback noise reduction approach including the core comparator, a replica and the associated timing scheme ..... 153
80 Transformation of a single-ended input comparator to fully differential. ..... 154
81 The effect of kickback noise on MDAC output ..... 155
82 Comparator regeneration time; (a) outputs $V_{A}$ and $V_{\bar{A}}$ and (b) $V_{\text {Latch }}$ ..... 156
83 Comparator and replica total current; (a) control signals $V_{\text {Latch }}$ and $V_{\text {Rep }}$ and (b) total transient current ..... 157
FIGURE84 Amplifiers and S/H prototype chip ............................................................ 160
85 Prototype chip characterization PCB ..... 160
86 Enlarged die section showing the FC, RFC1 and RFC2 with their loads . ..... 161
87 Amplifier prototype test setup ..... 162
88 Experimental frequency sweep of $\mathrm{FC}, \mathrm{RFC} 1$ and RFC2 amplifier outputs ..... 163
89 PCB trace differences for the FC, RFC1 and RFC2 amplifiers ..... 164
90 Small signal step response of the FC, RFC1 and RFC2 amplifiers ..... 165
91 Large signal step response of the FC, RFC1 and RFC2 amplifiers ..... 166
92 Two tone FFT spectrums of the FC, RFC1 and RFC2 for a 1Vpp signal centered at 1 MHz and separated by 100 kHz ..... 169
93 Enlarged die section showing the $\mathrm{S} / \mathrm{H}$ amplifier and open drain buffer ..... 171
94 S/H test setup ..... 172
95 On-chip generation of S/H clock phases ..... 173
96 S/H amplifier AC response ..... 174
97 5MHz input and S/H output at 100MS/s ..... 175
9850 MHz input and $\mathrm{S} / \mathrm{H}$ output at $100 \mathrm{MS} / \mathrm{s}$ ..... 176
99 75MHz input and S/H output at $100 \mathrm{MS} / \mathrm{s}$ ..... 176
$100 \mathrm{~S} / \mathrm{H}$ output spectrum for a $1 \mathrm{Vpp}, 90 \mathrm{MHz}$ input sampled at 200 MHz ..... 178
101 S/H output spectrum for a 2 tone 1 Vpp input centered at 90 MHz and separated by 100 kHz , sampled at 200 MHz ..... 179
102 Expanded spectrum of Fig. 101 ..... 179
FIGURE ..... Page
103 Block diagram of pipeline ADC designed in SMIC $0.18 \mu \mathrm{~m}$ CMOS technology ..... 180
104 Complete pipeline ADC layout in SMIC $0.18 \mu \mathrm{~m}$ measuring $4 \mathrm{x} 4 \mathrm{~mm}^{2}$ and featuring 4 separate ADCs multiplexed to the LVDS output drivers.. ..... 181
105 First pipeline stage amplifier AC response ..... 182
106 Transient simulation of pipeline ADC showing the input signal and outputs of the S/H, P1 and P2 blocks ..... 183
107 1k FFT ADC spectrum for $f_{I N}=78 \mathrm{MHz}$ at $160 \mathrm{MS} / \mathrm{s}$. ..... 184

## LIST OF TABLES

TABLE ..... Page
1 A 1.5bits/Stage Logic Table ..... 29
2 A 1.5bits/Stage DAC Logic Table ..... 30
Fundamental Parameter Summary of OTAs Seen in Fig. 46 ..... 87
4 Amplifier Characterization Results ..... 112
5 Simulation and Experimental Results Performance Summary of the FC, RFC1 and RFC2 Amplifiers ..... 170
6 Performance Summary of Recent State-of-the-Art Pipeline ADCs. ..... 185

## CHAPTER I

## INTRODUCTION

The advancements of CMOS technologies continue to enable the growth of digital systems in size, complexity and robustness. Consequently, more and more signal processing functions are diverted from the analog to the digital domain for increased reliability and reduced cost. Such diversion is illustrated in the simplified consumer digital photography example of Fig. 1. When charge-coupled devices (CCD) were the predominant sensor base, analog conditioning was performed on the picture before it was digitized to simplify the digital processing. As CMOS technologies continued to mature, they brought by the development of CMOS sensors and increased digital signal processing (DSP) power. Now, many of the functions previously performed in the analog domain are carried out in the digital domain with enhanced performance.

(a)

(b)

Fig. 1. Consumer digital photography, (a) before and (b) after the development of the CMOS sensor arrays.

[^0]This domain shift, however, places stringent requirements on the analog to digital converter (ADC). In the digital photography example of Fig. 1, the signal bandwidth of the ADC in Fig. 1(b) needs to be greater than that in Fig. 1(a) to capture the finer details necessary for the additional digital processing; this generally translates to a higher speed and/or resolution for the ADC.

## A. Motivation

The dominion of DSP over core functions of microelectronics systems continues to push the development of high performance ADCs forward, but not without obstacles. The first hurdle is the adaptation of analog circuit design to the low voltage supplies of modern CMOS technologies; reduced device gain and reduced signal swing are a couple of examples of the difficulties faced.

A survey of CMOS ADCs, which shows the resolution/bandwidth plane of different ADC architectures found in recent literature is summarized in Fig. 2 [1]-[29]. A key observation is that oversampling $\Delta \Sigma$ ADCs [1]-[6] are dominant where high resolution is needed in a limited signal bandwidth, whereas single-step Flash ADCs [7][11] push the signal bandwidth envelope for low resolutions. A best line fit of their combined data reveals that multi-stage conversion ADCs [12]-[29], of which the pipeline ADC is the most prevalent architecture, are on the frontier of high speed and high resolution ADCs. This is due to the wide variety of consumer electronics-digital cameras, camcorders, cell phones, digital radio, GPS ... etc-that demand such high performance on both fronts. The increased portability of such systems, however, adds
another critical obstacle to analog circuit design: low power consumption.
Previous work tackled the low voltage low power obstacles of analog design in pipeline ADCs from an architectural level; the optimization of bit resolutions per pipeline stage, amplifier sharing among stages and digital calibration techniques are some examples. While these techniques have proved effective, they withdrew attention away from the fundamental performance of the individual pipeline ADC building blocks. This work will focus on robust low voltage design techniques that reduce power consumption in these blocks without sacrificing, if not improving, performance. The target is a pipeline ADC with 10bit resolution and a signal bandwidth around 100 MHz , since these specifications seem fitting for many applications as concluded from Fig. 2.


Fig. 2. Survey of recent literature; oversampling, $\boldsymbol{\square}$ single-step and $\Delta$ multi-step conversion ADCs.

## B. Research Contribution

The linearity of the pipeline ADC is limited primarily by the performance of the amplifiers and comparators which together make the bulk of its building blocks. Foreground and background digital calibration can be used, but are costly-even unnecessary-for resolutions at or below 12bits. Therefore, it is imperative to advance the analog design techniques on the transistor level. This is achieved in this research by:

- A systematic approach of extracting the non-linearity sources in pipeline ADCs and how they translate to the required specifications of amplifiers and comparators, thus eliminating the need for digital calibration.
- A novel Recycling Folded Cascode amplifier that enhances the majority of the fundamental performance metrics of the conventional folded cascode, and promises significant savings in both power dissipation, and silicon area. The proposed circuit modifications to the conventional Folded Cascode are simple and inexpensive in terms of design, and are robust in low voltage environments.
- A dual level CMFB approach to optimize the auto-zeroing offset cancellation scheme in low voltage environments, which has a direct impact on the performance of pipeline ADCs using high bit resolutions per stage.
- A power conscious technique to reducing the kickback noise of dynamic comparators that are indispensable to the implementation of low power and low voltage pipeline ADCs.

Moreover, the circuit design techniques proposed are not limited to pipeline ADCs, but may also be adopted by many other discrete and continuous time applications.

## C. Dissertation Organization

Chapter II of this dissertation gives an overview of ADCs. The principles of sampling and quantization are presented, and the major static and dynamic metrics commonly used to quantify the performance of ADCs are described. A survey of the recent ADCs in literature concludes the chapter and highlights the role and domain of pipeline ADCs among different ADC architectures.

Chapter III presents the pipeline ADCs in a systematic manner. The architecture is first introduced, and then broken down into the basic building blocks where the sources of the main non-idealities are identified and translated into design specifications. Low voltage implementation concerns and power optimization techniques are also presented. Amplifiers are discussed in Chapter IV; the amplifier limitations previously highlighted in Chapter III are closely examined in Chapter IV and their physical origins are presented. Moreover, some amplifier enhancement techniques are covered.

Chapter V is devoted to the proposed recycling folded cascode amplifier. It covers the design methodology of the amplifier and analytically demonstrates its enhanced performance over the conventional folded cascode.

In Chapter VI, the dual level CMFB approach is introduced and its application to optimize the direct auto-zeroing offset cancellation scheme in low voltage environments is described and demonstrated.

Chapter VII discusses the implementation of low voltage low power dynamic comparators and presents a power conscious technique to reduce their kickback noise without relying on preamplifiers.

Chapter VIII includes the experimental results supporting the proposed circuit techniques; a comparison of the conventional and recycling folded cascode amplifiers performances, a 10bit $200 \mathrm{MS} / \mathrm{s}$ Sample and hold amplifier and simulations of a 10bit $160 \mathrm{MS} / \mathrm{s}$ pipeline ADC are presented. The dissertation is concluded in Chapter IX.

## CHAPTER II

## OVERVIEW OF ADCS

Analog to Digital Converters (ADCs) are the bridge connecting the sensed physical realm to the world of computation. Portable electronics, instrumentation equipment and communications are but a handful of applications where ADCs are at the heart of what humans can perceive and machine can understand. The principal function of ADCs, as the name implies, is the conversion of analog continuous signals to digital binned data-points through the processes of sampling and quantization as depicted by Fig. 3, which shows a signal $x(t)$ before and after conversion.


Fig. 3. A continuous analog signal $x(t)$ before (left) and after (right) conversion.

The resolution of the digital data, or the smallest discernible value by the ADC, is defined by the number of bits $N$ representing the digital data and the signal full-scale range ( $F S$, or $V_{F S}$ for voltage) an ADC can handle. This is commonly referred to by $\Delta-$
the value represented by the least significant bit of the digital output, $1 L S B$-and can be expressed as in (1).

$$
\begin{equation*}
\Delta=\left.\frac{V_{F S}}{2^{N}-1} \cong \frac{V_{F S}}{2^{N}}\right|_{N \gg 1} \tag{1}
\end{equation*}
$$

## A. Sampling

Digital data are discrete in time; this is the result of sampling. A general block diagram of a discrete-time ADC is given in Fig. 4. The first step in analog-to-digital conversion is sampling the analog data so it can be quantized and it is often performed at a uniform rate determined by the ADC sampling clock period $T_{S}$.


Fig. 4. A general discrete-time ADC block diagram.

There are several considerations to the quality of the sampled signal that need be taken during the sampling process, and perhaps the most critical is fulfilling NyquistShannon sampling theorem [30]-[31], which states: a bandwidth limited signal $x(t)$, whose maximum spectral component is at $f_{B}$, can be reconstructed without loss of information if it was sampled at $f_{S}$, where $f_{S}>2 f_{B}$. Ideally, the sampling process yields a
sequence of delta functions whose amplitude is that of the input signal at the sampling instance, and for uniform sampling with time period $T_{S}$, the output can be given by (2).

$$
\begin{equation*}
x(t) \xrightarrow{\text { sampling }} x\left(n T_{S}\right)=\sum_{n=-\infty}^{\infty} x(t) \delta\left(t-n T_{S}\right) \tag{2}
\end{equation*}
$$

In the time domain, the sampled signal looks as shown in Fig. 3 (right) with valid values at integer intervals of $T_{S}$. As for the frequency domain, Fig. 5 shows the spectra of the input signal before and after sampling. The sampling process replicates the signal's spectrum at integer intervals of the sampling frequency $f_{s}$. If, however, the condition set by the sampling theorem was not met, the replicated signal spectra overlap. This spectral overlap is commonly referred to by aliasing. When a sampled signal is reconstructed, only the spectral content in the range $\left\{-f_{S} / 2, f_{S} / 2\right\}$ is used. Clearly in the case of Fig. 5(c) the signal is corrupted. This is represented in the time domain in Fig. 6, where a high frequency signal can be mistaken for a low frequency signal due to aliasing [32]. Another form of aliasing occurs if the signal spectrum had some noise or unwanted content beyond $f_{B}$. Once sampled, this noise is folded back into the range $\quad\left\{-f_{S} / 2, f_{S} / 2\right\}$, thus corrupting the signal. Hence the use of anti-aliasing filters, as shown in Fig. 4, is desirable before an ADC in systems where the signal is noisy or contains unwanted content beyond the bandwidth of interest $f_{B}$.


Fig. 5. Signal spectra, (a) before sampling, (b) after sampling with $f_{S}>2 f_{B}$, (c) after sampling with $f_{S}<2 f_{B}$.


Fig. 6. Samples (dots) of original signal (solid) producing aliased signal (dashed).

## B. Quantization

Digital data are discrete in magnitude; this is the result of quantization. Quantization is perhaps the fundamental differentiating feature between analog and digital data. While analog data can take any value within a specified range, digital data can only assume fixed and predetermined values within the same range. This was presented in Fig. 3 (right), where the sampled data do not necessarily take the same magnitude as the analog data, but the closest predetermined value that causes the least amount of error.

(a)

(b)

Fig. 7. Ideal characteristic of a 3bit ADC; (a) mapping function, (b) quantization noise.

The ideal characteristic mapping function of a 3bit ADC and its quantization error are given in Fig. 7. The main principal of the quantization process is to place the analog input in predefined bins of width $\Delta$ corresponding to a single digital code with a maximum absolute error of $\Delta / 2$ or $1 / 2 L S B$. The errors introduced to the signal in the process are referred to as quantization noise. Under the assumptions that all quantization levels are exercised with equal probability independent of the input and that a large number of uniform quantization levels are used, the quantization noise power can be expressed by (3) [33]. These assumptions, while inaccurate, provide a very good approximation for resolutions greater than 4bits.

$$
\begin{equation*}
P_{Q}=\frac{1}{\Delta} \int_{-\Delta / 2}^{\Delta / 2} e_{q}^{2} \cdot d e_{q}=\frac{\Delta^{2}}{12} \tag{3}
\end{equation*}
$$

## C. Static ADC Metrics

ADCs implemented in silicon do not share the ideal characteristics shown in Fig. 7. Several performance metrics are used to measure an ADC's deviation from its ideal characteristic, and here we discuss some of the static metrics which measure the ADC's performance independent of time or the input signal. A depiction of these metrics is given in Fig. 8.

A transfer characteristic of an ADC such as the one in Fig. 8 can be obtained using a slow ramp signal for the input that spans the whole range of the ADC. The ramp needs to be slow enough such that each code is hit 10 s or 100 s of times. The collected
data can then be plotted such that the number of hits per code represents the code width. The ideal code width can also found from the speed of the ramp.


Fig. 8. Static errors in a 4bit ADC.

## 1. Gain

A gain error is a deviation in the slope of the real transfer characteristic of an ADC from the ideal transfer characteristic. The most practical method to evaluate the gain error is using the endpoint-fit line of the ADC output, as it can provide insight into the ADC dynamic performance. Another method of evaluating the gain error is the bestline fit of the transfer characteristic, which generally yields a smaller gain error value.

## 2. Offset

The offset changes the ADC transfer function by shifting the code transition points by the offset's value. The offset is simply measured by extracting the horizontal intercept of the first code transition less $1 / 2 L S B$.

## 3. Differential Nonlinearity (DNL)

A deviation in the real code width from the ideal code width $\Delta, 1 L S B$, in the transfer function of an ADC constitutes a $D N L$ error. $D N L$ is measured after the gain and offset errors are compensated for in the real transfer characteristic of the ADC. In severe cases where the $D N L$ exceeds $1 L S B$, some digital codes cannot be represented by any analog input-missing codes-and the ADC effectively loses 1bit of resolution. Therefore, practical ADCs are designed with a $D N L$ range of $\{-1 / 2 L S B, 1 / 2 L S B\}$.

## 4. Integral Nonlinearity (INL)

The $I N L$ is the summation of buildup of $D N L$ over the span of the ADC transfer function. It can be evaluated using (4). Note that if the endpoint-fit line method was used to evaluate the gain error, $I N L_{1}=I N L_{2^{N}-1}=0$, and the shape of the $I N L$ plot could accurately predict some of the dynamic performance metrics of the ADC [34]-[35].

$$
\begin{equation*}
I N L_{k}=\sum_{l=1}^{k} D N L_{l} \tag{4}
\end{equation*}
$$

## D. Dynamic ADC Metrics

The dynamic performance of an ADC is strongly related to the input signal bandwidth and conversion speed, and hence its dynamic metrics are generally evaluated for a specific set of conditions. The overall dynamic performance of the ADC is then performed by adjusting a single variable in the set of conditions, and repeating the measurement until the whole ADC range is characterized. A graphical representation of some of the most frequently used dynamic measurements is given in Fig. 9.


Fig. 9. A spectrum of a non-ideal 10bit ADC.

1. Signal to Noise Ratio (SNR)

Spectral noise of the ADC is the random fluctuations that determine the smallest
detectable signal. In Fig. 9 the noise floor level is roughly -92dB; any signal below this level is undetectable. The $S N R$ is the ratio between the power of the $F S$ signal, generally a sinusoid, and the total noise generated by the ADC within the bandwidth of interest. The errors induced by the quantization process set the limit for $S N R$ and the signal to quantization noise ratio can be evaluated using (5).

$$
\begin{equation*}
S N R_{Q}=10 \log \frac{V_{F S}^{2} / 2}{\Delta^{2} / 12} \cong 6.02 N+1.76 \quad[d B] \tag{5}
\end{equation*}
$$

## 2. Signal to Noise and Distortion Ratio (SNDR)

The ADC spectrum of Fig. 9 contains spurs that rise above the noise floor. These spurs are classified as distortion. In general distortion is harmonic to the fundamental, occurring at integer multiples of the fundamental frequency. The quality of the ADC output signal is degraded by distortion. This degradation can be quantified using $S N D R$. Similar to the $S N R$, the $S N D R$ measures the ratio of $F S$ signal power with respect to combined power of noise and distortion components.

## 3. Spurious Free Dynamic Range (SFDR)

The ratio of $F S$ signal root-mean-square (rms) value to the rms value of the highest spur in the ADC spectrum is the $S F D R$. This is generally, but not necessarily, equivalent to the ratio of the fundamental to the $3^{\text {rd }}$ harmonic component in fullydifferential ADCs.

## 4. Effective Number of Bits (ENOB)

For a noiseless ADC-apart from quantization noise-the $S N R$ given by (5) will show that the ADC effectively has $N$ bits of resolution. However, the noise and distortion added by the ADC components result in a smaller value for $S N R$ and $S N D R$ than that predicted by (5). This translates into an effectively lower resolution than what the ADC was designed for. By equating either $S N R$ or $S N D R$ to $S N R_{Q}$ given by (5) and solving for $N$ we can determine the $E N O B$ as in (6) of the ADC. The $E N O B$ of practical ADCs is within 1bit of what the ADC was designed for.

$$
\begin{equation*}
E N O B=\frac{S N D R-1.76}{6.02} \tag{6}
\end{equation*}
$$

## E. ADC Architectures

While ADC characterization is virtually the same for all ADCs, ADC architectures differ according to their respective applications. ADCs can be divided into two major groups from an operation standpoint: oversampling ADCs and Nyquist Rate ADCs. Oversampling ADCs, as the name implies, use a clock frequency $f_{S} \gg 2 f_{B}$. The result is a significantly lower noise floor compared to Nyquist rate ADCs, which benefits the ADC $S N R$ and dynamic range and increases its resolution beyond 16bits [36]. This, however, is at the expense of a very narrow signal bandwidth and hence finds use primarily in high fidelity audio systems and some communication and medical systems.

Nyquist rate ADCs follow the Nyquist-Shannon sampling theorem and hence their bandwidth is solely dependent on how fast can a signal be accurately sampled for a
desired resolution, which in turn is limited by the technology process used. These ADCs can be further divided into two sub-groups: single-step and multi-step conversion ADCs, where the first sub-group is predominately based on the Flash architecture and the second on Sub-Ranging architectures such as successive approximation, time-interleaved and pipeline ADCs [37], where the latter is the main focus of this dissertation.

## CHAPTER III

## PIPELINE ADCS

Pipeline ADCs are Nyquist rate data converters that are able to achieve resolutions up to 14 bits and sampling rates beyond 100 MHz while maintaining a high SNDR and SFDR performance. They are ideal for many applications such as instrumentation (digital oscilloscopes, spectrum analyzers, and medical imaging), communications (video, radar and software radio), and consumer electronics (digital cameras, flat panel displays and high-definition TV). This extensive range of applications along with the design challenges posed by modern day CMOS technologies had been a motivating force behind developing new techniques to enhance the overall performance of pipeline ADCs , particularly its power consumption. In this chapter the pipeline ADC architecture is introduced and the limiting attributes of its building components are highlighted. This is done in an effort to propose new design techniques that can be applied in a low voltage environment and build a pipeline ADC that is very competitive with the state-of-the-art implementations in literature [13], [15], [16], [23], [26].

## A. Pipeline ADC Architecture

The conceptual block diagram of Fig. 10 describes the basic architecture of a pipeline ADC with a resolution of $N$ bits. It consists of a front-end sample and hold $(\mathrm{S} / \mathrm{H})$, several pipeline stages (or cells) and a digital decoder. Each pipeline stage
resolves $M$ bits using a sub-ADC where $M<N$. The digital to analog converter (DAC) converts the $M$ bits back to analog and the result is subtracted from the original signal thus generating a residue. This residue represents the signal portion not yet resolved by the ADC and is passed on to the following stage for further processing. An amplification $G=2^{M}$ is applied to the residue to keep its dynamic range equal to the full scale signal, $V_{F S}$. This is necessary to allow the use of the same reference voltages in all stages for simplicity, and to also relax the design requirements of the sub-ADCs in subsequent stages given that otherwise the residue gets too small to process. At the end of conversion, the $M$ bits resolved by each pipeline stage are decoded with appropriate delays corresponding to each stage and synchronized to form the $N$ bits of the ADC.


Fig. 10. The basic architecture of a pipeline ADC.

Despite its sequential manner of data conversion, the speed of the pipeline ADC is not adversely affected. The use of a $\mathrm{S} / \mathrm{H}$ in each stage ensures that a new signal sample can be acquired every clock cycle while the older samples progress down the pipeline, hence the name. This processing scheme is illustrated in Fig. 11. There is a time latency at startup, however, which amounts to the clock cycles needed to fill up the pipeline. Henceforth, the pipeline ADC generates a new output for every clock cycle.

| $S / H$ | $V_{i n}(n)$ | $V_{i n}(n+1)$ | $V_{i n}(n+2)$ |  |  |  |  |
| :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: |
| $1^{\text {st }}$ Stage | $M_{1}(n-1)$ | $M_{1}(n)$ | $M_{1}(n+1)$ |  | $M_{1}(n+i-3)$ |  |  |
| $2^{\text {nd }}$ Stage | $M_{2}(n-2)$ | $M_{2}(n-1)$ | $M_{2}(n)$ |  | $M_{2}(n+i-4)$ | $M_{2}(n+i-3)$ |  |
| $\vdots$ | $\vdots$ | $\vdots$ | $\vdots$ | $\cdots$ | $\vdots$ | $\vdots$ | $\vdots$ |
| $i_{-1} 1^{\text {th }}$ Stage |  | $M_{i-1}(n-2)$ | $M_{i-1}(n-1)$ |  | $M_{i-1}(n)$ | $M_{i-1}(n+1)$ | $M_{i-1}(n+2)$ |
| $i^{\text {th }}$ Stage |  |  | $M_{i}(n-2)$ |  | $M_{i}(n-1)$ | $M_{i}(n)$ | $M_{i}(n+1)$ |
| Output |  |  |  |  | $N(n-2)$ | $N(n-1)$ | $N(n)$ |

Fig. 11. Staggered signal processing scheme of pipeline ADCs.

There are many variants to the pipeline ADC configuration of Fig. 10. Particularly, the number of bits resolved in each stage is not necessarily kept the same, and modifying the division of bits among stages can have a significant impact on the overall ADC performance as will be demonstrated in later sections. Also, the last stage is typically implemented using only a sub-ADC, or low resolution Flash ADC, since there is no need to generate a residue.

## 1. 1.5bits/Stage Pipeline Cell

The most fundamental pipeline stage implementation commonly used for high speed is the 1.5 bits/stage cell. It is shown in Fig. 12 side by side next to its predecessor the 1bit/stage pipeline cell. The sub-ADCs are implemented using comparators. The S/H, DAC, summation and residue amplification are implemented using the SC amplifier, which makes the multiplying DAC (MDAC). The ideal transfer functions of the $1 \mathrm{bit} /$ stage and $1.5 \mathrm{bits} /$ stage pipeline cells are given by (7) and (8) respectively, where $V_{R}$ is a reference voltage. The reference voltage, $V_{R}$, also defines the full scale range; $V_{F S}$ $=2 V_{R}$. Moreover, the capacitor values here are nominally equal to implement a gain $G$ of 2; $C_{1}=C_{2}=C$.

$$
\begin{gather*}
V_{\text {OUT }}=\left\{\begin{array}{lc}
V_{I N}\left(1+\frac{C_{2}}{C_{1}}\right)+V_{R} \frac{C_{2}}{C_{1}}, & V_{I N}<0 \\
V_{I N}\left(1+\frac{C_{2}}{C_{1}}\right)-V_{R} \frac{C_{2}}{C_{1}}, & V_{I N}>0
\end{array}\right.  \tag{7}\\
V_{\text {OUT }}=\left\{\begin{array}{lc}
V_{I N}\left(1+\frac{C_{2}}{C_{1}}\right)+V_{R} \frac{C_{2}}{C_{1}}, & V_{I N}<-\frac{V_{R}}{4} \\
V_{I N}\left(1+\frac{C_{2}}{C_{1}}\right), & -\frac{V_{R}}{4}<V_{I N}<\frac{V_{R}}{4} \\
V_{I N}\left(1+\frac{C_{2}}{C_{1}}\right)-V_{R} \frac{C_{2}}{C_{1}}, & V_{I N}>\frac{V_{R}}{4}
\end{array}\right. \tag{8}
\end{gather*}
$$



Fig. 12. Pipeline cell implementation and transfer function, (a) 1bit/stage and (b) $1.5 \mathrm{bits} / \mathrm{stage}$.

In reality, the transfer functions of (7) and (8) suffer errors from imperfections in circuit implementation. For example, suppose the decision level of the 1bit/stage is shifted by $\delta$ due to a comparator offset, then the output exceeds the $\left\{-V_{R}, V_{R}\right\}$ range by $2 \delta$. If the error $2 \delta$ is greater than the quantization noise of the remaining stages, $V_{R} / 2^{N r}$ ( $1 / 2 L S B$ ), where $N r$ is the resolution of the remaining pipeline stages, a conversion error
occurs and the resolution of the whole ADC is reduced. Hence, such an error is most severe in the early stages of the ADC. On the other hand, the 1.5 bits/stage shifts the original decision level and adds another to avoid exceeding the output range. By choosing the decision levels $\pm V_{R} / 4$, the immunity of the 1.5 bits/stage against comparator offsets is maximized. The transfer functions of the 1bit/stage and 1.5bits/stage including offsets in the sub-ADC are shown in Fig. 13; offsets as large as $V_{R} / 4$ can be tolerated in $1.5 \mathrm{bits} /$ stage without exceeding the range $\left\{-V_{R}, V_{R}\right\}$.

(a)

(b)

Fig. 13. Sub-ADC offset effects on (a) 1bit/stage and (b) 1.5bits/stage pipeline cells.

The use of an additional decision level in the 1.5 bits/stage is called digital correction [38], and ADCs utilizing this correction method are named redundant signed digit (RSD) ADCs [39]. The correction is realized in the fact that the stage now gives 2 bits instead of 1 . These 2 bits, however, are incomplete as not all 2 bit levels are used in
the $1.5 \mathrm{bits} /$ stage. The most significant bit (MSB) is the actual bit resolved, while the $L S B$ acts as an uncertainty flag; codes $\mathbf{0 0}$ and $\mathbf{1 0}$ are a certain 0 and 1 respectively, but 01 denotes a result needing further processing. If an uncertainty arises from an input within $\pm V_{R} / 4$ but close to either decision level, it may quickly be resolved by the next stage, but an uncertainty arising from an input closer to 0 may need the whole pipeline to resolve it. Also, as 1 bit is used for correction, the gain of the stage $G$ (Fig. 10) remains $2^{M}$ instead of $2^{M+1}$. Finally, the digital decoding and correction process is a weighted summation of the $M+l$ bits from all stages for a given input sample and is depicted in Fig. 14 for a 4-stage 4bit pipeline ADC with a single comparator for the last stage.

| Stage | $2^{3}$ | $2^{2}$ | $2^{1}$ | $2^{0}$ |
| :---: | :---: | :---: | :---: | :---: |
| $M_{1}$ | $X$ |  |  |  |
| $+M_{2}$ |  | $X$ |  |  |
| $+M_{3}$ |  |  | $X$ |  |
| $+M_{4}$ |  |  |  | $X$ |
| $N$ | $X$ | $X$ | $X$ | $X$ |

(a)

| Stage | $2^{3}$ | $2^{2}$ | $2^{1}$ | $2^{0}$ |
| :---: | :---: | :---: | :---: | :---: |
| $M_{1}$ | $X$ | $X$ |  |  |
| $+M_{2}$ |  | $X$ | $X$ |  |
| $+M_{3}$ |  |  | $X$ | $X$ |
| $+M_{4}$ |  |  |  | $X$ |
| $N$ | $X$ | $X$ | $X$ | $X$ |

(b)

Fig. 14. Summation of stage bits to make output code of a 4bit ADC; (a) 1bit/stage and (b) $1.5 \mathrm{bits} /$ stage .

## 2. Higher Resolutions/Stage

Digitally corrected pipeline stages with higher bit resolutions are possible and are often used in higher resolution ADCs (12-14bits). They are easily derived from the original cells without digital correction by adding additional comparators to the sub-

ADC , and evenly separating their decision levels by $V_{R} / 2^{M}$ symmetrically around 0 input. A 2.5 bits/stage pipeline cell is presented in Fig. 15 with its ideal transfer function, which is given in (9). While higher resolutions/stage offers attractive benefits as will be demonstrated later, two main aspects need to be considered in their implementation. First, increasing the number of comparators means power-efficient designs need to be adopted. Second, the immunity against sub-ADC errors is reduced.


Fig. 15. A 2.5bits/stage pipeline cell implementation and transfer function.

$$
V_{\text {OUT }}=\left\{\begin{array}{lr}
4 V_{I N}+3 V_{R}, & V_{I N}<-5 V_{R} / 8  \tag{9}\\
4 V_{I N}+2 V_{R}, & -5 V_{R} / 8<V_{I N}<-3 V_{R} / 8 \\
4 V_{I N}+V_{R}, & -3 V_{R} / 8<V_{I N}<-V_{R} / 8 \\
4 V_{I N}, & -V_{R} / 8<V_{I N}<V_{R} / 8 \\
4 V_{I N}-V_{R}, & V_{R} / 8<V_{I N}<3 V_{R} / 8 \\
4 V_{I N}-2 V_{R}, & 3 V_{R} / 8<V_{I N}<5 V_{R} / 8 \\
4 V_{I N}-3 V_{R}, & V_{I N}>5 V_{R} / 8
\end{array}\right.
$$

## B. Implementation of Pipeline Stages

The following sub-sections treat the implementation of the front-end $\mathrm{S} / \mathrm{H}$ and the individual pipeline stages. The topologies most commonly used to realize these functions are presented, and whenever applicable the limitations or concerns regarding their performance in low voltage environments are addressed.

## 1. Front-End $S / H$

The front-end $\mathrm{S} / \mathrm{H}$ is a SC circuit that relies greatly on the performance of amplifier used to implement it. Here we examine the topology from a functional standpoint; the effects of amplifier non-idealities are examined in section $C$, while their physical sources, low voltage limitations and the amplifier architectures best suited for pipeline ADCs are discussed in detail in Chapter IV.

The flip-around $\mathrm{S} / \mathrm{H}$ [40] is perhaps the most adopted architecture for the pipeline ADC front-end. It is given in Fig. 16 in the single-ended form along with the non-overlapped clock phases used to perform its function and operates as follows. During the sampling phase, $\Phi_{1}$, the amplifier is reset providing a virtual ground at $x$, and the input is sampled on $C_{S H}$. Hence, the charge stored on $C_{S H}$ by the end of $\Phi_{1}$ referenced to $x$ is given by (10). In the holding phase, $\Phi_{2}, C_{S H}$ is flipped-around and connected to the output. The charge stored on $C_{S H}$ by the end of $\Phi_{2}$ referenced to $x$ is now given by (11). Since there is no discharge path for $C_{S H}$ between $\Phi_{1}$ and $\Phi_{2}$, the charge is conserved and the output can be expressed by (12). As for the early falling-edge phase, $\Phi_{l e}$, it is used to implement a technique commonly known as bottom-plate sampling; it
defines the sampling moment and effectively minimizes switching errors associated with charge injection and clock feed through, especially in fully-differential implementations.


Fig. 16. The flip-around $\mathrm{S} / \mathrm{H}$; (a) circuit realization and (b) non-overlapped clocking scheme.

$$
\begin{gather*}
q_{\Phi 1}=C_{S H} V_{I N}  \tag{10}\\
q_{\Phi 2}=C_{S H} V_{O U T}  \tag{11}\\
q_{\Phi 1}=q_{\Phi 2} \Rightarrow V_{O U T}=V_{I N} \tag{12}
\end{gather*}
$$

There are several benefits to the flip-around $\mathrm{S} / \mathrm{H}$ that led to its popularity. The resetting of the amplifier during $\Phi_{1}$ samples the amplifier offset on $C_{S H}$, which is then effectively cancelled in $\Phi_{2}$; this is known as direct auto-zeroing [41]. Another benefit is the relaxation of the amplifier slew rate requirement for Nyquist-rate input signals; the resetting of the amplifier in $\Phi_{1}$ ensures a maximum step of half the full-scale input between consecutive samples, while track and hold circuits or $\mathrm{S} / \mathrm{H}$ with previous output memory experience a full-scale step between samples at Nyquist-rate. Finally, during $\Phi_{2}$ the amplifier feedback factor is almost unity, which reduces its bandwidth requirement.

## 2. Sub-ADC

The sub-ADC can be divided into two parts, analog and digital. The analog part is implemented using comparators, which generate a thermometer code representing the position of the input with respect to pre-set voltage references; the circuit realization and design considerations of comparators are presented in detail in Chapter VII. As for the digital part of the sub-ADC, it is a simple thermometer-to-binary decoder. The complete implementation of a 1.5 bits/stage sub-ADC showing the thermometer-to-binary decoder is depicted in Fig. 17, and a truth table of the logic is given in Table 1.

(a)

(b)

Fig. 17. A 1.5bits/stage sub-ADC, (a) circuit realization and (b) an alternative thermometer-to-binary decoder.

TABLE 1
A 1.5bits/Stage Logic Table

| CASE | $V_{I N}$ | CMP $_{2}$ | CMP $_{1}$ | $M_{2}$ | $M_{1}$ |
| :---: | :---: | :---: | :---: | :---: | :---: |
| 1 | $V_{I N}<-V_{R} / 4$ | 0 | 0 | 0 | 0 |
| 2 | $-V_{R} / 4<V_{I N}<V_{R} / 4$ | 0 | 1 | 0 | 1 |
| $3^{*}$ | $V_{I N}<-V_{R} / 4, V_{I N}>V_{R} / 4$ | 1 | 0 | $\mathrm{X}(0)$ | $\mathrm{X}(1)$ |
| 4 | $V_{I N}>V_{R} / 4$ | 1 | 1 | 1 | 0 |

* Not feasible, but can simplify logic implementation if used; $\mathrm{X}=$ don't care.


## 3. $M D A C$

The MDAC in Fig. 10 acts as a DAC, S/H, adder and gain stage. For the 1.5bits/stage pipeline cell given in Fig. 12(b), the DAC can be realized as shown in Fig. 18 following the logic of Table 2. Note that the implementation of the switch signals $S_{2}$ and $S_{3}$ overlap with $m_{1}$ and $m_{2}$ from the sub-ADC.


Fig. 18. A 1.5bits/stage DAC.

TABLE 2
A 1.5bits/Stage DAC Logic Table.

| CMP $_{2}$ | CMP $_{1}$ | $S_{3}$ | $S_{2}$ | $S_{1}$ | $V_{D A C}$ |
| :---: | :---: | :---: | :---: | :---: | :---: |
| 0 | 0 | 0 | 0 | 1 | $-V_{R}$ |
| 0 | 1 | 0 | 1 | 0 | 0 |
| 1 | 1 | 1 | 0 | 0 | $V_{R}$ |

As for the $\mathrm{S} / \mathrm{H}$, adder and gain stage, they are all embedded in the SC amplifier of Fig. 19(a); the individual functions are depicted in Fig. 19(b-d). The SC amplifier uses the same non-overlapped clocking scheme of Fig. 16(b), and its operation is described as
follows with all charge referenced to the virtual ground node $x$. In $\Phi_{1}$ the input $V_{I N}$ is sampled on $C_{1}$ and $C_{2}$ as shown in Fig. 19(b), and the total stored charge at the end of this phase is given by (13). In $\Phi_{2}$ the amplifier is reconfigured; $C_{1}$ is connected to the amplifier output, while $C_{2}$ is connected to the DAC output, $V_{D A C}$. The total stored charge at the end of $\Phi_{2}$ is given by (14). While the charge stored on $C_{1}$ and $C_{2}$ has changed between $\Phi_{1}$ and $\Phi_{2}$, no charge has been lost or added to the system and hence the change in charge is attained by redistribution. By charge conservation, the transfer function of the SC amplifier is expressed by (15), which demonstrates the gain applied to $V_{I N}$, the inversion of $V_{D A C}$ and their summation. For the 1.5bits/stage pipeline cell, $C_{1}$ and $C_{2}$ are nominally equal to achieve a gain of 2 .


Fig. 19. SC amplifier used in $1.5 \mathrm{bits} /$ stage MDAC; (a) overall SC amplifier, (b) S/H function in $\Phi_{1}$, (c) gain stage by charge redistribution in $\Phi_{2}$ and (d) inversion of $V_{D A C}$ and addition to $V_{I N}$ when super-positioned on (c).

$$
\begin{gather*}
q_{\Phi 1}=\left\{\begin{array}{l}
q_{C 1, \Phi 1}=C_{1} V_{I N} \\
q_{C 2, \Phi 1}=C_{2} V_{I N}
\end{array}\right.  \tag{13}\\
q_{\Phi 2}=\left\{\begin{array}{l}
q_{C 1, \Phi 2}=C_{1} V_{O U T} \\
q_{C 2, \Phi 2}=C_{2} V_{D A C}
\end{array}\right.  \tag{14}\\
\Delta q_{C 1}+\Delta q_{C 2}=0 \Rightarrow V_{O U T}=V_{I N}\left(1+\frac{C_{2}}{C_{1}}\right)-V_{D A C} \frac{C_{2}}{C_{1}} \tag{15}
\end{gather*}
$$

The same result in (15) can be obtained by applying superposition and considering $V_{I N}$ and $V_{D A C}$ individually. Considering $V_{I N}$ only, the amplifier is configured in $\Phi_{2}$ as shown in Fig. 19(c), and $C_{2}$ is discharged. However, $x$ acts as a virtual ground and the charge is not lost, but transferred to $C_{1}$. Now $C_{1}$ holds the charge of $C_{1}$ and $C_{2}$ from the previous phase, which generates to a higher voltage across $C_{1}$, and hence $V_{I N}$ is amplified by $\left(C_{1}+C_{2}\right) / C_{1}$. Now considering $V_{D A C}$ only, the amplifier is configured in $\Phi_{2}$ as shown in Fig. 19(d), which is an inverting topology with gain $-C_{2} / C_{1}$. By adding the outputs of each scenario, the transfer function is identical to that given by (15).

The SC amplifier in Fig. 19(a) shares many benefits with the flip-around S/H in Fig. 16(a); this is expected since the flip-around $\mathrm{S} / \mathrm{H}$ is a special case of the SC amplifier in Fig. 19(a) with $C_{2}=0$. Nonetheless, they also share the same low voltage limitation. The direct auto-zeroing technique used in these circuits to cancel the amplifier offset relies on the input and output common modes of the amplifier to be the same. Low voltage applications (1.2V), however, dictate different input and output common modes, where the first is set to optimize the amplifier for speed and the latter for signal swing.

Hence, direct auto-zeroing cannot be used. This is explored further in Chapter VI, where a dual level common mode feedback (CMFB) is proposed to alleviate this issue and reinstate the use of the direct auto-zeroing technique to cancel the amplifier offset. Other amplifier limitations and their effects on the MDAC performance are examined in section $C$.

## 4. Switches

Switches are a fundamental component of SC circuits and play a significant role in their performance. The simplest implementation of a switch is a single MOS device, as depicted in Fig. 20. The $O N$ phase $\Phi_{l}$ is an active high ( $V_{D D}$ ) or active low (GND) for an NMOS and PMOS switches respectively. When the switch is $O N$, the MOS device operates in the linear region and its resistance $R_{O N}$ is expressed by (16) for an NMOS, where $W$ and $L$ are the device dimensions, $C_{o x}$ is the oxide capacitance, $\mu$ is the carrier mobility, $V_{G S}$ is the gate to source voltage and $V_{T}$ is the threshold voltage.


Fig. 20. Switch implementation using a single MOS device.

$$
\begin{equation*}
R_{O N}=\frac{L}{\mu_{N} C_{o x} W\left(V_{G S}-V_{T}\right)} \tag{16}
\end{equation*}
$$

While the realization of switches as in Fig. 20 is straight-forward, their implementation in modern CMOS technologies poses many challenges. Here we discuss the most fundamental challenge: to turn the switch on; the effects of switches on performance are reserved for section $C$.

The minimum requirement for a MOS switch to turn on is $V_{G S}>V_{T}$. This means, and according to Fig. 20, that $V_{I N}$ needs to be at least a $V_{T}$ below $V_{D D}$ for an NMOS, or a $V_{T}$ above $G N D$ for a PMOS. The low voltage supplies imposed by modern CMOS technologies severely limit the input range satisfying these conditions such that a simple implementation MOS switches as in Fig. 20 is no longer feasible; this is depicted in Fig. 21 where both NMOS and PMOS switches are $O F F$ around the ideal signal range centered around $V_{D D} / 2$.


Fig. 21. Low voltage limitations on single MOS switches.

Several techniques have been developed to tackle this limitation, and the bootstrapping technique is quite possibly the most utilized [42]. Conceptually, the bootstrapping technique is presented in Fig. 22(a). When the switch is turned on in $\Phi_{1}$, the pre-charged capacitor $C_{B}$ is applied across the gate and source of $M_{S W}$ thereby fixing
its $V_{G S}$ to $V_{D D}$ as shown in Fig. 22(b). This enables $V_{I N}$ to take any value in the $\{G N D$, $\left.V_{D D}\right\}$ range without turning off the switch. Moreover, not only does the use of bootstrapping minimize $R_{O N}$ by maximizing $V_{G S}$, but also helps keep its value fixed for any $V_{I N}$ and hence reduce its non-idealities. However, bootstrapping comes with a high price; every bootstrapped switch needs an independent $C_{B}$, which is fairly sizable $(>0.5 \mathrm{pF})$ to maintain a steady voltage. Nevertheless, bootstrapping is indispensable for high resolution (12-14bits) pipeline ADCs.

(a)

(b)

(c)

Fig. 22. Bootstrapped switch; (a) conceptual implementation, (b) $O N$ operation in $\Phi_{1}$ and (c) $O F F$ operation in $\Phi_{2}$.

Another technique, which is more technology based, is the use of dual gate oxide (DGO) processes. In these processes, two types of MOS devices are used, where one is optimized for speed and low voltage operation (thinner oxide), and the other is optimized for input/output (I/O) interfaces and can tolerate higher voltage stresses (thicker oxide). Some examples are the $65 \mathrm{~nm} / 90 \mathrm{~nm} / 130 \mathrm{~nm}$ CMOS in $1.2 / 2.5 \mathrm{~V}$ and $0.18 \mu \mathrm{~m}$ CMOS in $1.8 / 3.3 \mathrm{~V}$. Therefore, the switch and its clocking scheme can be implemented with I/O devices, while the rest of the analog blocks are implemented with
low voltage devices as shown in Fig. 23(a). While the implementation is not costly since both MOS device masks are standard is the process, the I/O devices have smaller $\mu_{N}$ and larger $V_{T}$, which makes it difficult to minimize $R_{O N}$ without using large dimensions. Moreover, $R_{O N}$ is not fixed with $V_{I N}$ swing as $V_{G S}$ is not constant. These shortcomings limit the use to 8-10bits of resolution.

On the other hand, regulated power supplies as proposed in [13] and [23] may be used. Although the use of the technique as depicted in Fig. 23(b) is not how it was intended in [13] and [23], the basic principle is to provide a supply level $V_{D D, R}$ that is higher than the low voltage supply, $V_{D D, L}\left(V_{D D}\right.$ in Fig. 21). Hence, the useful range of the low voltage MOS device is extended without excessive oxide stress. This approach reclaims many benefits of using the fast low voltage MOS device as a switch, but $R_{O N}$ still varies with $V_{I N}$ and the resolution is limited to less than 12bits.


Fig. 23. Other MOS switch implementations using (a) an I/O device, (b) a regulated supply and (c) a low $V_{T}$ device.

Finally, and in recent years, low $V_{T}$ devices have been adopted as switches in pipeline ADCs [15], and their use is shown in Fig. 23(c). The robustness of the low $V_{T}$ MOS devices may still be a topic for debate, but certainly has improved as they are becoming standard devices with the increased integration of systems on single chips. Nonetheless, they still suffer the same variable $R_{O N}$ as the previously discussed techniques. With proper sizing, resolutions up to 10bits may be attainable.

In summary, the bootstrapped switch is still the best option as far as linearity (up to +14 bits) and robustness are concerned. However, if the target resolution is below 12bits, then the techniques in Fig. 23 may provide a cheaper, but reliable, alternative solutions.

## C. Performance Limitations

The non-idealities of the SC circuit components used to implement the front-end S/H and MDAC can introduce conversion errors that limit the performance of the pipeline ADC. The main causes for such errors are the capacitor mismatch, finite amplifier gain, finite amplifier bandwidth, amplifier offset, distortion and finite switch resistance. The following sub-sections the effects of each on the transfer function of a 1.5 bits/stage pipeline cell are examined, and the minimum requirements for the first stage MDAC in a 1 Vpp 10bit $160 \mathrm{MS} / \mathrm{s}$ pipeline ADC are evaluated. Also, graphical representations of the effects on the $1 \mathrm{bit} /$ stage will be included to aid the discussion. The physical sources of amplifier non-idealities and the low voltage challenges of implementing robust amplifiers are discussed in detail in Chapter IV.

## 1. Capacitor Mismatch

The capacitors used in 1.5bits/stage pipeline cell are nominally equal; $C_{1}=C_{2}=C$. Process variations and design imperfections, however, induce mismatches between the capacitor values, which result in errors in the transfer function. Equation (17) shows the transfer function (15) of the pipeline cell in the presence of capacitor mismatch. The mismatch considered in (17) is relative; one capacitor with respect to the other.

$$
\begin{align*}
V_{\text {OUT }} & =V_{I N}\left(1+\frac{C+\Delta C}{C}\right)-V_{D A C}\left(\frac{C+\Delta C}{C}\right)  \tag{17}\\
& =2 V_{I N}\left(1+\frac{\Delta C}{2 C}\right)-V_{D A C}\left(1+\frac{\Delta C}{C}\right)
\end{align*}
$$

The ideal transfer function (dotted) and one affected by a positive capacitor mismatch (solid) are shown in Fig. 24. The 1bit/stage experiences a gain conversion error as the output is beyond the range $\left\{-V_{R}, V_{R}\right\}$. Likewise, the $1.5 \mathrm{bit} /$ stage is corrupted even with its output being within $\left\{-V_{R}, V_{R}\right\}$, and the error is maximized at $V_{I N}=0$ if we consider a potential comparator offset of $\pm V_{R} / 4$. To maintain the performance of the overall ADC, the maximum error must be less than the quantization noise-half an $L S B$ of the remaining stages. This is expressed by (18) where the maximum error is obtained by using $V_{I N}=0$ and $V_{D A C}=V_{R}$ in (17). Given that the capacitor value is determined by its dimensions and process parameters, a simple model for the capacitor mismatch is derived and presented in (19), where $W, L, C_{o x}, \varepsilon_{o x}$ and $t_{o x}$ are the capacitor width and length, oxide capacitance, permittivity and thickness respectively.


Fig. 24. Effect of positive capacitor mismatch on (a) 1bit/stage and (b) 1.5bits/stage.

While variations in the oxide thickness are process dependent and out of the designer's control, the choice of the capacitor dimensions is crucial to reducing mismatch errors; for metal-insulator-metal (MIM) capacitors in $0.18 \mu \mathrm{~m}$ CMOS with a density of $\sim 1 \mathrm{fF} / \mu \mathrm{m}^{2}$, a $3 \sigma$ mismatch of less than $0.2 \%$ - 9 bit accuracy - between two 1 pF capacitors is achievable. Also, the dependency of the mismatch error on the resolution of the remaining stages as demonstrated by (18) is very significant, because it allows for the implementation of 12-14bit non-calibrated pipeline ADCs if higher bit resolution stages (3.5 or 4.5) are utilized in the first stage [43].

$$
\begin{gather*}
V_{R} \frac{\Delta C}{C}<\frac{2 V_{R}}{2^{N r+1}} \Rightarrow \frac{\Delta C}{C}<\frac{1}{2^{N r}}  \tag{18}\\
C=W L C_{o x}=W L \frac{\varepsilon_{o x}}{t_{o x}} \Rightarrow \frac{\Delta C}{C} \equiv \frac{\sigma C}{C}=\sqrt{\left(\frac{\sigma W}{W}\right)^{2}+\left(\frac{\sigma L}{L}\right)^{2}+\left(\frac{\sigma t_{o x}}{t_{o x}}\right)^{2}} \tag{19}
\end{gather*}
$$

Finally, while the MDAC performance is prone to capacitor mismatch errors, the flip-around $\mathrm{S} / \mathrm{H}$ previously discussed is not, even in the fully-differential realization. This is confirmed by examining its transfer function given by (12), where it is clearly capacitor independent. Hence the capacitor size of the front-end $\mathrm{S} / \mathrm{H}$ need only be as large as dictated by the noise requirement as will be explained in section $E$. This is another strong advantage for the flip-around S/H over other S/H implementations.

## 2. Finite Amplifier Gain

The transfer functions (7)-(9), (12) and (15) were derived assuming an ideal amplifier. Consider the non-ideal MDAC amplifier in Fig. 25 with finite gain $A$, parasitic capacitance $C_{P}$ and input offset $V_{O S}$. By conserving the charge at $x$ between clock phases and adding capacitor mismatch, the 1.5 bits/stage transfer function given by (15) is modified to (20), which for now ignores the effects of $V_{O S}$. Here $\beta_{\Phi 2}$ is the amplifier feedback factor during $\Phi_{2}$ and is given by the last term in (21) for a general $M$ bit pipeline stage using digital correction and neglecting capacitor mismatch.


Fig. 25. A 1.5bits/stage loaded MDAC with amplifier non-idealities.

$$
\begin{gather*}
V_{\text {OUT }} \cong\left(1-\frac{1}{\beta_{\Phi 2} A}\right)\left[2 V_{I N}\left(1+\frac{\Delta C}{2 C}\right)-V_{D A C}\left(1+\frac{\Delta C}{C}\right)\right]  \tag{20}\\
\beta_{\Phi 2}=\frac{C_{1}}{C_{1}+C_{2}+C_{P}} \cong \frac{1}{2^{M}\left(1+\frac{C_{P}}{2^{M} C}\right)} \tag{21}
\end{gather*}
$$

The finite gain also results in a gain error similar to capacitor mismatch. However, unlike capacitor mismatch which affects individual terms in the transfer function and can be positive or negative, the finite gain error scales the whole transfer function and is always negative as seen in Fig. 26. It reaches its maximum when $V_{I N}=-$ $V_{R}, O, V_{R}$, which needs to be kept under $1 / 2 L S B$ of the remaining stages. Ignoring all errors but the finite gain, the minimum gain required is given by (22).


Fig. 26. Effect of finite amplifier gain on (a) 1bit/stage and (b) 1.5bits/stage.

$$
\begin{equation*}
\frac{V_{R}}{\beta_{\Phi 2} A}<\frac{2 V_{R}}{2^{N r+1}} \Rightarrow A>2^{N r+M}\left(1+\frac{C_{P}}{2^{M} C}\right) \tag{22}
\end{equation*}
$$

This shows that the gain of the amplifier in the first 1.5 bit/stage MDAC needs to be at least $2^{N}$ at maximum output swing ( $N r=N-1, M=1$ ). For a 10bit ADC, the gain of the first MDAC amplifier needs to be at least 60.2 dB , ignoring $C_{P}$. Since this gain is at maximum output swing, the nominal gain may need to be an order of magnitude (20dB) greater than the limit set by (22); that is 80.2 dB . As for the front-end flip-around $\mathrm{S} / \mathrm{H}$, recall that it is a special case of the 1.5 bits/stage MDAC when $C_{2}=0$. For the same 10 bits ADC and using (22), the minimum $\mathrm{S} / \mathrm{H}$ amplifier gain needs to be greater than 60.2 dB at maximum output swing.

## 3. Finite Amplifier GBW

The amplifier speed is limited by its gain bandwidth product (GBW) and its configuration feedback factor. Together they define the system time constant $\tau$, which is given by (23) for the 1.5 bits/stage MDAC during $\Phi_{2}$. On the other hand, the functions performed by the MDAC are time limited; since $\Phi_{1}$ and $\Phi_{2}$ are symmetric, the maximum time allocated for the MDAC to generate the output is half the clock period, $T_{S} / 2$. As the input to the MDAC is best modeled by a step, and the output is reset during $\Phi_{1}$, the transfer function of the 1.5 bits/stage may be further modified to give (24).

$$
\begin{gather*}
\tau=\frac{1}{\beta_{\Phi 2} G B W}  \tag{23}\\
V_{\text {OUT }} \cong\left(1-e^{-\frac{T_{s} \beta_{\Phi 2} G B W}{2}}\right)\left(1-\frac{1}{\beta_{\Phi_{2}} A}\right)\left[2 V_{I N}\left(1+\frac{\Delta C}{2 C}\right)-V_{D A C}\left(1+\frac{\Delta C}{C}\right)\right] \tag{24}
\end{gather*}
$$

Therefore, the incomplete settling due to the finite amplifier $G B W$ applies another scaling factor to the transfer function similar to the finite amplifier gain, and results in the same effect as in Fig. 26. This error is maximum when $V_{I N}=-V_{R}, 0, V_{R}$, and needs to be less than $1 / 2 L S B$ of the remaining stages. Considering only the effect of the finite amplifier $G B W$, a minimum specification is set by the condition in (25). For the first stage MDAC operating at $160 \mathrm{MS} / \mathrm{s}$ with $5 \%$ clock non-overlap and assuming $C_{P}=$ $0.2 C$, a minimum $G B W$ of 745 MHz is required. As for the front-end $\mathrm{S} / \mathrm{H}$ with similar assumption, a minimum GBW of 450 MHz is needed. In a real design, however, the $G B W$ values implemented for the front-end $\mathrm{S} / \mathrm{H}$ and the MDAC amplifier almost double these calculated values; this will be examined in section $F$.

$$
\begin{equation*}
V_{R} e^{-\frac{T_{S} \beta_{\phi_{2}} G B W}{2}}<\frac{2 V_{R}}{2^{N r+1}} \Rightarrow G B W>\frac{2^{M+1}}{T_{S}}\left(1+\frac{C_{P}}{2^{M} C}\right) \ln 2^{N r} \tag{25}
\end{equation*}
$$

## 4. Amplifier Offset

The effect of the amplifier offset as depicted by Fig. 25, can be incorporated in (24) to generate (26). Ignoring all other errors, the offset causes a vertical shift in the transfer function as demonstrated by Fig. 27. The MDAC implementation of Fig. 25, however, uses direct auto-zeroing offset cancellation making the error negligible; in $\Phi_{1}$ the offset is sampled on $C_{1}$ and $C_{2}$ by resetting the amplifier, and is then used in $\Phi_{2}$ to compensate the offset error at the output. This is confirmed in (26) with the factor $(1+A)$, and offsets as large as 10 s of millivolts are effectively reduced well below 1 mV .

$$
\begin{equation*}
V_{O U T} \cong\left(1-e^{-\frac{T_{s} \beta_{\phi 2} G B W}{2}}\right)\left(1-\frac{1}{\beta_{\Phi 2} A}\right)\left[2 V_{I N}\left(1+\frac{\Delta C}{2 C}\right)-V_{D A C}\left(1+\frac{\Delta C}{C}\right)+\frac{V_{O S}}{\beta_{\Phi 2}(1+A)}\right] \tag{26}
\end{equation*}
$$



Fig. 27. Effect of amplifier offset on (a) 1bit/stage and (b) 1.5bits/stage.

## 5. Amplifier Distortion

When the output values get closer to $\pm V_{R}$ the MDAC shows nonlinear effects as depicted in Fig. 28. This behavior is attributed to distortion in the amplifier. The two dominant distortion factors are the amplifier slew rate (SR) and variable gain. The $S R$ sets the maximum current the amplifier can provide to charge/discharge its load, which for large output values may be smaller than the current necessary to maintain the exponential settling of (24), thus introducing larger settling errors than for smaller output values. Similarly, the amplifier gain is variable for different output values. It is at its maximum when $V_{\text {OUT }}=0$, and minimum when $V_{\text {OUT }}= \pm V_{R}$. Both the $S R$ and variable gain contribute to the bowing effect in the transfer function of Fig. 28, which can be
included in the transfer function as seen in (27), where $\overline{V_{\text {OUT }}}$ as expressed by (28) is the expected output with no $G B W$ or $S R$ limitations.

$$
\begin{gather*}
V_{\text {OUT }} \cong \overline{V_{\text {OUT }}}\left(1-\overline{\overline{V_{\text {OUT }}} \beta_{\Phi 2} G B W} e^{-\left[1+\beta_{\phi_{2} G B W}\left(\frac{T_{S}}{2}-\frac{\left.\left.\overline{V_{\text {OUT }}} \frac{S R}{S R}\right)\right]}{}\right)\right.} \begin{array}{l}
\overline{V_{\text {OUT }}} \cong\left(1-\frac{1}{\beta_{\Phi 2} A}\right)\left[2 V_{I N}\left(1+\frac{\Delta C}{2 C}\right)-V_{D A C}\left(1+\frac{\Delta C}{C}\right)+\frac{V_{\text {OS }}}{\beta_{\Phi 2}(1+A)}\right]
\end{array}\right] . \tag{27}
\end{gather*}
$$


(a)

(b)

Fig. 28. Effect of amplifier distortion on (a) 1bit/stage and (b) 1.5bits/stage.

The maximum distortion error is at $\overline{V_{\text {OUT }}}= \pm V_{R}$, which needs to be kept smaller than $1 / 2 L S B$ of the remaining stages to maintain the linearity of the ADC; this is described by (29). Using (29), however, an analytical limit on the $S R$ cannot be directly obtained. In Appendix A the amplifier slewing behavior is analyzed and it is
demonstrated that by allotting a portion $\delta$ of the maximum output step $V_{R}$ to be achieved by slewing, the minimum required $S R$ can be given by (30), whereas the ratio of slewing time, $t_{\text {slew }}$, with respect to $T_{S} / 2$ can be given by (31).

$$
\begin{gather*}
\frac{S R}{\beta_{\Phi_{2}} G B W} e^{-\left[1+\beta_{\phi_{2}} G B W\left(\frac{T_{S}}{2}-\frac{V_{R}}{S R}\right)\right]}<\frac{2 V_{R}}{2^{N r+1}}  \tag{29}\\
S R>\frac{2 V_{R}}{T_{S}}\left[\delta+(1-\delta) \ln \left(2^{N r}(1-\delta)\right)\right]  \tag{30}\\
\frac{t_{\text {slew }}}{T_{S} / 2}=\frac{\delta}{\delta+(1-\delta) \ln \left(2^{N}(1-\delta)\right)} \tag{31}
\end{gather*}
$$

Because of the amplifier slewing, the $G B W$ needs to be increased over what is described by (25) to ensure the amplifier settles to the desired accuracy within $T_{\S} / 2$. The relation given by (32) shows the ratio of the new $G B W_{w / s l e w}$ with respect to the original $G B W$ in terms of the scaling factor $\delta$. The percentage of settling time spent slewing, (31), and the normalized $G B W$ of (32) are presented in Fig. 29.

$$
\begin{equation*}
\frac{G B W_{w / \text { slew }}}{G B W}=\frac{\delta+(1-\delta) \ln \left(2^{N r}(1-\delta)\right)}{(1-\delta) \ln \left(2^{N r}\right)} \tag{32}
\end{equation*}
$$

Considering the first stage MDAC $(N r=9)$ of the pipeline ADC, a 1 Vpp output range means that the maximum step is 500 mV . If while slewing the amplifier reaches $350 \mathrm{mV}(\delta=0.7)$, then according to (30) a minimum $S R$ of $370 \mathrm{~V} / \mu \mathrm{s}$ is required, $t_{\text {slew }}$
constitutes about $30 \%$ of the settling time, and the $G B W_{w / s l e w}$ needs to be increased $20 \%$ over the original $G B W$ ( $745 \mathrm{MHz} \rightarrow 895 \mathrm{MHz}$ ). Using the same assumptions for the front-end $\mathrm{S} / \mathrm{H}$, a $S R$ of at least $410 \mathrm{~V} / \mu$ s is needed with a $G B W_{w / s l e w}$ of 540 MHz .


Fig. 29. Effect of amplifier $S R$ on $G B W$ requirement.

## 6. Finite Switch Resistance

Considering the MDAC states of Fig. 19(b) and (d), which are repeated in Fig. 30 for convenience with switch names, we notice the switches are in the charging path of capacitors $C_{I}$ and $C_{2}$. Given their finite resistance described by (16), they introduce a settling time constant and cause errors within the limited settling time $T_{S} / 2$.


Fig. 30. A 1.5bits/stage MDAC in (a) sampling phase, $\Phi_{1}$, and (b) multiplying and holding phase, $\Phi_{2}$.

From Fig. 30(a), it is safe to assume that $S_{I}$ and $S_{2}$ have the same $R_{O N}$, as they are generally sized the same and experience the same operating conditions. Therefore, for $C_{I}$ $=C_{2}=C$, the time constant $\tau_{1}$ associated with charging either $C_{1}$ or $C_{2}$ can be expressed by (33), where $R_{x}$ is the resistance looking into the node $x$. As for Fig. 30(b), we can define two time constants associated with $V_{O U T}$ charging $C_{1}$ and $V_{D A C}$ charging $C_{2}$; these are given by $\tau_{2}$ and $\tau_{3}$ in (34) and (35) respectively.

$$
\begin{gather*}
\tau_{1}=C_{1,2}\left(R_{O N 1,2}+2 R_{x}\right)  \tag{33}\\
\tau_{2}=C_{1} R_{O N 4}  \tag{34}\\
\tau_{3}=C_{2} R_{O N 3} \tag{35}
\end{gather*}
$$

These time constants affect the charge stored on $C_{1}$ and $C_{2}$ at the end of either clock phase- $T_{S} / 2$ of settling time-as shown in (36)-(38) assuming no amplifier limitations, and the error is demonstrated by $\alpha$ compared to the original (13)-(15).

$$
\begin{gather*}
q_{\Phi 1}=\left\{\begin{array}{l}
q_{C 1, \Phi 1}=C_{1} V_{I N}\left(1-e^{-\frac{T_{s}}{2 \tau_{1}}}\right)=\alpha_{1} C_{1} V_{I N} \\
q_{C 2, \Phi 1}=C_{2} V_{I N}\left(1-e^{-\frac{T_{s}}{2 \tau_{1}}}\right)=\alpha_{1} C_{2} V_{I N}
\end{array}\right.  \tag{36}\\
q_{\Phi 2}=\left\{\begin{array}{l}
q_{C 1, \Phi 2}=C_{1} V_{O U T}\left(1-e^{-\frac{T_{s}}{2 \tau_{2}}}\right)=\alpha_{2} C_{1} V_{O U T} \\
q_{C 2, \Phi 2}=C_{2} V_{D A C}\left(1-e^{-\frac{T_{s}}{2 \tau_{3}}}\right)=\alpha_{3} C_{2} V_{D A C} \\
\Delta q_{C 1}+\Delta q_{C 2}=0 \Rightarrow V_{O U T}=\frac{\alpha_{1}}{\alpha_{2}} V_{I N}\left(1+\frac{C_{2}}{C_{1}}\right)-\frac{\alpha_{3}}{\alpha_{2}} V_{D A C} \frac{C_{2}}{C_{1}}
\end{array}\right. \tag{37}
\end{gather*}
$$

While the terms $\alpha_{1} / \alpha_{2}$ and $\alpha_{3} / \alpha_{2}$ are the source of the error, they offer a couple of solutions. First, using a brute force approach, the resistances associated with the time constants (33)-(35) can be reduced to make $\alpha$ very close to unity; the capacitors $C_{l}$ and $C_{2}$ may also be reduced to achieve the same effect, but there are matching and noise limitations on their values which may prohibit their reduction. Second, $\tau_{1}, \tau_{2}$ and $\tau_{3}$ can be matched, such that $\alpha_{1}, \alpha_{2}$ and $\alpha_{3}$ are all equal.

The second approach, while elegant, is quite problematic to implement because $R_{O N}$ can be variable. The $R_{O N}$ expression associated with Fig. 31 is given by (39), which is an expansion of (16), and shows the dependency of the resistance on $V_{I N}$, the gate voltage $V_{G}$, and the bulk voltage $V_{B}$.

(a)

(b)

Fig. 31. Four terminal NMOS switch; (a) cross-sectional view and (b) schematic view.

$$
\begin{equation*}
R_{O N}=\frac{L}{\mu_{N} C_{o x} W(\underbrace{V_{G}-V_{I N}}_{V_{G S}}-\underbrace{\left[V_{T 0}+\gamma\left(\sqrt{\left|2 \phi_{F B}+V_{I N}-V_{B}\right|}-\sqrt{\left|2 \phi_{F B}\right|}\right)\right]}_{V_{T}})} \tag{39}
\end{equation*}
$$

For a fixed $V_{G}$ and $V_{B}$, it is clear that as $V_{I N}$ increases, $V_{G S}$ decreases and $V_{T}$ increases, and effectively $R_{O N}$ is increased. This dependence of $R_{O N}$ on $V_{I N}$ is why it is impractical to match $\tau_{1}, \tau_{2}$ and $\tau_{3}$ and make $\alpha_{1}, \alpha_{2}$ and $\alpha_{3}$ all equal. Moreover, it is another source for distortion in the MDAC transfer function as depicted in Fig. 32, which unlike the amplifier distortion is asymmetric. This distortion asymmetry is due to $V_{D A C}$. While $V_{I N}$ and $V_{O U T}$ cover the whole $\left\{-V_{R}, V_{R}\right\}$ range, $V_{D A C}$ takes fixed values $\left(-V_{R}, 0\right.$ or $\left.V_{R}\right)$ according to which region $V_{I N}$ falls into. Hence, $\alpha_{3}$ is fixed for a particular $V_{D A C}$ value, whereas $\alpha_{1}$ and $\alpha_{2}$ are continually changing. To a weak approximation $\alpha_{1}$ and $\alpha_{2}$ track and so $\alpha_{1} / \alpha_{2}$ is fairly constant, but $\alpha_{3} / \alpha_{2}$ experiences a clear and distinct jump as $V_{D A C}$ changes from $-V_{R}$ to $V_{R}$.


Fig. 32. Effect of variable switch resistance on (a) 1bit/stage and (b) 1.5bits/stage.

The maximum error caused by the switches occurs at $V_{I N}=V_{R}$, but to express it analytically and extract a set of switch requirements is no easy task. Fortunately, modern CMOS technologies offer small device lengths $L$ and high oxide capacitance $C_{o x}$ which naturally reduce $R_{O N}$ without a lot of design effort. However, this brute force approach may only go up to 12 bits of resolution.


Fig. 33. Modified bootstrapped switch with bulk connections.

On the other hand, there are effective design techniques that ensure high switch linearity and virtually eliminate the $R_{O N}$ dependency on $V_{I N}$. Consider for example the modified bootstrapped switch with bulk connections in Fig. 33; the bootstrapping effectively keeps $V_{G S}$ constant independent of $V_{I N}$, while the bulk switching eliminates the $V_{T}$ dependency on $V_{I N}$ when the switch is on, and grounds the bulk when the switch is off for maximum $V_{I N} / V_{\text {OUT }}$ isolation.

## D. DNL and INL

The common factor of all the errors examined in section $C$ is that they are repeatable for the same $V_{I N}$, and hence can be used to estimate the $D N L$ and consequently the $I N L$ of the pipeline ADC .

We begin by rewriting the MDAC transfer function of (27) and (28) as shown in (40), where $\varepsilon_{S}, \varepsilon_{A}$ and $\varepsilon_{C}$ represent the error components due to settling ( $S R$ and $G B W$ limitations), amplifier gain and capacitor mismatch respectively. Note that the finite switch resistance effects were not considered, as the amplifier settling limitations generally pose more design challenges than the switches.

$$
\begin{equation*}
V_{\text {OUT }} \cong\left(1-\varepsilon_{S}\right)\left(1-\varepsilon_{A}\right)\left[2 V_{I N}\left(1+\frac{\varepsilon_{C}}{2}\right)-V_{D A C}\left(1+\varepsilon_{C}\right)+\varepsilon_{A} V_{O S}\right] \tag{40}
\end{equation*}
$$

By expanding (40) and neglecting second order errors, the MDAC transfer function simplifies to (41) taking into account that the capacitor mismatch and $V_{O S}$ are
random and therefore can be either positive or negative; their sign reversal in (41) is to maximize the error.

$$
\begin{equation*}
V_{\text {OUT }} \cong \underbrace{2 V_{I N}-V_{D A C}}_{\text {Ideal }}-\underbrace{\left[2 V_{I N}\left(\varepsilon_{S}+\varepsilon_{A}+\frac{\varepsilon_{C}}{2}\right)-V_{D A C}\left(\varepsilon_{S}+\varepsilon_{A}+\varepsilon_{C}\right)+\varepsilon_{A} V_{O S}\right]}_{\text {error }} \tag{41}
\end{equation*}
$$

Collectively, the error in (41) is maximized at $V_{I N}=0, V_{D A C}= \pm V_{R}$, considering a potential comparator offset of $\pm V_{R} / 4$. This is expressed by (42) for $V_{R} \gg V_{O S}$, and needs to be kept smaller than $1 / 2 L S B$ of the remaining stages as described by (43). Unlike section $C$ where each MDAC error was considered individually for demonstration purposes, (43) is more practical as it limits all errors under the quantization noise, which guarantees a maximum $D N L$ of $1 / 2 L S B$ if (43) is fulfilled by all pipeline stages; naturally the front-end $\mathrm{S} / \mathrm{H}$ also needs to fulfill (43) for a total ADC resolution $N$.

$$
\begin{align*}
& V_{\varepsilon}=V_{R}\left(\varepsilon_{S}+\varepsilon_{A}+\varepsilon_{C}+\varepsilon_{A} \frac{V_{O S}}{V_{R}}\right) \cong V_{R}\left(\varepsilon_{S}+\varepsilon_{A}+\varepsilon_{C}\right)  \tag{42}\\
& V_{R}\left(\varepsilon_{S}+\varepsilon_{A}+\varepsilon_{C}\right)<\frac{2 V_{R}}{2^{N r+1}} \Rightarrow \varepsilon_{S}+\varepsilon_{A}+\varepsilon_{C}<\frac{1}{2^{N r}} \tag{43}
\end{align*}
$$

As for $I N L$, it is the buildup of the S/H and MDAC errors through all stages and can be given by (44) at the output of the front-end $\mathrm{S} / \mathrm{H}$ for a pipeline ADC of $N$ 1.5 bits/stage. To limit the $I N L$ to less than $1 / 2 L S B$, the condition set by (45) needs to be
met, which is very difficult to analyze as the individual stage errors are not correlated. Nonetheless, an absolute worst case scenario can be evaluated assuming all stages have the same error and it builds up with the same sign; this is described in (46).

$$
\begin{align*}
& I N L=V_{R}\left[\left(\varepsilon_{S}+\varepsilon_{A}+\varepsilon_{C}\right)_{S H}+\frac{1}{2^{N-1}} \sum_{i=1}^{N-1} 2^{N-(i+1)}\left(\varepsilon_{S}+\varepsilon_{A}+\varepsilon_{C}\right)_{i}\right]  \tag{44}\\
& V_{R}\left[\left(\varepsilon_{S}+\varepsilon_{A}+\varepsilon_{C}\right)_{S H}+\frac{1}{2^{N-1}} \sum_{i=1}^{N-1} 2^{N-(i+1)}\left(\varepsilon_{S}+\varepsilon_{A}+\varepsilon_{C}\right)_{i}\right]<\frac{2 V_{R}}{2^{N+1}}  \tag{45}\\
& \frac{2^{N}-1}{2^{N-1}}\left(\varepsilon_{S}+\varepsilon_{A}+\varepsilon_{C}\right)<\frac{1}{2^{N}} \Rightarrow\left(\varepsilon_{S}+\varepsilon_{A}+\varepsilon_{C}\right)<\frac{1}{2\left(2^{N}-1\right)} \tag{46}
\end{align*}
$$

Considering the front-end $\mathrm{S} / \mathrm{H}$ and comparing (43) with (46), the error tolerance is halved. Moreover, (46) assumes that all stage errors are the same and hence becomes very demanding and inefficient in the design of the last stages where accuracy can be significantly relaxed. Therefore, as far as $I N L$ is concerned, it is best to analyze it from a statistical approach where the errors are scaled with each stage but are random and uncorrelated as described by (45).

## E. Noise

ADC quantization noise was the result of the conversion process, and is used in design to mask over well-defined and predictable circuit non-idealities. However, there is another source of non-ideality that cannot be predicted and can influence the ADC performance. Electronic noise-random fluctuations of voltage and/or current-
contributed by the circuit components is added to the output of each pipeline stage and builds up through the ADC. Using the pipeline ADC diagram of Fig. 10, and representing the output noise power of stage $i$ by $v_{n o, i}^{2}$, the total ADC input referred noise power $v_{n, t o t}^{2}$ can be expressed as in (47), where $G_{i}$ is the gain of the $i^{\text {th }}$ stage.

$$
\begin{equation*}
v_{n, t o t}^{2}=\frac{v_{n o, S H}^{2}}{G_{S H}^{2}}+\frac{v_{n o, 1}^{2}}{\left(G_{S H} G_{1}\right)^{2}}+\frac{v_{n o, 2}^{2}}{\left(G_{S H} G_{1} G_{2}\right)^{2}}+\cdots+\frac{v_{n o, i-1}^{2}}{\left(G_{S H} \cdots G_{i-1}\right)^{2}}+\frac{v_{n o, i}^{2}}{\left(G_{S H} \cdots G_{i}\right)^{2}} \tag{47}
\end{equation*}
$$

It is clear that the early stages are the dominant noise contributors and need to be considered carefully in design; the front-end S/H alone may contribute more than $50 \%$ of $v_{n, \text { tot }}^{2}$. To better understand the noise behavior of the pipeline ADC, however, we examine the two fundamental contributors of noise: switches and amplifiers.

## 1. Switches

The noise generated by the finite on-resistance of switches is characterized as thermal noise. Its model is presented in Fig. 34 where $R_{s w}$ is the switch resistance and $v_{n, s w}^{2}$ is its noise power spectral density defined by (48) [44]. Here, $k_{B}$ and $T$ represent Boltzman's constant and the absolute temperature respectively.

$$
\begin{equation*}
v_{n, s w}^{2}=4 k_{B} T R_{s w} \cdot \Delta f \tag{48}
\end{equation*}
$$



Fig. 34. Switch resistance noise model.

The noise sampled on $C_{H}$ is transferred to the next stage and has to be analyzed to evaluate its effect on the overall ADC performance. It is necessary to consider the integrated noise power over all frequencies to account for the aliased noise due to sampling. This is evaluated in (49), and the result shows that the total noise power is solely dependent on the capacitance value and can only be reduced by increasing $C_{H}$.

$$
\begin{equation*}
v_{n, C_{H}}^{2}=4 k_{B} T R_{s w} \int_{0}^{\infty} \frac{d f}{1+\left(2 \pi f R_{s w} C_{H}\right)^{2}}=\frac{k_{B} T}{C_{H}} \tag{49}
\end{equation*}
$$

For simple passive sampling as that modeled in Fig. 34, (49) can be used in conjunction with the quantization noise of the ADC to determine the minimum capacitance required such that the effect of switch resistance noise on the linearity and resolution of the overall ADC is diminished; this is expressed by (50).

$$
\begin{equation*}
\frac{k_{B} T}{C_{H}}<\frac{\left(2 V_{R}\right)^{2}}{12\left(2^{N}\right)^{2}} \Rightarrow C_{H}>3 \frac{2^{2 N} k_{B} T}{V_{R}^{2}} \tag{50}
\end{equation*}
$$

## 2. Amplifiers

Amplifiers also contribute to the noise seen at the output of the ADC. Consider Fig. 35, which depicts all the noise sources present for both the sampling and holding phases in an $M$ bit/stage MDAC based on the example of Fig. 15. To see how the amplifier affects the noise performance of the pipeline cell, we need to analyze the total output noise. Once evaluated, the output noise can be referred back to the input to formulate a figure of merit for amplifier noise and determine the minimum capacitance needed to keep the overall thermal noise of the MDAC below the ADC's quantization noise requirements. This analysis is the subject of Appendix B, where it is demonstrated that the input referred noise of an MDAC can be expressed by (51), where $m=2^{M}-1$ and $\alpha_{n}$ is given by (52)—a similar noise analysis is presented in [45], but assumes a passive sampling approach and thus ignores the amplifier noise in the sampling phase.


Fig. 35. Noise sources present in an M bit/stage MDAC for (a) sampling phase and (b) holding phase.

$$
\begin{equation*}
\left.v_{n, i n}^{2}\right|_{M D A C}=\frac{k_{B} T}{(m+1) C_{L}}\left[\gamma(1+\eta)+\frac{2 g m R_{s w}}{(m+1)}\right]+2 \frac{k_{B} T}{(m+1) C_{u}}\left(1+\alpha_{n}\right) \tag{51}
\end{equation*}
$$

$$
\begin{equation*}
\alpha_{n}=\frac{1}{1+\omega_{u} R_{s w} C_{u}}+\frac{m}{2\left(1+\frac{\omega_{u}^{2} R_{s w}^{2} C_{u}^{2}}{1+\omega_{u} R_{s w} C_{u}}\right)}+\frac{\gamma(1+\eta)}{g m R_{s w}\left(1+\frac{1}{\omega_{u} R_{s w} C_{u}}\right)} \tag{52}
\end{equation*}
$$

The input referred MDAC noise shows that the contributions of the switches and amplifier to the MDAC noise are limited by the unit capacitance, $C_{u}$, used in the MDAC and effective loading capacitance, $C_{L}$, seen by the amplifier. Also, the use of multibit/stage pipeline cells has an added filtering effect on the total noise; this added benefit of lower noise, however, is a direct trade-off with the speed of the pipeline ADC due to the reduced feedback factor of the MDAC amplifier. Another trade-off occurs between the amplifier bandwidth and switch time constant, which is exemplified in $\alpha_{n}$; setting $R_{s w} C_{u}$ significantly smaller than $l / \omega_{u}$ results in the switch resistance noise dominating, and vice versa. Nonetheless, a good compromise is achieved when $R_{s w} C_{u} \approx 1 / \omega_{u}$, where power consumption can be optimized. Further power optimization can be realized if $C_{L}$-the loading of the following stage-is recognized as a function of $C_{u}$ and the MDAC gain, which defines the capacitor scaling along the pipeline stages [46]. Finally, (51) can also be used to evaluate the input referred noise of the $\mathrm{S} / \mathrm{H}$ by using $\mathrm{m}=0$.

## F. Conclusions

The primary focus of this chapter was the analysis of all the underlying phenomena that govern the performance of the pipeline ADC from an analog standpoint. Through numerous derivations, many analog circuit design limitations were linked to characteristic errors in the ADC , and boundaries of their acceptable bounds where
established to facilitate the design process of a non-calibrated pipeline ADC. The amplifier in particular was identified as a major contributor to ADC errors, and hence Chapter IV is devoted to its study. On the other hand, while amplifier errors can be alleviated by design, capacitors and switches remain as the ultimate limitation in noncalibrated pipeline ADC. This is mostly evident from the DNL/INL analysis where capacitor mismatch dominates the performance, and also from the study of MDAC noise where a lower bound is primarily set by $k_{B} T / C_{u}$.

## CHAPTER IV

## AMPLIFIERS

It was demonstrated in Chapter III that amplifiers in pipeline ADCs have a direct and major role in the operation of the individual pipeline stages by performing active sampling and residue amplification. Consequently, the amplifier limitations have a direct impact on the overall ADC performance, which for high speed and very high ( $>12$ bits) resolutions may require the ADC to be digitally calibrated.

This chapter focuses on the study of the performance metrics that govern the design of amplifiers and influence the accuracy of pipeline ADCs. Each performance metric will be discussed using an actively-loaded differential pair amplifier as an example, and keys to high speed design will be highlighted. Also, some optimization methods on the transistor level will be discussed that are applicable to low voltage and low power applications. Then, the fundamental amplifier topologies commonly used in SC applications will be compared for their suitability in a high speed, low voltage and low power pipeline ADCs, along with some techniques of amplifier performance enhancement and the challenges associated with their implementations.

## A. The Differential Pair Amplifier

An actively-loaded differential amplifier is shown in Fig. 36 for single-ended and fully-differential configurations. In the following sections, many amplifier aspects will be discussed based on Fig. 36. In order to facilitate the discussion, frequent reference
will be made to the square law I-V model of MOS devices [47], which for an NMOS device in saturation is expressed by (53) where $\mu_{N}, C_{o x}, W, L, V_{G S}, V_{T_{N}}$ and $V_{G S T N}$ are the electron mobility, gate oxide per unit area, width, length, gate-source, threshold, and overdrive voltages respectively. While this simple model does not take into account many of the secondary effects seen and modeled for state-of-the-art CMOS technologies [48], it serves as a reasonable first-order approximation to MOS device behavior. Nevertheless, secondary order effects will be presented as the need arise and where applicable.


Fig. 36. The differential pair amplifier; (a) single-ended and (b) fully-differential.

$$
\begin{equation*}
I_{D 0}=\frac{1}{2} \mu_{N} C_{o x} \frac{W}{L}\left(V_{G S}-V_{T_{N}}\right)^{2}=\frac{1}{2} \mu_{N} C_{o x} \frac{W}{L} V_{G S T N}^{2} \tag{53}
\end{equation*}
$$

## 1. Bandwidth

The bandwidth of an amplifier is the first indicator to its speed performance. It is also a measure of the frequency bandwidth for which the amplifier's gain is larger than unity, and is commonly referred to as the gain bandwidth product ( $G B W$ ). There are several factors on which the amplifier's $G B W$ depends; the CMOS technology used, device size, quiescent operating point, driven load and amplifier architecture are some examples.

CMOS technology has been aggressively scaling down in device size over the past four decades to accommodate a lager number of gates per unit area. The scaling down of device size also improved the bandwidth. The most commonly used figure of merit to quantify the maximum speed for a given CMOS technology is the transit frequency, $f_{T}$ [49], at which the small signal drain current to gate current ratio of a MOS device is unity, and can be expressed by (54). We can deduce from (54) that the speed of CMOS devices and hence a CMOS amplifier can be enhanced by increasing the overdrive voltage, $V_{G S T}$, or reducing the device channel length, $L$.

$$
\begin{equation*}
f_{T} \cong \frac{\mu}{2 \pi} \frac{V_{G S T}}{L^{2}} \tag{54}
\end{equation*}
$$

From a designer's perspective, however, the amplifier's transconductance and load determine its bandwidth, and hence its speed. The transconductance, $g m$, is the amplifier's ability to convert the input voltage signal into a current that drives the output load, and is defined for an NMOS operating in saturation by (55). Note that all three
forms of $g m$ are equivalent, but provide a different view into the design variables.

$$
\begin{equation*}
g m=\left.\frac{\partial I_{D}}{\partial V_{G S}}\right|_{\text {const. } V_{D S}}=\mu_{N} C_{o x} \frac{W}{L} V_{G S T N}=\frac{2 I_{D}}{V_{G S T N}}=\sqrt{2 \mu_{N} C_{o x} \frac{W}{L} I_{D}} \tag{55}
\end{equation*}
$$

The importance of the amplifier transconductance is best illustrated by the small signal macro-model of Fig. 36(a) given in Fig. 37. Here the output voltage, Vout, is the sum of currents $\left(i_{1}-i_{2}\right)$ driving the given output load $\left(R_{L} \| C_{L}\right)$. Since the differential pair, M1 and M2, of the amplifier in Fig. 36(a) are identical, the differential gain of the amplifier in Fig. 36(a) can be expressed by (56). We can now find the bandwidth, or $G B W$, of the amplifier by setting (56) to unity and solving for $s(j \omega)$, which results in (57). Clearly, the $G B W$ is highly dependent on the device size, quiescent operating point and the load driven as exemplified through $g m$ of equation (55) and $C_{L}$.


Fig. 37. Small signal macro-model of Fig. 36(a) assuming a load $R_{L} \| C_{L}$.

$$
\begin{gather*}
A(s)=\frac{\text { Vout }}{\text { Vin }}=\frac{g m_{1,2} R_{L}}{1+s C_{L} R_{L}}  \tag{56}\\
G B W \cong \frac{g m_{1,2}}{C_{L}} \tag{57}
\end{gather*}
$$

## 2. Gain

Amplifiers are seldom used in an open-loop configuration; in analog design, one does not rely on the absolute value of the amplifier's gain. Consider the low frequency gain value of (56), $A(0)=g m_{1,2} R_{L}$, and suppose the amplifier had no external loading such that $R_{L}$ is its intrinsic output impedance. While the transconductance, $g m_{1,2}$, can be fairly predicted by setting the associated parameters in (55), the amplifier's intrinsic output impedance cannot due to channel length modulation [47].

A MOS device enters the saturation region when $V_{D S}$ exceeds $V_{G S T}$. Then the channel is said to be pinched-off from the drain. With further increase in $V_{D S}$, the MOS device is driven deeper into saturation and the channel length is reduced to $L_{\text {eff }}<L$ as seen in Fig. 38, giving rise to channel length modulation.


Fig. 38. Channel length modulation phenomenon in MOS devices.

The channel length modulation can be incorporated in the square law I-V model of (53) to yield (58), where $I_{D 0}$ is the ideal drain current expressed by (53). Using (58) we can express the intrinsic output impedance of a MOS device as in (59). We can again derive the transconductance, $g m$, from (58) and together with (59), express the intrinsic gain $A_{i}=g m r_{d s}$ of a MOS device as in (60).

$$
\begin{gather*}
I_{D}=\frac{1}{2} \mu C_{o x} \frac{W}{L_{e f f}} V_{G S T}^{2} \equiv \frac{1}{2} \mu C_{o x} \frac{W}{L} V_{G S T}^{2}\left(1+\lambda V_{D S}\right)=I_{D 0}\left(1+\lambda V_{D S}\right)  \tag{58}\\
r_{d s}=\left.\left(\frac{\partial I_{D}}{\partial V_{D S}}\right)^{-1}\right|_{c o n s t . V_{G S}}=\frac{2 L}{\mu_{N} C_{o x} W V_{G S T N}^{2} \lambda}=\frac{1}{\lambda I_{D 0}}  \tag{59}\\
A_{i}=\frac{2\left(1+\lambda V_{D S}\right)}{\lambda V_{G S T N}} \tag{60}
\end{gather*}
$$

Equations (59) and (60) demonstrate the uncertainty in the intrinsic output impedance and gain of a MOS device, and hence of an amplifier. However, because amplifiers are used in a closed loop configuration, i.e. using feedback, this gain uncertainty is of little concern. Fig. 39 shows a simple example of an amplifier in closed loop for which the gain Vout/Vin is given by (61). For a sufficiently large intrinsic amplifier gain $A$, Vout/Vin is virtually independent of $A$ and is set by the capacitor ratio $C_{1} / C_{2}$. Nonetheless, (60) highlights the importance of the quiescent operating point of the MOS device, and can give an approximate measure of the maximum intrinsic MOS gain achievable for a particular CMOS technology.


Fig. 39. An example of a closed loop amplifier circuit.

$$
\begin{equation*}
\frac{\text { Vout }}{V \text { vin }}=-\frac{C_{1}}{C_{2}}\left(1+\frac{C_{1}+C_{2}+C_{P}}{A C_{2}}\right)^{-1} \cong-\frac{C_{1}}{C_{2}}\left(1-\frac{C_{1}+C_{2}+C_{P}}{A C_{2}}\right) \tag{61}
\end{equation*}
$$

What is critical for low voltage applications and can be deduced from (60) is how the amplifier gain is related to the output swing. For large signals, the quiescent operating point of a MOS device can no longer be considered fixed as with small signals, but is incrementally changing; as the output swings closer to either supply rail, the quiescent $V_{D S}$ of the respective device decreases and hence the gain is reduced. This has to be closely considered in low voltage applications, where the signal swing needs to be maximized while maintaining the highest gain possible.

## 3. Phase Margin

Although the macro-model of Fig. 36(a) provides a good approximation of the small signal amplifier behavior, it overlooks the parasitics of the current mirror now modeled in Fig. 40. The mirrored current, $i^{*}{ }_{1}$, is no longer identical to the source $i_{1}$,
which modifies the overall gain expression as given by (62). Unlike (56), the gain expression in (62) is a two-pole system, and its significance is best explained by examining the phase of Vout. Since the amplifier transfer function is variable with frequency, it results in a magnitude and phase change at the output. Given the complex nature of the gain expression $(s=j \omega)$, the phase is given by the cumulative effect of the poles and zeros, and the general phase expression is given by (63) for a transfer function with $n$ poles $\left(\omega_{p}\right)$ and $m$ zeros $\left(\omega_{z}\right)$.


Fig. 40. Improved small signal macro-model of Fig. 36(a).

$$
\begin{gather*}
A(s)=\frac{\text { Vout }}{\text { Vin }} \cong\left(i_{1}^{*}-i_{2}\right)\left(\frac{R_{L}}{1+s C_{L} R_{L}}\right)=\frac{g m_{1,2} R_{L}\left(1+s C_{G S} / 2 g m_{3,4}\right)}{\left(1+s C_{L} R_{L}\right)\left(1+s C_{G S} / g m_{3,4}\right)}  \tag{62}\\
\theta(\omega)=\sum_{l=1}^{n} \tan ^{-1}\left(\frac{\omega}{\omega_{p l}}\right)-\sum_{k=1}^{m} \tan ^{-1}\left(\frac{\omega}{\omega_{z k}}\right) \tag{63}
\end{gather*}
$$

Using (63) we can express the phase of (56) and (62) as in (64) and (65) respectively, where $C_{G S}=C_{g s 3}+C_{g s 4}$. From (64) and (65), we see the phase is negative and decreasing with frequency in the presence of poles. This is potentially problematic with closed-loop amplifier configurations such as in Fig. 39, and detailed analyses are presented in [50]-[51]. However, an intuitive way to analyze this is as follows. The magnitude of the amplifier transfer function is generally a decreasing function of $s$, similar to (56) and (62). Suppose for some transfer function with multiple poles that the phase shift between Vout and Vin exceeds $180^{\circ}$ at some frequency $\omega_{0}$ while the amplifier's gain is greater than unity $\left(\omega_{0}<G B W\right)$, then the output of the amplifier in Fig. 39 will oscillate indefinitely and become unstable.

$$
\begin{gather*}
\left.\theta(\omega)\right|_{(i i i 5)}=-\tan ^{-1}\left(\frac{\omega}{1 / C_{L} R_{L}}\right)  \tag{64}\\
\left.\theta(\omega)\right|_{(i i i 11)}=-\tan ^{-1}\left(\frac{\omega}{1 / C_{L} R_{L}}\right)-\tan ^{-1}\left(\frac{\omega}{g m_{3,4} / C_{G S}}\right)+\tan ^{-1}\left(\frac{\omega}{2 g m_{3,4} / C_{G S}}\right) \tag{65}
\end{gather*}
$$

The phase margin is a measure of how far is the amplifier from being unstable and is measured at $s=G B W$. The phase margin is also a good indicator to the amplifier ability to reach steady state should a fast time-varying signal, such as a step function, be applied to its input. This is illustrated in Fig. 41 by considering the $0.1 \%$ step response settling time of the amplifier in Fig. 36(a) when used as in Fig. 39 for $C_{1}=C_{2}, C_{P}=0$, $A(0)=60 \mathrm{~dB}, G B W=100 \mathrm{MHz}$ and different phase margin values.


Fig. 41. Closed loop step response of the amplifier in Fig. 36(a) for different phase margin values.

The results of Fig. 41 lead to the conclusion that a phase margin around $60^{\circ}$ is ideal for high speed performance. Indeed, this is the case for the chosen settling error $(0.1 \%)$ and if there are no pole-zero doublets within the amplifier's $G B W$. If on the other hand the latter is the case, the frequency of the doublet and separation between the pole and zero can dominate the settling behavior and significantly slow down the amplifier's response [52]-[53].

The amplifier architecture can also play a role in determining the speed. The transfer function of (62) demonstrated that the presence of the current mirror M3:M4 in Fig. 36(a) led to the addition of a pole-zero pair that may hinder the speed of the amplifier if not considered in design. On the other hand, the fully differential amplifier in Fig. 36(b) does not suffer the same limitation. Fig. 42 shows the macro-model of the
fully differential amplifier in Fig. 36(b) and (66) describes the amplifier's transfer function. Note that (66) is identical to (56). Unlike the single ended amplifier, the two signal paths in the fully differential amplifier are identical and their summation at the output does not generate a pole-zero pair that can slow down the speed. This is one of numerous advantages for using fully differential amplifiers over their single ended counterparts.


Fig. 42. Small signal macro-model of Fig. 36(b) assuming a load $R_{L} \| C_{L}$.

$$
\begin{equation*}
A(s)=\frac{(\text { Vout }+)-(\text { Vout }-)}{\operatorname{Vin}}=\frac{g m_{1,2} R_{L}}{1+s C_{L} R_{L}} . \tag{66}
\end{equation*}
$$

## 4. Slew Rate

In SC application, the settling time and accuracy of a circuit is crucial to good linearity. Consider the single ended amplifier of Fig. 36(a) configured as in Fig. 39, and for simplicity refer to its gain transfer function as in (56). The time-domain step response can then be described by (67), where the feedback factor, $\beta$, is $C_{2} /\left(C_{1}+C_{2}+C_{P}\right)$, and closed loop amplifier gain, $\alpha$, is given by (68).

$$
\begin{align*}
\text { Vout } & =-\alpha \cdot \operatorname{Vin}\left(1-e^{-\beta \cdot G B W \cdot t}\right)  \tag{67}\\
\alpha & =\frac{C_{1}}{C_{2}}\left(1+\frac{1}{\beta A(0)}\right)^{-1} \tag{68}
\end{align*}
$$

Given sufficient time, the final value is that described by (61). However, the maximum time allocated for the amplifier to settle is limited by half the sampling clock period, $T_{S}$. This emphasizes the maximization of the amplifier $G B W$ through circuit configuration selection, and $\beta$ through input parasitic consideration, especially when high accuracy settling is desired. However, this is only half the picture.

Equation (67) holds true if and only if the amplifier is capable of handling any level of input. In reality the amplifier's ability to drive current into its output impedance is generally limited by the amount of current available to bias it. In Fig. 36, M0 is referred to as the tail of the amplifier, which provides the tail current $I_{T}$. For large step inputs, one of the input drivers enters into the subthreshold region (shuts off), while the other is strongly driven to conduct all of $I_{T}$. The current $I_{T}$ is then driven directly, or indirectly through the current mirror, into the output load. In this operation mode, the amplifier is said to be slewing, and the linear settling behavior described by (67) is no longer valid. The rate at which the output voltage changes with respect to time in the slewing mode defines the slew rate, $S R$, of the amplifier as described by (69).

$$
\begin{equation*}
S R=\frac{\partial V o u t}{\partial t}=\frac{I_{T}}{C_{L}} \tag{69}
\end{equation*}
$$

Now it can be shown that Vout may be expressed by (70) when the slew rate is taken into account for zero initial conditions [54]-derivations are also available in appendix A . Furthermore, given an absolute settling error $\varepsilon_{S}$, the overall settling time $t_{S}$ is expressed by (71) and the slewing time $t_{\text {slew }}$ and linear settling time $t_{\text {lin }}$ are given by (72) and (73) respectively.

$$
\begin{gather*}
\text { Vout }= \begin{cases}S R \cdot t, & t<t_{\text {slew }} \\
-\alpha \cdot \operatorname{Vin}\left(1-\frac{S R}{\alpha \cdot \operatorname{Vin} \cdot \beta \cdot G B W} \cdot e^{-\beta \cdot G B W\left(t-t_{\text {slew }}\right)}\right), & t>t_{\text {slew }} \\
-\alpha \cdot \operatorname{Vin}\left(1-e^{-\beta \cdot G B W \cdot t}\right), & t_{\text {slew }} \leq 0\end{cases}  \tag{70}\\
t_{S}=t_{\text {slew }}+t_{\text {lin }}=\frac{\alpha \cdot \operatorname{Vin}}{S R}+\frac{1}{\beta \cdot G B W}\left[\ln \left(\frac{S R}{\varepsilon_{S} \alpha \cdot \operatorname{Vin} \cdot \beta \cdot G B W}\right)-1\right]  \tag{71}\\
t_{\text {lin }}=\frac{1}{\beta \cdot G B W} \ln \left(\frac{S R}{\varepsilon_{S} \alpha \cdot \operatorname{Vin} \cdot \beta \cdot G B W}\right) \tag{72}
\end{gather*}
$$

The fastest settling performance is inarguably that for all-linear settling, which can be proven by taking the derivative of (71) with respect to $S R$, and substituting the outcome in (72). More importantly, however, using (72) we can find the maximum input step for which no slewing will occur. The result is given by (74).

$$
\begin{equation*}
\left.V i n\right|_{\text {no slew }} \leq \frac{S R}{\alpha \cdot \beta \cdot G B W}=\frac{V_{G S T}}{\alpha \cdot \beta} \tag{74}
\end{equation*}
$$

The expression in (74) shows that slewing is inevitable for high speed Nyquist rate SC applications such as pipeline ADCs. In such applications the input step size between consecutive cycles can be in the order of the system's full-scale voltage, $V_{F S}$, which is much larger than (74) predicts. In part, the amplifier's input pair $V_{G S T}$ is limited to the range of $100-200 \mathrm{mV}$ to maximize signal headroom and input common mode range. On the other hand, $\alpha$ and $\beta$ are set by the amplifier's configuration and their product can loosely be considered unity for high speed designs. These conditions yield a maximum non-slewing step of roughly $0.2 V_{F S}$ in present-day CMOS technologies. Therefore the only viable approach to reducing the settling time is to enhance both $S R$ and $G B W$ of the amplifier, and hence it is critical to account for the amplifier slewing performance in the power budget.

## 5. Distortion

By examining the square law I-V relationship in (53) we can conclude that the MOS transistor is not a linear device. However, through the use of feedback, CMOS amplifiers are used successfully to implement highly linear functions. Intuitively, because of the high amplifier gain, the positive and negative input terminals of the amplifiers experience a virtual short; that is the amplifier's input terminals seem fixed about the ideal quiescent operating point for any input signal.

The paradox of feedback, however, is the use of a non-linear device to linearize itself. The transfer function described by (61) is ideally given by the ratio $C_{1} / C_{2}$, but the finite amplifier gain results in an error term. When an input is applied, the output swing
modulates the $V_{D S}$ of the output devices thereby changing the amplifier gain and subsequently the error term in the transfer function. To reflect this behavior, (61) is modified to (75) where the gain is now a function of the input. Also, Fig. 43 depicts this graphically for the absolute values of (61) and (75). The result is an input dependent gain error for large output swings and consequently observable distortion at the output, which is undesirable in many applications such as pipeline ADCs.

$$
\begin{equation*}
\frac{\text { Vout }}{V \text { in }}=-\frac{C_{1}}{C_{2}}\left(1+\frac{C_{1}+C_{2}+C_{P}}{A(\text { Vin }) C_{2}}\right)^{-1} \cong-\frac{C_{1}}{C_{2}}\left(1-\frac{C_{1}+C_{2}+C_{P}}{A(\text { Vin }) C_{2}}\right) \tag{75}
\end{equation*}
$$



Fig. 43. Amplifier gain degradation for large output swings.

Another source of distortion is the $S R$ limited settling. Consider the flip-around S/H amplifier discussed in Chapter III. For the case of all-linear settling, the final output is expressed by (76) where all parameters with the $S_{S H}$ subscript have the same previous
definitions, but pertaining to the flip-around $\mathrm{S} / \mathrm{H}$. Note that the error term is independent of Vin and therefore is a fixed for all input values. If the settling becomes $S R$ limited, however, then the output is expressed by (77).

$$
\begin{gather*}
\text { Vout }=-\alpha_{S H} \operatorname{Vin}\left(1-e^{-\beta_{S H} G B W \frac{T_{S}}{2}}\right)  \tag{76}\\
\text { Vout }= \begin{cases}-\alpha_{S H} \operatorname{Vin}\left(1-e^{-\beta_{S H} G B W \frac{T_{S}}{2}}\right), & \operatorname{Vin} \leq \frac{V_{G S T_{S H}}}{\alpha_{S H} \beta_{S H}} \\
-\alpha_{S H} \operatorname{Vin}\left(1-\frac{S R_{S H}}{\alpha_{S H} \operatorname{Vin} \cdot \beta_{S H} G B W} \cdot e^{-\left[1+\beta_{S H} G B W\left(\frac{T_{S}}{2}-\frac{\alpha_{S H} V i n}{S R_{S H}}\right)\right]}\right), \operatorname{Vin}>\frac{V_{G S T_{S H}}}{\alpha_{S H} \beta_{S H}}\end{cases} \tag{77}
\end{gather*}
$$

From (76) we conclude that for the all-linear settling, the value of $T_{S} / 2$ is irrelevant as far as distortion is concerned, because it will result in a consistent gain error for all values of Vin. The same is true for the $S R$ limited settling, but for a small range of input values. As Vin crosses the limit described in (74), the settling error becomes input dependent and results in distortion. This is depicted in Fig. 44, where the thick solid trace is the input, and thin solid trace and dotted trace are the all-linear and $S R$ limited settling of the $\mathrm{S} / \mathrm{H}$ output respectively. The upper traces are the real-time waveforms, while the bottom traces are the end-points plot of the S/H output in the hold phase. Note that the input trace is shifted forward in time by half a clock cycle to correspond to the end points of the S/H traces. In the shaded regions of Fig. 44, the output is compressed for the $S R$ limited case, whereas otherwise it follows the all-linear settling curve.

Therefore, when considering SC linear applications, the linearity is often expressed as a maximum settling error specification; say $\varepsilon_{\max }$, which for the $\mathrm{S} / \mathrm{H}$ example above is expressed by (78).


Fig. 44. Slew rate induced distortion for fixed settling time, $T_{S} / 2$; thick solid: Vin, thin solid: Vout (linear settling) and dashed: Vout (slew rate limited settling).

$$
\begin{equation*}
\varepsilon_{\max } \geq \frac{S R_{S H}}{\alpha_{S H} V i n_{\max } \cdot \beta_{S H} G B W_{S H}} \cdot e^{-\left[1+\beta_{S H} G B W_{S H}\left(\frac{T_{S}}{2}-\frac{\alpha_{S H} V i_{\max }}{S S_{S H}}\right)\right]}, \quad \operatorname{Vin}_{\max }>\frac{V_{G S T_{S H}}}{\alpha_{S H} \beta_{S H}} \tag{78}
\end{equation*}
$$

## 6. Noise

Noise in circuits can be generally defined as spontaneous fluctuations in current, voltage or temperature that set the lower limit on measurements taken on a system under test [55]. In many applications, such as data converters, continuous time filters and audio
amplifiers, these fluctuations are the limiting factors in a design's dynamic performance.
There are two dominant noise sources that affect the performance of CMOS circuits, namely the thermal and flicker noise. Thermal noise is attributed to the random charge carrier motion in a conductor due to the ambient temperature. As for the flicker noise, the major contributor is the interaction of the charge carriers with the $\mathrm{Si}^{-} \mathrm{SiO}_{2}$ interface, where the carriers are trapped and then released back into the channel in a random process [55]. These noise sources can be modeled in MOS devices as a current source in parallel with the device representing the power spectral density, which is given by (79) where the first and second terms represent the thermal and flicker noise contributions respectively. Here, $k_{B}, T, \gamma, K_{F}$ and $f$ represent Boltzman's constant, the absolute temperature, noise factor, process dependent flicker noise coefficient and frequency respectively. Equation (79) is a simplified form of other complex models [56][57], but is very useful for design.

$$
\begin{equation*}
\overline{i_{o}^{2}}=\left(4 k_{\mathrm{B}} T \gamma g m+\frac{K_{F} I_{D}}{C_{o x} L^{2} f}\right) \cdot \Delta f \tag{79}
\end{equation*}
$$

To aid the derivation of the amplifier noise, we refer to the amplifier noise model in Fig. 45 from which we can express the maximum noise seen at the output by (80). To minimize clutter, the thermal and flicker noise components will be examined separately by substituting (79) into (80).


Fig. 45. Noise sources modeled for the amplifier in Fig. 36(a).

$$
\begin{equation*}
\overline{i_{o n}^{2}}=i_{n 1}^{2}+i_{n 2}^{2}+i_{n 3}^{2}+i_{n 4}^{2} \tag{80}
\end{equation*}
$$

Beginning with the thermal noise, the total noise seen at the output is expressed by (81), which is the simplified form if we account for $g m_{l}=g m_{2}$ and $g m_{3}=g m_{4}$. Referring the amplifier's noise to the input makes it easy to compare with the input signal, and to do so we divide the output noise by the square of the amplifier's transconductance $g m_{l}$. The result is given by (82).

$$
\begin{gather*}
\overline{i_{o n}^{2}}=8 k_{\mathrm{B}} T \gamma\left(g m_{1}+g m_{3}\right) \cdot \Delta f  \tag{81}\\
\overline{v_{i T}^{2}}=\frac{\overline{i_{o n}^{2}}}{G m^{2}}=\frac{8 k_{\mathrm{B}} T \gamma}{g m_{1}}\left(1+\frac{g m_{3}}{g m_{1}}\right) \cdot \Delta f \\
=\frac{8 k_{\mathrm{B}} T \gamma}{\sqrt{2 \mu_{P} C_{o x} I_{D 1}\left(W_{1} / L_{1}\right)}}\left(1+\sqrt{\frac{\mu_{N}}{\mu_{P}} \frac{\left(W_{3} / L_{3}\right)}{\left(W_{1} / L_{1}\right)}}\right) \cdot \Delta f \tag{82}
\end{gather*}
$$

Following the same approach with respect to the flicker noise component, the output noise and input referred noise are given by (83) and (84) respectively. Note that the results of (81)-(84) are the same for the fully differential amplifier of Fig. 36(b) since the sum of currents at the output is the same.

$$
\begin{align*}
\overline{i_{o n}^{2}} & =\frac{I_{D}}{C_{o x} f}\left(2 \frac{K_{F P}}{L_{1}^{2}}+2 \frac{K_{F N}}{L_{3}^{2}}\right) \cdot \Delta f  \tag{83}\\
\overline{v_{i y_{f}}^{2}}=\frac{\overline{i_{o n}^{2}}}{G m^{2}} & =\frac{2 I_{D}}{g m_{1}^{2} C_{o x} f}\left(\frac{K_{F P}}{L_{1}^{2}}+\frac{K_{F N}}{L_{3}^{2}}\right) \cdot \Delta f \\
& =\frac{K_{F P}}{\mu_{P} W_{1} L_{1} C_{o x}^{2} f}\left[1+\frac{K_{F N}}{K_{F P}}\left(\frac{L_{1}}{L_{3}}\right)^{2}\right] \cdot \Delta f \tag{84}
\end{align*}
$$

There are a few conclusions that can be drawn from (82) and (84) regarding the amplifier's noise. First, and foremost, the $g m$ of the input pair has a huge impact on the overall amplifier noise performance. Note that referring the output noise to the input was done through the input pair $g m$, and hence it is crucial to maximize the input $g m$ beyond any other transistor; in a truly low noise amplifier, the maximum gm is that of the input pair. Second, the sizing plays a direct role in the noise performance. From (82) we see the ratio of the dimensions of $M 1$ and $M 3$ and also in (84) in terms of the channel length and input pair area. Moreover, a closer look at (84) shows that the flicker noise expression is a quadratic function of $L_{1}$, and hence there exists a value for $L_{l}$ at which the flicker noise is minimized and is given by (85).

$$
\begin{equation*}
L_{1}=L_{3} \sqrt{\frac{K_{F P}}{K_{F N}}} \tag{85}
\end{equation*}
$$

Finally, the device type is also a factor in the amplifier's noise. Both (82) and (84) contain parameters that are device type dependent, namely the mobility $\mu_{N, P}$ and the flicker noise coefficient $K_{F N, P}$. This brings up the question of which device type is most suitable for the input pair for the best noise performance. To answer part of the question, let us first consider the device mobility. The electron mobility $\mu_{N}$ is 3-4 times faster than the hole mobility $\mu_{P}$ in semiconductors [58]-[59], and hence it is best to use NMOS devices as the input pair to minimize the thermal noise.

On the other hand, in CMOS technologies the transistors gates are commonly $\mathrm{n}+$ polysilicon, which result in a buried-channel-like behavior for PMOS devices that minimizes the interaction of the carriers with the $\mathrm{Si}-\mathrm{SiO}_{2}$ interface and consequently the flicker noise coefficient [60]-[62]. This results in $K_{F P}$ to be an order of magnitude smaller than $K_{F N}$, and hence PMOS devices are ideal as input pair devices to minimize the flicker noise. For p+ polysilicon gates, however, $K_{F P}$ and $K_{F N}$ are of the same order.

Therefore, the answer as to which device type is more suitable for low noise amplifier design seems different for each type of noise. Nonetheless, in light of the advancement of CMOS technologies, the contribution of each noise type is also different. From (79), the thermal noise has a flat behavior with frequency, while the flicker noise is inversely proportional to frequency. Equating the two noise components and solving for $f$, yields the noise corner frequency, $f_{n c}$, given by (86).

$$
\begin{equation*}
f_{n c}=\frac{K_{F} I_{D}}{4 k_{B} T \gamma g m C_{o x} L^{2}}=\frac{K_{F} V_{G S T}}{8 k_{B} T \gamma C_{o x} L^{2}} \tag{86}
\end{equation*}
$$

Using (86), the noise corner frequency of present-day CMOS technologies is in the 100 s of MHz , which makes the flicker noise the major contributor to amplifier noise in many applications. Hence, for overall noise performance, it is generally better to use PMOS devices as the input pair of amplifiers.

In summary, to optimize the noise performance of amplifiers for high speed, low voltage and low power applications, one can follow these guidelines:

- Use PMOS devices for the input pair, unless p+ polysilicon is used for the gate material, or extremely high speed applications ( GHz range) are targeted.
- Maximize $I_{D}$ of the input pair within the given power budget.
- Maximize the area and the aspect ratio of the input pair devices ( $W L$ and $W / L$ ).
- Once all device aspect ratios are defined, use (85) to set the value of the input pair channel length, but maintain the aspect ratio.


## 7. Process Variations

Amplifiers are ideally symmetric by design, but once fabricated a zero-input signal commonly results in a finite output. This behavior is attributed to minor asymmetries that are either systematic or random but together constitute what is known by offset. Systematic offset is imposed on the amplifier by the design and choice of
quiescent operating points of devices and can be reduced by good design practices. Random offset, on the other hand, is the result of manufacturing process variations.

Manufacturing process variations are a result of finite lithography resolution, non-uniform oxide thickness growth and doping profiles across the wafer's plane. A mismatch model [63] based on the study of equal area rectangular devices, states that the one-dimensional variance of a parameter $P$ can be expressed by (87) where $A_{P}$ is the area proportionality constant for parameter $P, W$ and $L$ are the device dimensions, $S_{P}$ is the variation of $P$ with spacing, and $D_{x}$ is the distance between two devices along $x$. In practice, critical devices such as the input pair or current mirrors are interdigitated or cross-coupled, and hence $D_{x}$ approaches zero, and the second term of (87) can be neglected. Using (53), the drain-current variance due to process variation can be expressed as in (88) assuming $\sigma^{2}\left(V_{T}\right)$ and $\sigma^{2}\left(\beta_{I}\right)$ are uncorrelated [63]-[66]. Here $\beta_{I}$ represents $\mu C_{o x} W / L$. Equation (88) is very useful, because from a circuit analysis stand point, the drain-current variance can be treated as a small signal that can be referred to the gate of the MOS device through its transconductance, $g m$. The result is given by (89).

$$
\begin{gather*}
\sigma^{2}(P)=\frac{A_{P}^{2}}{W L}+S_{P}^{2} D_{x},  \tag{87}\\
\sigma^{2}\left(I_{D}\right)=4 I_{D}^{2} \frac{\sigma^{2}\left(V_{T}\right)}{V_{G S T}^{2}}+I_{D}^{2} \frac{\sigma^{2}\left(\beta_{I}\right)}{\beta_{I}^{2}}  \tag{88}\\
\sigma^{2}\left(V_{G S}\right)=\sigma^{2}\left(V_{T}\right)+\frac{1}{\left(g m / I_{D}\right)^{2}} \frac{\sigma^{2}\left(\beta_{I}\right)}{\beta_{I}^{2}} \tag{89}
\end{gather*}
$$

It was demonstrated in [67] that for nominal biasing conditions ( $V_{G S T}<650 \mathrm{mV}$ ), $\sigma^{2}\left(V_{T}\right)$ is the dominant mismatch factor. Moreover, in analog design the term $g m / I_{D}$ is generally maximized, which further supports the dominance of $\sigma^{2}\left(V_{T}\right)$ and reduces the effect of the second term of (89). Therefore, we can reduce (89) to (90) to describe the gate-referred (input-referred) current variance in MOS devices. Here $A_{V_{T}}$ is the area proportionality constant for the threshold voltage, $V_{T}$.

$$
\begin{equation*}
\sigma^{2}\left(V_{G S}\right) \cong \sigma^{2}\left(V_{T}\right)=\frac{A_{V_{T}}^{2}}{W L} \tag{90}
\end{equation*}
$$

Using (90), the input offset variance can be expressed as the sum of all device drain current variances at the amplifier's output, and then referred to the input using the amplifier's $g m$. This analysis is quite similar to the noise analysis and the noise model of Fig. 45 can be used here, but with the substitution of $\sigma^{2}\left(I_{D}\right)=g m^{2} \sigma^{2}\left(V_{G S}\right)$ for $\overline{i_{n}^{2}}$. Thus, the total output current variance and input offset variance of the amplifier in Fig. 36(a) are given by (91) and (92) respectively, which again is the simplified form if we account for $g m_{1}=g m_{2}$ and $g m_{3}=g m_{4}$.

$$
\begin{gather*}
\left.\sigma^{2}\left(I_{D}\right)\right|_{\text {output }}=2\left[g m_{1}^{2}\left(\frac{A_{V_{T P}}^{2}}{W_{1} L_{1}}\right)+g m_{3}^{2}\left(\frac{A_{V_{T V}}^{2}}{W_{3} L_{3}}\right)\right]  \tag{91}\\
\sigma^{2}\left(V_{O S}\right)=2\left[\left(\frac{A_{V_{T P}}^{2}}{W_{1} L_{1}}\right)+\frac{g m_{3}^{2}}{g m_{1}^{2}}\left(\frac{A_{V_{T V}}^{2}}{W_{3} L_{3}}\right)\right]=2 \frac{A_{V_{T P}}^{2}}{W_{1} L_{1}}\left[1+\frac{\mu_{N}}{\mu_{P}}\left(\frac{A_{V_{T N}}}{A_{V_{T P}}}\right)^{2}\left(\frac{L_{1}}{L_{3}}\right)^{2}\right] \tag{92}
\end{gather*}
$$

Note again that the results in (91) and (92) are the same for the fully differential amplifier of Fig. 36(b). Moreover, a minimum offset can be achieved for a particular design if the length of the input pair device is sized as in (93).

$$
\begin{equation*}
L_{1}=L_{3} \sqrt{\frac{\mu_{P}}{\mu_{N}}} \cdot \frac{A_{V_{T P}}}{A_{V_{T N}}} \tag{93}
\end{equation*}
$$

Before we discuss the design guideline for low input offset, let us examine the parameter $A_{V_{T}}$. From a physical approach, it was noticed experimentally that $A_{V_{T}}$ scales down with the gate oxide thickness $t_{o x}$ [63], regardless of the device type, and a theoretical expression of this relationship is given by (94) [67], where $D_{\text {Total }}$ is the total dopant implant dose in the bulk material of the device, $q$ is the electron charge and $\varepsilon_{o x}$ is the silicon oxide permittivity.

$$
\begin{equation*}
A_{V_{T}}=\sqrt{\frac{D_{\text {Total }}}{2}} \cdot \frac{q}{\varepsilon_{o x}} \cdot t_{o x} \tag{94}
\end{equation*}
$$

Therefore, as CMOS technologies scale down in terms of $t_{o x}$, the matching characteristics are enhanced through the reduction seen in $A_{V_{T}}$. Yet, this is not fully reflected in device matching since the device dimensions are also scaled down and the matching performance can actually become worse. What is interesting, however, is the dependence on the doping dose in the bulk. With advanced CMOS technologies, low $V_{T}$
devices are becoming more robust and their use in low voltage applications more diverse. Their lower $V_{T}$ is achieved by a counter type implant in the bulk region immediately under the gate where the channel is formed. This effectively reduces the total dose and improves their matching performance over regular devices.

In conclusion, the following design guidelines can be adopted for low input offset amplifiers:

- Maximize the width of the input pair devices.
- Maximize the length of devices apart from the input pair.
- Set the input pair device length as given by (89).
- Use low $V_{T}$ devices if they can support the design requirements.
- The best type of device for the input pair is dependent on the $A_{V_{T}}$ parameter for each device type.


## B. Amplifiers for Pipeline ADCs

The preceding discussion demonstrated the many aspects of amplifier design, and the choice of the amplifier architecture to use in a pipeline cell must consider all to guarantee proper performance within the boundaries imposed by power and area. Fig. 46 shows the three fundamental topologies that are commonly seen in pipeline ADCs, and they are the Telescopic, Current Mirror and Folded Cascode OTAs. The following is a brief comparison of each and a presentation of their key performance metrics; detailed expressions of their performance metrics are given in Appendix C.


Fig. 46. Fundamental OTA architectures; (a) telescopic, (b) current mirror and (c) folded cascode.

If we assume the transconductance $g m_{l}$ of $M_{l}$ to be the same for all amplifiers in Fig. 46, then the summary of parameters in Table 3 can easily be used to summarize the strengths and weakness of each amplifier topology as follows.

TABLE 3
Fundamental Parameter Summary of OTAs Seen in Fig. 46

| Parameter | Telescopic | Current Mirror | Folded Cascode |
| :---: | :---: | :---: | :---: |
| Minimum Supply | $V_{G S}+3 V_{D S, \text { sat }}$ | $V_{G S}+2 V_{D S, s a t}$ | $V_{G S}+2 V_{D S, s a t}$ |
| Maximum Signal Swing | $V_{D D}-5 V_{D S, \text { sat }}$ | $V_{D D}-4 V_{D S, s a t}$ | $V_{D D}-4 V_{D S, s a t}$ |
| Total Current (ITOT) | $I_{T}$ | $I_{T}(1+K)$ | $2 I_{T}$ |
| Transconductance (Gm) | $g m_{1}$ | $K g m_{1}$ | $g m_{1}$ |
| LHP Poles | 2 | 3 | 2 |
| Slew rate (for a $C_{L}$ load) | $\frac{I_{T}}{C_{L}}$ | $\frac{K I_{T}}{C_{L}}$ | $\frac{I_{T}}{C_{L}}$ |
| Input Referred Thermal |  |  |  |
| Noise * $\left(16 k T / 3 g m_{1}\right)$ | $\left(1+\frac{g m_{7}}{g m_{1}}\right)$ | $\left(1+\frac{g m_{3}}{g m_{1}}+\frac{g m_{5}+g m_{11}}{K^{2} g m_{1}}\right)$ | $\left(1+\frac{g m_{3}+g m_{9}}{g m_{1}}\right)$ |
| Efficiency $\left(G m / I_{T O T}\right)$ | $\frac{g m_{1}}{I_{T}}$ | $\frac{K g m_{1}}{(K+1) I_{T}}$ | $\frac{g m_{1}}{2 I_{T}}$ |

The Telescopic OTA is undoubtedly the most power efficient amplifier. It also has the best frequency response in term of the number of poles and zeros and their locations, and the least noise contribution. On the other hand, it requires a higher supply than both the Current Mirror and Folded Cascode OTAs, and leaves the least headroom for signal swing at the output, which makes it the least suitable for low voltage applications despite its other strengths. Overall, the Telescopic OTA is best suited for designs that require very high speed and low noise performance where the signal swing is not very critical.

The Current Mirror OTA is superior to the Telescopic in term of low voltage
operation. It can also deliver a higher slew rate performance and a wider GBW, but at the expense of higher power dissipation. However, its noise contribution is the highest of all amplifiers, and its efficiency is lesser than that of the Telescopic OTA. Furthermore, while the efficiency can be enhanced by increasing $K$, the frequency response is degraded as $K$ plays a significant role is the placement of one of its poles. Overall, the Current Mirror OTA is best suited for low voltage and large signal swing applications, where the noise, speed and power performances are not very critical.

Finally, the Folded Cascode OTA is the comfortable medium of both the Telescopic and Current Mirror OTA. It shares a good low voltage performance with the Current Mirror OTA, but without the additional noise. It also shares a very good frequency performance with the Telescopic OTA, but a degraded efficiency. Overall, the Folded Cascode OTA is suitable for a larger range of applications where speed, noise and signal headroom are valued. It is not surprising then to see the increased use of the Folded Cascode amplifier in the literature in state-of-the-art designs whether as a single stage or the first stage in a multi-stage amplifier [26], [68]-[73].

## C. Amplifier Enhancement

Some of the amplifier requirements needed to implement pipeline cells cannot be satisfied by a simple OTA as those presented in section B. For example, a 200MS/s 10bit resolution MDAC requires an amplifier with a minimum gain of 66 dB to keep the static settling errors within $1 / 2 L S B$. Yet, in modern CMOS technologies the gain attainable by any of the OTAs in Fig. 46 may not exceed 50 dB while operating at that speed. To
overcome this limitation, amplifier enhancement techniques are widely used with the most common being multi-stage cascading and regulated cascode gain boosting.

## 1. Multi-Stage Amplifiers

Gain enhancement using multi-stage amplifiers is a well-established technique that relies on the cascading of two or more gain stages as depicted by the Miller compensated example in Fig. 47. The low frequency gain of the multi-stage amplifier is the product of all cascaded gain stages; $A=A_{1} A_{2}$ for the example in Fig. 47. However, each additional gain stage introduces another low frequency pole, which limits the bandwidth of the amplifier and more importantly increases the risk of instability. Therefore, multi-stage amplifiers utilize different compensation techniques to overcome such limitations [74], and the most popular is the Miller compensation represented by the dashed lines in Fig. 47.


Fig. 47. Miller compensated multi-stage amplifier.

In addition to the increased gain of the overall amplifier, multi-stage amplifiers can utilize the full supply range for signal swing by adopting a rail-to-rail output stage.

This is a significant advantage in low voltage applications as it maximizes the noise tolerance. However, multi-stage amplifiers are often bulky and power hungry due to the use of compensation capacitors, and while some techniques attempt to alleviate these drawbacks, multi-stage amplifiers are still no match to the speed and power efficiency of a single stage amplifier.

## 2. Gain Boosting

Another amplifier enhancement technique is the regulated cascode gain boosting. Instead of cascading several amplifiers to increase the gain, the regulated cascode gain boosting approach relies on active feedback to limit the $V_{D S}$ variations at the source of the cascode device. This is illustrated in Fig. 48 for a Telescopic OTA, but can readily be applied to Current Mirror and Folded Cascode OTAs as well.

The feedback loop of the auxiliary amplifier and the cascode transistor enhances the output impedance, and thus increases the amplifier gain [75]. An advantage of this approach over multi-stage amplifiers is the transparency of the gain enhancement to the signal path; it does not add low frequency poles that slow down the amplifier or compromise its stability. However, as is the case with any feedback loop, some conditions must be satisfied as expressed by (95). Namely, the $G B W$ of the auxiliary amplifier needs to be greater than the closed loop bandwidth of the main amplifier, and less than its non-dominant pole; the first condition guarantees no pole-zero doublets within the closed loop bandwidth that may introduce slow settling components, while the second ensures the stability of the auxiliary amplifier and cascode transistor loop.

$$
\begin{equation*}
\left.\omega_{-3 d B}\right|_{\text {Closed Loop }}<\left.\omega_{u}\right|_{\text {Boosting }}<\left.\omega_{P 2}\right|_{\text {main opamp }} \tag{95}
\end{equation*}
$$



Fig. 48. Regulated cascode gain boosting applied to a telescopic OTA.

Overall, as the auxiliary amplifiers only drive their respective cascode device, they are quite small, which makes the regulated cascode gain boosting method more efficient than utilizing a multi-stage amplifier from a power and area stand point. However, the main drawback to the regulated cascode approach is the limited head for the output swing.

## D. Conclusions

Numerous aspects of amplifier design have a direct impact on the performance of pipeline cells. Through a systematic approach, key performance metrics of amplifiers were derived and discussed with a focus on building robust amplifiers for the application of high speed, high resolution and high efficiency $\mathrm{S} / \mathrm{H}$ and MDAC pipeline cells. Particular emphasis was given to distortion, noise and offset as they are detrimental to the operation of such cells. Moreover, a survey of the fundamental OTA architectures is undertaken to isolate the best overall design for high speed, low voltage and low power applications. Finally, the widely used gain enhancement techniques are described, and for optimum efficiency the regulated cascode gain boosting approach is recommended. However, a combination of gain boosted first stage followed by a rail-to-rail output stage serves as the best compromise.

## CHAPTER V

## RECYCLING FOLDED CASCODE*

Chapter IV highlighted the strengths of the Folded Cascode (FC) amplifier, which recently has become one of the most commonly used amplifier architectures in low voltage pipeline ADCs whether in the single or multi-stage form. Therefore, it is not surprising to see efforts in the literature aimed to enhance its performance, and other amplifiers in general.

Earlier work to enhance the performance of the FC used multi-path schemes [76] and [77]. Other multi-path schemes such as [78] and [79] were applied to the CurrentMirror OTA to specifically enhance the output impedance and slew rate, and to emulate a class $A B$ output stage operation. The concept behind these schemes is to split the input signal among several paths whose sum at the amplifier output is in phase-similar to a current mirror active load in single-ended amplifiers. This may result, however, in numerous low frequency pole-zero pairs in the amplifier's open loop transfer function that may render it useless for high speed applications such as in Nyquist ADCs. While [76]-[79] may fall in that category of slower amplifiers, they nevertheless form the basis of the proposed modification to the FC amplifier presented in Section $A$.

[^1]
## A. Proposed Folded Cascode Amplifier

The conventional FC is shown again in Fig. 49 for convenience. Note how transistors M3 and M4 are the two transistors in the signal path that conduct the most current, and in many designs have the largest transconductance. However, their role is strictly limited to providing a folding node for the small signal current generated by the input differential pair M1 and M2.


Fig. 49. The conventional folded cascode amplifier.

To address this inefficiency, a modified FC amplifier is presented in Fig. 50. The proposed modifications are intended to use M3 and M4 as driving transistors [80], and proceed as follows. First, the input differential pair, M1 and M2 (Fig. 49), are split in half to produce transistors M1a, M1b, M2a, and M2b (Fig. 50). Next, M3 and M4 (Fig.
49) are split to form the current mirrors M3a:M3b and M4a:M4b with a ratio of $K: 1$ (Fig. 50), where $K$ is chosen to be 3 strictly to maintain the same current consumption as the FC in Fig. 49; naturally $K$ has to be greater than 1 to maintain current in the output devices M5-M10. The diode connected transistors, M3b and M4b, are cross-coupled with the input transistors, $M 2 b$ and $M 1 b$, to ensure the small signal currents added at the sources of M5 and M6 are in phase. Finally, cascode devices may be added inside the diode connection of $M 3 b$ and $M 4 b$ to reduce the current dependency on $V_{D S}$ and improve matching in the current mirrors. The modified Folded Cascode will be referred to by the Recycling Folded Cascode (RFC), since we are reusing, or recycling, existing devices and currents to perform an additional task.


Fig. 50. The recycling folded cascode (RFC) amplifier.

## B. Recycling Folded Cascode Characteristics

The modifications presented in Section $A$ provide the RFC with enhanced features over the FC. In this section, various amplifier merits will be derived for the RFC to present these enhancements quantitatively, similar to Chapter IV. This will also help examine the costs associated with the proposed modifications from a design perspective.

## 1. Small Signal Transconductance

We first examine the amplifier's transconductance, Gm. By finding the shortcircuit current at the output of the RFC with respect to its input, its small signal transconductance can be expressed as in (96). Since M1 in the original FC is twice the size of M1a and conducts twice the amount of current for the same bias conditions, then $g m_{l, F C}=2 g m_{l a, R F C}$. Using this result and by substituting for the value of $K=3$ in (96), the low frequency transconductance of the RFC is demonstrated to be twice that of the FC for the same power consumption. In other words, for the same capacitive load, the $G B W$ of the RFC is twice as wide as that of the FC.

$$
\begin{equation*}
G m_{R F C}=g m_{1 a}(1+K) \tag{96}
\end{equation*}
$$

## 2. Low Frequency Gain

It was demonstrated that $G m_{R F C}=2 G m_{F C}$, which results in a 6 dB gain enhancement for the same output impedance. However, $R o_{R F C}$ is also enhanced over $R o_{F C}$. The expression for $R o_{R F C}$ is given by (97) and that of $R o_{F C}$ by (98). Because of the
modifications in RFC, $M 2 a$ and $M 4 a$ conduct less current than their counterparts $M 2$ and M4 of the FC, which enhances their intrinsic output impedance, $r_{d s}$. It is this enhancement in $r_{d s 2 a}$ and $r_{d s 4 a}$ that adds an additional $2-4 \mathrm{~dB}$ to the gain of the RFC. Therefore, an overall low frequency gain enhancement of $8-10 \mathrm{~dB}$ can be seen in the RFC compared to the FC; settling errors associated with the finite amplifier gain are thus reduced by a factor of $\sim 3$ for a RFC over a FC.

$$
\begin{align*}
R o_{R F C} & \cong g m_{6} r_{d s 6}\left(r_{d s 2 a} \| r_{d s 4 a}\right) \| g m_{8} r_{d s 8} r_{d s 10}  \tag{97}\\
R o_{F C} & \cong g m_{6} r_{d s 6}\left(r_{d s 2} \| r_{d s 4}\right) \| g m_{8} r_{d s 8} r_{d s 10} \tag{98}
\end{align*}
$$

## 3. Phase Margin

The phase margin was identified in Chapter IV as a good indicator to the transient response of an amplifier. The expected phase margin expression of the RFC is given by (99), and the corresponding pole-zero locations by (100). The pole-zero locations in the s-domain are depicted in Fig. 51; it is assumed that all non-dominant poles-zeros are beyond $\omega_{u}$ for stability and good phase margin.

$$
\begin{array}{r}
P M_{R F C}=180^{\circ}-\tan ^{-1}\left(\frac{\omega_{u}}{\omega_{p 1}}\right)-\tan ^{-1}\left(\frac{\omega_{u}}{\omega_{p 2}}\right)-\tan ^{-1}\left(\frac{\omega_{u}}{\omega_{p 3}}\right)+\tan ^{-1}\left(\frac{\omega_{u}}{\omega_{z 1}}\right) \\
\omega_{u}=\frac{G m_{R F C}}{C_{L}}, \omega_{p 1}=\frac{-1}{R o_{R F C} C_{L}}, \omega_{p 2} \cong \frac{-g m_{3 b}}{(1+K) C g s_{3 b}}, \omega_{p 3} \cong \frac{-g m_{5}}{C g s_{5}}, \omega_{z 1}=(1+K) \omega_{p 2} \tag{100}
\end{array}
$$



Fig. 51. Pole-zero locations of the RFC in the s-domain.

In comparison with the FC, the RFC has the same poles, but also an additional pole-zero pair, $\omega_{p 2}$ and $\omega_{z 1}$, associated with the current mirrors M3a:M3b (and M4a:M4b). However, this pole-zero pair is associated with NMOS devices, which puts it at a high frequency. Moreover, the choice of $K$ plays a significant role in determining the phase margin, so the selection of $K$ will be limited by the amplifier application; for high speed applications $K$ is ideally chosen such that $\omega_{p 2}>3 \omega_{u}$, which can be used to place an upper boundary on $K$ as described by (101).

$$
\begin{equation*}
\omega_{p 2}>3 \omega_{u} \Rightarrow K<\sqrt{\frac{g m_{3 b} C_{L}}{3 g m_{1 a} C g s_{3 b}}}-1 \tag{101}
\end{equation*}
$$

## 4. Slew Rate

For this derivation, we again assume a single-ended capacitive output load, $C_{L}$, and a large signal seen at the inputs of the RFC. Suppose Vin + goes high, it follows that $M 1 a$ and M1b turn off, which forces $M 4 b$ and $M 4 a$ to turn off. Consequently, the drain potential of M4a rises and M6 is turned off whereas $M 2 a$ is forced into deep triode. This directs the tail current, $I_{T}$, into $M 2 b$, which in turn is mirrored by a factor $K(M 3 b: M 3 a)$
into M5. Therefore, the total current discharging $C_{L}$ at Vout- is $I_{T}(3 K+1) / 4$. On the other hand, only $I_{T}(K-1) / 4$ is available to charge $C_{L}$ at Vout+. This results in a common mode error, which is corrected by the common mode feedback such that the current discharging/charging $C_{L}$ at either output is the same and equal to $K I_{T} / 2$. The reverse chain of events occurs when Vin- goes high, and the end result is a symmetric slew rate expressed by (102) for a differential load of $C_{L} / 2$. The FC slew rate is repeated in (103).

$$
\begin{gather*}
S R_{R F C}=\frac{K I_{T}}{C_{L}}  \tag{102}\\
S R_{F C}=\frac{I_{T}}{C_{L}} \tag{103}
\end{gather*}
$$

By examining (102) and (103), and substituting for the value of $K=3$, the slew rate of the RFC is enhanced 3 times over the FC for the same power consumption. This, however, is the maximum theoretical limit for this $K$ value. In an actual design the devices assumed to fully turn off still conduct some residual current, and hence reduce the amount reaching the output. Moreover, the accuracy of the current mirrors is degraded for large transients. Nevertheless, with proper sizing and biasing of devices, a slew rate enhancement greater than 2 can be realized for $K=3$.

## 5. Noise

The modifications proposed in the RFC add additional devices and alter the signal path. It is therefore imperative to study their effects on the noise performance.

For the purpose of comparison with the FC, the thermal and flicker noise components of the RFC are examined individually to reduce clutter, and following the same procedure adopted in Chapter IV. We begin by evaluating the output current thermal noise PSD of the RFC, which is expressed in (104). Note that the cascode devices are excluded in (104), as their noise contribution was shown to be negligible in Chapter IV.

$$
\begin{equation*}
\overline{i_{o T}^{2}}=4 k_{B} T \gamma\left[2 g m_{1 a}+2 K^{2} g m_{1 b}+2 g m_{3 a}+2 K^{2} g m_{3 b}+2 g m_{9}\right] \tag{104}
\end{equation*}
$$

By observing that $g m_{l a}=g m_{l b}$, and $g m_{3 a}=K g m_{3 b}$, (104) becomes

$$
\begin{equation*}
\overline{i_{o T}^{2}}=8 k_{B} T \gamma\left[g m_{1 a}\left(1+K^{2}\right)+g m_{3 a}(1+K)+g m_{9}\right] . \tag{105}
\end{equation*}
$$

The result in (105) can be referred to the input through the amplifier transconductance, $G m_{R F C}$, given by (96) to result in the input referred thermal noise PSD.

$$
\begin{equation*}
\overline{v_{i T}^{2}}=\frac{\overline{i_{o T}^{2}}}{G m_{R F C}^{2}}=\frac{8 k_{B} T \gamma}{g m_{1 a}(1+K)}\left[\frac{\left(1+K^{2}\right)}{(1+K)}+\frac{g m_{3 a}}{g m_{1 a}}+\frac{g m_{9}}{g m_{1 a}(1+K)}\right] \tag{106}
\end{equation*}
$$

Or equivalently by substituting the $g m$ expression of (55),

$$
\begin{align*}
\overline{v_{i T}^{2}}= & \frac{8 k_{B} T \gamma}{\sqrt{\frac{\mu_{P} C_{o x}(W / L)_{1 a} I_{T}}{2}}}\left[\frac{\left(1+K^{2}\right)}{(1+K)^{2}}\right.  \tag{107}\\
& \quad+\frac{1}{1+K} \\
& \left.\sqrt{K \frac{\mu_{N}}{\mu_{P}} \frac{(W / L)_{3 a}}{(W / L)_{1 a}}}+\frac{1}{(1+K)^{2}} \sqrt{(K-1) \frac{(W / L)_{9}}{(W / L)_{1 a}}}\right]
\end{align*}
$$

To minimize the thermal noise component, we can maximize $I_{T}$ and $(W / L)_{1 a}$ and minimize $(W / L)_{3 a},(W / L)_{9}$ and $K$. However, $K$ has to be greater than 1 . Otherwise, M5M10 will conduct zero current.

Now considering the output current flicker noise PSD, we can write

$$
\begin{equation*}
\overline{i_{o f}^{2}}=\frac{I_{T}}{2 C_{o x} f}\left[\frac{K_{F P}}{L_{1 a}^{2}}+K^{2} \frac{K_{F P}}{L_{1 b}^{2}}+K \frac{K_{F N}}{L_{3 a}^{2}}+K^{2} \frac{K_{F N}}{L_{3 b}^{2}}+(K-1) \frac{K_{F P}}{L_{9}^{2}}\right] \tag{108}
\end{equation*}
$$

By observing that $L_{1 a}=L_{1 b}$, and $L_{3 a}=L_{3 b}$, (108) becomes

$$
\begin{equation*}
\overline{i_{o f}^{2}}=\frac{I_{T}}{2 C_{o x} f}\left[\left(1+K^{2}\right) \frac{K_{F P}}{L_{1 a}^{2}}+K(1+K) \frac{K_{F N}}{L_{3 a}^{2}}+(K-1) \frac{K_{F P}}{L_{9}^{2}}\right] \tag{109}
\end{equation*}
$$

To obtain the input referred flicker noise PSD, we refer (109) to the input through $G m_{R F C}$, which results in

$$
\begin{equation*}
\overline{v_{i f}^{2}}=\frac{\overline{i_{o f}^{2}}}{G m_{R F C}^{2}}=\frac{K_{F P}}{\mu_{P} C_{o x}^{2} W_{1 a} L_{1 a} f}\left[\frac{\left(1+K^{2}\right)}{(1+K)^{2}}+\frac{K}{(1+K)} \frac{K_{F N}}{K_{F P}}\left(\frac{L_{1 a}}{L_{3 a}}\right)^{2}+\frac{(K-1)}{(1+K)^{2}}\left(\frac{L_{1 a}}{L_{9}}\right)^{2}\right] \tag{110}
\end{equation*}
$$

To minimize the flicker noise component, we can maximize $W_{1 a}, L_{3 a}$ and $L_{9}$ and minimize $K$. Once $L_{3 a}, L_{9}$ and $K$ are set, a minimum value for (110) occurs at

$$
\begin{equation*}
L_{1 a}=\left[\frac{K(1+K)}{\left(1+K^{2}\right)} \cdot \frac{1}{L_{3 a}^{2}}+\frac{(K-1)}{\left(1+K^{2}\right)} \cdot \frac{1}{L_{9}^{2}}\right]^{-1 / 2} \tag{111}
\end{equation*}
$$

Finally, in order to have a meaningful comparison between the noise performance of the FC and the RFC, (106) and (110) need to be transformed in terms of the parameters found in (C-29) and (C-30) of Appendix C respectively. To do so we note that $g m_{l a}=2 g m_{1} /(1+K), g m_{3 a}=K g m_{3} /(1+K), W_{l}=2 W_{1 a}, L_{l a}=L_{1}, L_{3 a}=L_{3}$ and $K=3$. The transformed (106) and (110) are given by (112) and (114), whereas (C-29) and (C$30)$ are repeated here in (113) and (115), respectively. A first look through (112)-(115) is inconclusive as to which has lesser noise. However, since two terms in (112) and (114) are smaller than their counterparts in (113) and (115), it is likely the RFC has a lesser or equivalent noise to that of the FC.

$$
\begin{gather*}
\overline{v_{i T, R F C}^{2}}=\frac{8 k_{B} T \gamma}{g m_{1}}\left[\frac{5}{4}+\frac{3}{4} \frac{g m_{3}}{g m_{1}}+\frac{1}{4} \frac{g m_{9}}{g m_{1}}\right]  \tag{112}\\
\overline{v_{i T, F C}^{2}}=\frac{8 k_{B} T \gamma}{g m_{1}}\left[1+\frac{g m_{3}}{g m_{1}}+\frac{g m_{9}}{g m_{1}}\right]  \tag{113}\\
\overline{v_{i f, R F C}^{2}}=\frac{K_{F P}}{\mu_{P} C_{o x}^{2} W_{1} L_{1} f}\left[\frac{5}{4}+\frac{3}{2} \frac{K_{F N}}{K_{F P}}\left(\frac{L_{1}}{L_{3}}\right)^{2}+\frac{1}{4}\left(\frac{L_{1}}{L_{9}}\right)^{2}\right]  \tag{114}\\
\overline{v_{i f, R F C}^{2}}=\frac{K_{F P}}{\mu_{P} C_{o x}^{2} W_{1} L_{1} f}\left[1+2 \frac{K_{F N}}{K_{F P}}\left(\frac{L_{1}}{L_{3}}\right)^{2}+\left(\frac{L_{1}}{L_{9}}\right)^{2}\right] \tag{115}
\end{gather*}
$$

## 6. Input Offset

The input offset variance of the RFC can be expressed as the sum of all device drain-current variances seen at the output, and then referred to the input using $G m_{R F C}$.

Using (86), where $A_{V_{T}}$ is the area proportionality constant of $V_{T}$ according to Pelgrom's mismatch model [63], we can write the RFC output drain-current variance as

$$
\begin{equation*}
\sigma^{2}\left(I_{D}\right)=2\left[A_{V_{T P}}^{2}\left(\frac{g m_{1 a}^{2}}{W_{1 a} L_{1 a}}+K^{2} \frac{g m_{1 b}^{2}}{W_{1 b} L_{1 b}}\right)+A_{V_{T N}}^{2}\left(\frac{g m_{3 a}^{2}}{W_{3 a} L_{3 a}}+K^{2} \frac{g m_{3 b}^{2}}{W_{3 b} L_{3 b}}\right)+A_{V_{T p}}^{2} \frac{g m_{9}^{2}}{W_{9} L_{9}}\right] \tag{116}
\end{equation*}
$$

By observing that $W_{1 a}=W_{l b}, L_{1 a}=L_{l b}, W_{3 a}=K W_{3 b}, L_{3 a}=L_{3 b}, g m_{1 a}=g m_{1 b}$, and $g m_{3 a}=$ $\mathrm{Kgm}_{3 b}$, (116) becomes

$$
\begin{equation*}
\sigma^{2}\left(I_{D}\right)=2\left[g m_{1 a}^{2} \frac{A_{V_{T P}}^{2}}{W_{1 a} L_{1 a}}\left(1+K^{2}\right)+g m_{3 a}^{2} \frac{A_{V_{T N}}^{2}}{W_{3 a} L_{3 a}}(1+K)+g m_{9}^{2} \frac{A_{V_{T p}}^{2}}{W_{9} L_{9}}\right] \tag{117}
\end{equation*}
$$

The result in (117) can be converted to the input referred offset through $G m_{R F C}$, which after some algebra gives the input offset variance as

$$
\begin{equation*}
\sigma^{2}\left(V_{O S, R F C}\right)=2 \frac{A_{V_{T P}}^{2}}{W_{1 a} L_{1 a}}\left[\frac{\left(1+K^{2}\right)}{(1+K)^{2}}+\frac{K}{(1+K)} \frac{\mu_{N}}{\mu_{P}}\left(\frac{A_{V_{T N}}}{A_{V_{T P}}}\right)^{2}\left(\frac{L_{1 a}}{L_{3 a}}\right)^{2}+\frac{(K-1)}{(1+K)^{2}}\left(\frac{L_{1 a}}{L_{9}}\right)^{2}\right] \tag{118}
\end{equation*}
$$

To minimize the input offset, we can maximize $W_{1 a}, L_{3 a}$ and $L_{9}$, and minimize $K$. Once $W_{l a}, L_{3 a}, L_{9}$ and $K$ are set, a minimum value for (118) occurs at

$$
\begin{equation*}
L_{1 a}=\left[\frac{K(1+K)}{\left(1+K^{2}\right)} \frac{\mu_{N}}{\mu_{P}}\left(\frac{A_{V_{T N}}}{A_{V_{T P}}}\right)^{2} \frac{1}{L_{3 a}^{2}}+\frac{(K-1)}{\left(1+K^{2}\right)} \cdot \frac{1}{L_{9}^{2}}\right]^{-1 / 2} \tag{119}
\end{equation*}
$$

Again, for a meaningful comparison between the FC and the RFC, (118) needs to be transformed in terms of the parameters found in (C-32). To do so we note that $W_{1}=$ $2 W_{1 a}, L_{1 a}=L_{1}, L_{3 a}=L_{3}$ and $K=3$. The transformed (118) is given by (120), whereas (C32 ) is repeated here in (121) for convenience.

$$
\begin{gather*}
\sigma^{2}\left(V_{O S, R F C}\right)=2 \frac{A_{V_{T P}}^{2}}{W_{1} L_{1}}\left[\frac{5}{4}+\frac{3}{2} \frac{\mu_{N}}{\mu_{P}}\left(\frac{A_{V_{T V}}}{A_{V_{T P}}}\right)^{2}\left(\frac{L_{1}}{L_{3}}\right)^{2}+\frac{1}{4}\left(\frac{L_{1}}{L_{9}}\right)^{2}\right]  \tag{120}\\
\sigma^{2}\left(V_{O S, F C}\right)=2 \frac{A_{V_{T P}}^{2}}{W_{1} L_{1}}\left[1+2 \frac{\mu_{N}}{\mu_{P}}\left(\frac{A_{V_{T N}}}{A_{V_{T P}}}\right)^{2}\left(\frac{L_{1}}{L_{3}}\right)^{2}+\left(\frac{L_{1}}{L_{9}}\right)^{2}\right] \tag{121}
\end{gather*}
$$

Once again, a first look at (120) and (121) is inconclusive as to which has lesser input offset. However, since two terms in (120) are smaller than their counterparts in (121), it is likely the RFC has a lesser or equivalent input offset to that of the FC.

## 7. Efficiency

In Chapter IV an efficiency parameter, $\eta$, was defined as the ratio of the amplifier transconductance to the quiescent DC current consumed. Here we examine the efficiency of the RFC, which is expressed in (122).

$$
\begin{equation*}
\eta_{R F C}=\frac{G m_{R F C}}{I_{T}+\frac{I_{T}}{2}(K-1)}=\frac{g m_{1 a}(1+K)}{\frac{I_{T}}{2}(1+K)}=\frac{2 g m_{1 a}}{I_{T}} \tag{122}
\end{equation*}
$$

The result in (122) is of great interest. Recall that $g m_{l}$ of the FC is twice $g m_{1 a}$ of the RFC. This means the efficiency of the RFC is twice that of the FC given by (C-28), or equal to that of the Telescopic given by (C-6). More importantly, the efficiency of the RFC is independent of the current mirror factor $K$, unlike the efficiency of the current mirror OTA described by (C-17).

## 8. Area and Power

Amplifier design is application specific. However, the amplifier bandwidth, gain and slew rate are arguably the most critical design criteria. The foregoing analysis shows that for practically the same area and power consumption, the RFC delivers twice the bandwidth, $8-10 \mathrm{~dB}$ expected higher gain, and more than twice the slew rate of the FC. Now suppose we take the RFC (call it RFC1) and reduce the widths of all devices by half to produce RFC2, where RFC2 now occupies half the area and uses half the current of the RFC1, and hence the FC. It follows that (96), (102), (112), (114), and (120) become (123), (124), (125), (126), and (127) respectively.

$$
\begin{gather*}
G m_{R F C 2}=\frac{g m_{1}}{4}(1+K)  \tag{123}\\
S R_{R F C 2}=\frac{K I_{T}}{2 C_{L}}  \tag{124}\\
\overline{v_{i T, R F C 2}^{2}}=\frac{8 k_{B} T \gamma}{g m_{1}}\left[\frac{5}{2}+\frac{3}{2} \frac{g m_{3}}{g m_{1}}+\frac{1}{2} \frac{g m_{9}}{g m_{1}}\right] \tag{125}
\end{gather*}
$$

$$
\begin{gather*}
\overline{v_{i f, R F C 2}^{2}}=\frac{K_{F P}}{\mu_{P} C_{o x}^{2} W_{1} L_{1} f}\left[\frac{5}{2}+3 \frac{K_{F N}}{K_{F P}}\left(\frac{L_{1}}{L_{3}}\right)^{2}+\frac{1}{2}\left(\frac{L_{1}}{L_{9}}\right)^{2}\right]  \tag{126}\\
\sigma^{2}\left(V_{O S, R F C 2}\right)=2 \frac{A_{V_{T P}}^{2}}{W_{1} L_{1}}\left[\frac{5}{2}+3 \frac{\mu_{N}}{\mu_{P}}\left(\frac{A_{V_{T N}}}{A_{V_{T P}}}\right)^{2}\left(\frac{L_{1}}{L_{3}}\right)^{2}+\frac{1}{2}\left(\frac{L_{1}}{L_{9}}\right)^{2}\right] \tag{127}
\end{gather*}
$$

Examining (123) shows the RFC2 to have the same transconductance as the FC, whereas (124) shows the RFC2 still has a better slew rate than the FC. Equations (125)(127), however, demonstrate a degradation in noise and input offset compared to RFC1, and hence FC.

## C. Characterization

To validate the theoretical results, a FC amplifier was designed as a benchmark in a mainstream $0.18 \mu \mathrm{~m}$ CMOS process using core devices. The design procedure followed proven analog design practices; a large input pair in weak/moderate inversion to maximize bandwidth and minimize noise and offset, and current mirror devices with long channels biased in strong inversion to improve accuracy and output impedance. The main constraint applied to the design is a power budget of $800 \mu \mathrm{~A}$. Once the FC design was finalized, RFC1 and RFC2 were derived as outlined in Sections $A$ and B.8.

A single-ended version of the amplifiers was used in the setup of Fig. 52 to simplify the characterization. To preserve the high output impedance and limit the DC output current drawn, $R$ was set to be $560 \mathrm{k} \Omega$. As for $C_{1}$ and $C_{2}$ they were set to 2.2 pF and 2.5 pF respectively, which yields an overall load of 3.6 pF .


Fig. 52. Amplifier characterization setup; (a) AC response and noise and (b) transient response and input offset.


Frequency (Hz)
Fig. 53. Amplifiers AC response; (a) magnitude and (b) phase.

As seen in Fig. 53, RFC1 indeed has a wider bandwidth and higher gain than the FC, whereas RFC2 has virtually the same bandwidth, but higher gain than the FC. The higher gain of RFC2 is not surprising; while RFC1 has +6 dB gain due to an enhanced Gm, RFC2 has +6 dB gain because it consumes half the current-the additional $\sim 3 \mathrm{~dB}$ improvement is attributed to $M 2 a$ and $M 4 a$ as explained in Section B.2.

On the other hand, the phase response shows some degradation for both RFC1 and RFC2 with respect to the FC. This is to be expected. As discussed earlier, the addition of current mirrors in the signal path (M3a:M3b-M4a:M4b) introduces additional pole-zero pairs. However, these pole-zero pairs are for an NMOS low gain current mirror, which are at high frequencies. Moreover, by satisfying the condition set by (101) for the upper limit of $K$, the degradation in the phase margin should not significantly affect the transient response of the amplifiers; here the phase margins of the FC, RFC1 and RFC2 are $80.6^{\circ}, 62.5^{\circ}$ and $75.1^{\circ}$ respectively.

For the transient response shown in Fig. 54, the input signal was a 500 mVpp 10 MHz pulse with a common mode level of 450 mV . This results in an output of 1 Vpp centered at $900 \mathrm{mV}\left(V_{D D} / 2\right)$ using the configuration in Fig. 52(b). Undoubtedly, RFC1 has a superior slew rate performance than FC as seen in Fig. 54(a). RFC2 too has a better slew rate performance, which is seen more clearly in Fig. 54(b) as a higher peak output current. Moreover, the settling behavior of both RFC1 and RFC2 was not affected by the phase margin degradation in comparison to FC.


Time (s)
Fig. 54. Amplifiers transient response; (a) output voltage and (b) total output current.

The input offset was measured by applying a 450 mV DC input and observing the output in a Monte Carlo simulation. The offset was extracted using (128), where $\beta_{F B}$ is the feedback factor and $A$ is the amplifier's DC gain, and used to create Fig. 55. While the offset standard deviation is similar for both FC and RFC1, it is larger for RFC2 as demonstrated by (127). Also, a non-zero mean is observed in Fig. 55 due to systematic offset, not random process variations. This systematic offset has two components. First, there is a finite current drawn by the load $(2 R)$, which causes an imbalance in the
currents of the output devices. Second, the feedback network sets the output to 900 mV , while ideally Vout should follow the gate voltage of M9; in the single-ended version of the amplifiers, M9 in Fig. 49 and Fig. 50 is diode connected to bias M10. This systematic offset is more evident in FC because of the lower DC gain.

$$
\begin{equation*}
V_{O S}=\operatorname{Vout}\left(\beta_{F B}+\frac{1}{A}\right)-\operatorname{Vin} \tag{128}
\end{equation*}
$$



Fig. 55. Input offset distributions; (a) conventional folded cascode, (b) recycling folded cascode 1 and (c) recycling folded cascode 2.

As for noise, Fig. 56 shows the PSD of the input referred noise for the FC, RFC1 and RFC2 over the bandwidth $1 \mathrm{~Hz}-100 \mathrm{MHz}$. Note how the flicker noise dominates the noise performance of advanced CMOS technologies as discussed in Chapter IV. From Fig. 56 we can see how the RFC1 has a better noise performance than the FC, and RFC2 clearly has a worse noise performance as predicted by (125) and (126).


Fig. 56. Input referred noise power spectral density.

A summary of the discussed results is shown in Table 4. In addition, the $0.1 \%$ settling time is also included to demonstrate that indeed the modifications proposed to the FC amplifier do enhance the overall general performance.

TABLE 4
Amplifier Characterization Results

| Parameter | FC | RFC1 | RFC2 |
| :---: | :---: | :---: | :---: |
| Power $[\mu \mathrm{A}]$ | 796 | 782 | 394 |
| DC Gain [dB] | 52.63 | 60.91 | 59.32 |
| GBW [MHz] | 106.3 | 197.2 | 105.9 |
| Phase Margin [deg] | 80.6 | 62.5 | 75.1 |
| Capacitive Load [pF] | 3.6 | 3.6 | 3.6 |
| Slew Rate (average) $[\mathrm{V} / \mu \mathrm{s}]$ | 99.3 | 231.1 | 116.5 |
| 0.1\% Settling Time $[\mathrm{ns}]$ | 21.7 | 11.6 | 21.7 |
| Input Offset $(3 \sigma)[\mathrm{mV}]$ | 7.917 | 7.635 | 11.079 |
| Input Noise (1Hz-100MHz) $[\mu \mathrm{Vrms}]$ | 53.16 | 48.48 | 69.71 |

## D. The Parameter $K$

Thus far, the analysis and results have been for a single value of $K$, which was chosen to be 3 . This choice was solely made to minimize device changes and keep the area and power consumption of the RFC1 equal to the FC, and hence served as a good example for comparison. However, $K$ can take any value greater than 1 . The choice of $K$ is design specific, and hence is left to the designer's judgment. Nonetheless, here are some guidelines.

By steadily increasing $K$ we get a:

- Larger transconductance
- Higher slew rate
- Degrading phase margin (best phase margin achieved at $K=1$ )
- Degrading noise performance (minimum noise achieved at $\mathrm{K}=1$ )
- Degrading offset performance (minimum offset achieved at $\mathrm{K}=1$ )


## E. Conclusions

It has been demonstrated, both in theory and simulation results, that the proposed modifications to the conventional folded cascode, using the same area and power budgets, can boost the gain, bandwidth and slew rate without affecting the noise performance or introducing excess offset. On the other hand, by using half the area and half the power budgets of the conventional folded cascode, the proposed amplifier is still capable of delivering the same or even better dynamic performance, but on the expense of a 1.7 dB noise increase and up to $40 \%$ added offset.

## CHAPTER VI

## COMMON MODE FEEDBACK*

The use of fully-differential amplifiers has several advantages over their singleended counterparts; an increased output signal range, a better phase response and high tolerance to common mode noise are a few examples. Unlike single-ended amplifiers, however, their output common mode (CM) level-the level about which the signal alternates-is not well defined.


Fig. 57. The differential pair amplifier; (a) single-ended and (b) fully-differential.

Consider the single-ended amplifier in Fig. 36(a), and suppose an external feedback network is designed to set the input terminals at $V_{I C M}$ and the output at $V_{O C M}$.

[^2]The current generated by the $V_{I C M}$ through M1 creates a voltage drop across the diode connected transistor M3, which in turn dynamically biases M4. This dynamic bias ensures that the difference of the current through M2 and M4 flowing into the amplifier load will generate $V_{\text {OCM }}$.

On the other hand, if the differential version of the same feedback network is applied to the fully-differential amplifier in Fig. 57(b), and $M 3$ and $M 4$ had a same fixed gate bias $V_{B}$, then the outputs will drift and saturate to one of the supply rails rendering the amplifier useless. This happens because the difference between the currents generated in M1 and M2 by $V_{I C M}$ from that generated in $M 3$ and $M 4$ by $V_{B}$ is not necessarily what is needed to establish and sustain $V_{O C M}$ at the output. Hence, additional feedback is needed to dynamically bias $M 3$ and $M 4$ and set the output CM level to $V_{\text {OCM }}$. This is known as common mode feedback (CMFB), and a general fully-differential amplifier representation with CMFB is given in Fig. 58.


Fig. 58. Fully differential amplifier with CMFB.

In Fig. 58, a CM detector is used to extract the CM level of the output, which given its differential nature is a simple averaging process. In order to capitalize on the output signal headroom of fully-differential amplifiers, the output CM is generally centered between the supply rails, especially in low voltage applications. This is achieved by a comparison with $V_{O C M}$ where the outcome is fed back to the amplifier to adjust its bias levels; in the case of the amplifier in Fig. 57(b), $V_{C M F B}$ replaces the bias $V_{B}$ of $M 3$ and $M 4$ so that their currents adjust to match those of $M 1$ and $M 2$ and set $V_{O C M}$ as the CM output.

## A. Continuous Time vs. Switched Capacitor CMFB

CMFB implementations can be classified under two main categories, continuous time and SC. Generally, a SC CMFB is favored in a SC system such as a pipeline ADC since the clock phases necessary for its operation are preexistent. However, the preferred choice of SC CMFB is not just based on mere convenience. Continuous time CMFB is based on transistors which increase the power budget, and often use current mirrors which introduce extra poles in the CMFB loop that effectively reduce its bandwidth and stability [81]-[82]. These shortcomings are mostly alleviated in SC CMFB.

## B. Switched Capacitor CMFB

The most commonly used SC CMFB implementation is presented in Fig. 59 [83]. In this implementation capacitors $C_{2}$ perform the CM detection by simply averaging the output voltage. As for the comparison with $V_{O C M}$ as seen in Fig. 58 it is performed in two
phases. During $\Phi_{I}$ capacitors $C_{l}$ are pre-charged to the difference of the desired output CM level $V_{O C M}$ and the ideal voltage $V_{\text {REF }}$ needed to bias the current sources/sinks in the amplifier to achieve the desired level. In $\Phi_{2}$ the charge of $C_{1}$ is shared with $C_{2}$ such that the voltage across $C_{2}$ becomes $V_{O C M}-V_{\text {REF }}$. Should the average of Vout + and Vout- be different than $V_{O C M}$ by some $\delta$, then $V_{C M F B}$ is shifted by the same $\delta$ from $V_{R E F}$ which represents a unity gain comparison. This repeats over a few cycles where eventually $\delta$ approaches 0 if the common mode loop gain is sufficiently large.


Fig. 59. Classical switched capacitor CMFB circuit.

There are some considerations necessary for an efficient implementation of a SC CMFB. Primarily, we consider the CMFB loop bandwidth. The capacitors $C_{2}$ increase the differential mode (DM) load and therefore should be minimized so as not to significantly slow down the amplifier. However, as $V_{C M F B}$ biases the gate of a current source/sink, there is a finite parasitic capacitance $C_{P}$ at that node. $C_{P}$ along with $C_{2}$ determine the high frequency feedback factor $\beta$ of the CMFB loop where $\beta=2 C_{2} /\left(2 C_{2}\right.$ $\left.+C_{P}\right)$. To maximize the CMFB loop bandwidth, $\beta$ needs to be maximized which creates
a trade-off between the DM bandwidth and CMFB loop bandwidth. Fortunately, while it is beneficial to have a fast CMFB loop, it is not necessary for it to be comparable to the differential mode (DM) loop. This is because CM disturbances are generally small and the accuracy at which the CM is set can be an order of magnitude lower than that needed by the DM. Additional considerations pertaining to the time needed to reach stead-state and the absolute steady-state conditions are detailed in [84].

## C. Dual Level CMFB

The direct auto-zeroing technique [41] presented in Chapter III is a reliable and power conscious method to eliminate the low frequency disturbances and amplifier DC offset in applications where the amplifier is reset in one of the clock phases. Some examples of circuits using the direct auto-zeroing technique to reduce amplifier offset are the $\mathrm{S} / \mathrm{H}$ and MDAC shown in Fig. 60, which are extensively used in pipeline ADCs. The output of the $\mathrm{S} / \mathrm{H}$ and MDAC during $\Phi_{2}$ are given by (129) and (130) respectively, where $V_{O S}$ is the amplifier offset, $M$ is the effective number of bits resolved in the pipeline stage and $\beta_{\Phi 2}$ is the amplifier feedback factor during $\Phi_{2}$. Hence the output is virtually offset free for a sufficiently high open loop gain, $A$.

$$
\begin{gather*}
V_{O U T, S H}=\alpha_{1} V_{I N}+\frac{\alpha_{1} V_{O S}}{(1+A)}, \quad \alpha_{1}=\frac{A}{1+A}  \tag{129}\\
V_{O U T, M D A C}=\alpha_{2} 2^{M} V_{I N}+\frac{\alpha_{2} 2^{M} V_{O S}}{(1+A)}, \quad \alpha_{2}=\frac{\beta_{\Phi 2} A}{1+\beta_{\Phi 2} A} \tag{130}
\end{gather*}
$$



Fig. 60. Direct auto-zeroing offset cancellation in (a) flip-around S/H and (b) pipeline stage MDAC.

In low voltage applications, however, amplifiers are designed such that $V_{D D}$ is in the order of $V_{G S}+2 V_{G S T}$, which forces the input and output $\mathrm{CM}, V_{I C M}$ and $V_{O C M}$, to be different as illustrated in Fig. 61 for a folded cascode, and prohibits the use of direct auto-zeroing. Hence, current low voltage examples of pipeline ADCs [85]-[87] rely solely on digital correction to overcome the amplifier input offset, which translates to a comparator offset in the next stage. Instead of resetting the amplifier during $\Phi_{1}$, the input and output terminals are shorted to the appropriate $V_{I C M}$ and $V_{O C M}$ so the amplifier is properly biased during $\Phi_{2}$. The offset then appears at the output without reduction-the term $(1+A)$ is eliminated from (129) and (130)—and is most severe for the MDAC
where in fact it is amplified; a 10 mV input offset in a 2.5 bits/stage pipeline stage MDAC is seen as 40 mV at the output and leaves little room for comparator offsets to be corrected. As it is quite common for non-calibrated pipeline ADCs to utilize high resolution stages at the beginning of the pipeline to overcome capacitor mismatch, offset cancellation becomes necessary. In this dissertation, a dual level CMFB is presented to alleviate this issue.


Fig. 61. Different input and output CM in a low voltage folded cascode amplifier.

In Fig. 62(a) an improved implementation [88] of the classical SC CMFB is shown, which has the advantage of balancing the load in applications where the amplifier is used in both clock phases. The drawback, however, is the added switches and capacitors. By removing $C_{2}$ and introducing two CM levels, $V_{I C M}$ and $V_{O C M}$, a dual level CMFB is achieved and is shown in Fig. 62(b). The operation of the dual level CMFB is as follows. During $\Phi_{1}$ the output CM is set to $V_{I C M}$. Thus by resetting the
amplifier as in Fig. 60, the offset is sampled and the amplifier is biased properly in preparation for $\Phi_{2}$. During $\Phi_{2}$ the output CM is switched to $V_{O C M}$ to maximize the output swing. Hence, direct auto-zeroing is reinstated to actively cancel the offset, despite the different input and output CM of low voltage applications.


Fig. 62. Switched capacitor CMFB circuits; (a) improved classical circuit and (b) dual level CMFB.

Such dynamic change in the CM output is simulated in TSMC $0.18 \mu \mathrm{~m}$ CMOS for the fully-differential implementation of the $\mathrm{S} / \mathrm{H}$ shown in Fig. 60(a), using the recycling folded cascode amplifier with gain boosting and a supply of 1.2 V at sampling rate of $100 \mathrm{MS} / \mathrm{s}$. The $\mathrm{S} / \mathrm{H}$ has a total single-ended capacitive load of 1.4 pF . The
waveforms in Fig. 63 show the change in $V_{C M F B}$ and output CM for each clock phase; $V_{I C M}$ is around 300 mV to optimize the biasing of a PMOS driver, and $V_{\text {OCM }}$ is 600 mV to maximize the output swing.


Fig. 63. Dynamic change in output common mode using a dual level CMFB circuit; (a) $V_{C M F B}$ and (b) output CM.

For the CM step of 300 mV shown here, the amplifier can completely switch CM levels in less than 2 ns within $1 \%$ accuracy. An input differential offset of 50 mV was imposed in the schematic similar to Fig. 60(a), and Fig. 64 shows the input and sampled output signals demonstrating effective offset sampling in the reset phase and cancellation in the hold phase for a 500 mVpp 12.6 MHz signal.


Fig. 64. Sampled signal in a flip-around SHA showing 50 mV sampled offset and offset free held output.

## D. Design Concerns of Dual Level CMFB

Similar to the classical SC CMFB, there are some considerations with the dual level CMFB. These considerations can be either static and thus related to the steady-state performance, or dynamic and are best analyzed from the frequency domain perspective.

## 1. Static Performance

The principal function of the dual level CMFB is to change the CM output level for each clock phase thereby enabling the use of direct auto-zeroing in low voltage applications. When setting the output CM level to $V_{I C M}$, the $V_{D S}$ drop across the output devices is reduced and so is the gain, which may imply a reduced efficiency in offset
reduction according to (129) and (130). This, however, is not a major issue for multistage amplifiers as the gain is mostly provided by the first stage(s) and the last stage is designed for rail-to-rail swing. As for single stage cascode amplifiers, gain boosting techniques are necessary for small settling errors and can be designed to provide sufficient gain even when the output is at $V_{I C M}$; a gain of only 40 dB reduces an effective output offset of 50 mV down to $500 \mu \mathrm{~V}$, which has negligible impact on the accuracy of a pipeline stage even at high bits/stage resolution.

Another concern arises from potential different amplifier offsets when the output CM is set to $V_{I C M}$ and $V_{O C M}$. Again, multi-stage amplifiers are immune to such change, because the last stage contributes negligible offset and isolates the output from the previous stage, thus maintaining the same amplifier offset for both output CM values. Single-stage amplifiers, however, may be affected. It was demonstrated in Chapter IV that the offset contribution of cascode devices is negligible as long as the devices directly connected to the supply rails are kept in saturation. This is further affirmed when gain boosting is used. Therefore, as long as $V_{I C M}$ is greater than one $V_{G S T}$ away from the supply rails such that the rail devices are kept in saturation, the offset of single-stage amplifiers will be virtually the same when the CM is set to either $V_{I C M}$ or $V_{O C M}$.

## 2. Dynamic Performance

The cost of removing $C_{2}$ (see Fig. 62) means that for the brief time of clock nonoverlap, the amplifier has no CM control. However, high sampling rates have 100-200ps of clock non-overlap, which minimize the risk by allowing for minor CM drifts. Another
aspect of the removal of $C_{2}$ is the loss of the direct high frequency $C M$ path. Instead, the finite resistance of the switches added in series with either $C_{1 a}$ or $C_{l b}$ has a mild low pass filter effect, which can be optimized with proper switch and capacitor sizing without adversely affecting the CMFB bandwidth. Moreover, the CM feedback factor given here by $\beta=2 C_{1 a, b} /\left(2 C_{1 a, b}+C_{P}\right)$, where $C_{P}$ represents the parasitics at $V_{C M F B}$ can be optimized to the best compromise in speed and stability.

An added benefit of a CMFB circuits is the additional CM noise reduction they provide to fully-differential amplifiers. However, amplifiers using SC CMFB have been demonstrated to be vulnerable to poor power supply rejection (PSR) performance, [83] and [89], and therefore the PSR of the dual level CMFB is analyzed and compared to the classical implementation. To begin, a fully-differential folded cascode amplifier with CMFB circuit connections is shown in Fig. 65. The CMFB can be substituted with the classical implementation of Fig. 59, or the dual level CMFB of Fig. 62(b). Also shown in Fig. 65 are simple implementations of how to generate $V_{\text {REF }}, V_{O C M}$ and $V_{I C M}$.


Fig. 65. Fully-differential folded cascode amplifier with CMFB connections.

For the small signal analysis, consider the simplified half-circuit model of Fig. 65 given in Fig. 66 for the classical and dual level CMFB circuits. The amplifier $A_{C M}$ has $M 5$ as input driver and shares the same load as the folded cascode. A noise source, $v_{n}$, is imposed on $V_{D D}$ and a portion leaks to $V_{R E F}, V_{O C M}$ and $V_{I C M}$ with factors of $\alpha$ and $\gamma$. The CM output noise is denoted by $v_{\text {ocm }}$. By conserving charge between clock phases, the PSR of the classical and dual level CMFB circuits seen at $\Phi_{2}$ can be expressed by (131) and (132) respectively. The PSR is defined here as $v_{o c m} / v_{n}$, which is desired to be $\ll 1$.


Fig. 66. Half-circuit small signal model of CMFB loop; (a) classical CMFB, and (b) dual level CMFB.

$$
\begin{align*}
& {P S R_{\text {Classic }}=}_{\left(1+\frac{1}{A_{C M}}\right)\left[\left(C_{1}+C_{2}\right)+\frac{C_{P}\left(C_{1}+C_{2}\right)}{2 C_{2}\left(1+A_{C M}\right)}-z^{-1 / 2}\left(\frac{C_{P} C_{1}}{2 C_{2}\left(1+A_{C M}\right)}\right)-z^{-1}\left(C_{2}+\frac{C_{P}}{2\left(1+A_{C M}\right)}\right)\right]}^{P S R_{d l, \text { CMFB }}=\frac{C_{1 b}-z^{-1 / 2}\left[C_{1 b}\left(\alpha-\gamma_{b}\right)-C_{1 a}\right]-z^{-1} C_{1 a}\left(\alpha-\gamma_{a}\right)}{\left(1+\frac{1}{A_{C M}}\right)\left[C_{1 b}+\frac{C_{P}}{2\left(1+A_{C M}\right)}+z^{-1 / 2} C_{1 a}-z^{-1}\left(\frac{C_{P}}{2\left(1+A_{C M}\right)}\right)\right]}}
\end{align*}
$$

However, for sufficiently high gain, (131) and (132) can be simplified and given by (133) and (134).

$$
\begin{gather*}
\left.P S R_{\text {Classic }}\right|_{A_{C M} \gg 1}=\frac{\left(C_{1}+C_{2}\right)-z^{-1}\left[C_{1}(\alpha-\gamma)+C_{2}\right]}{\left(C_{1}+C_{2}\right)-z^{-1} C_{2}}  \tag{133}\\
\left.P S R_{d l, C M F B}\right|_{A_{C M} \gg 1}=\frac{C_{1 b}-z^{-1 / 2}\left[C_{1 b}\left(\alpha-\gamma_{b}\right)-C_{1 a}\right]-z^{-1} C_{1 a}\left(\alpha-\gamma_{a}\right)}{C_{1 b}+z^{-1 / 2} C_{1 a}} \tag{134}
\end{gather*}
$$

To compare the low frequency performance of both CMFB circuits we substitute $z=1(\omega=0)$; the results are given in (135) and (136), and indicate the same modest PSR performance for $C_{l a}=C_{l b}$. It is worth to mention, however, that the leakage of $v_{n}$ onto $V_{\text {REF }}$ defined by $\alpha$, is very close to unity for a wide bandwidth, and hence the PSR performance is dominated by the leakage of $v_{n}$ onto $V_{O C M}$, defined by $\gamma$. In Fig. 65, $V_{O C M}$ was generated by a resistive voltage divider, which transfers half of $v_{n}$ onto $V_{O C M}$ resulting in poor PSR. If on the other hand $V_{O C M}$ was generated from a quite source, the PSR of the SC CMFB can be greatly improved. In Fig. 67 the PSR of the dual level

CMFB was simulated using transient periodic steady state analysis; Fig. 67(a) shows $\alpha$, $\gamma_{a}$ and $\gamma_{b}$ versus frequency up to $f_{S} / 2$, and Fig. 67(b) shows the PSR of different noise combinations onto $V_{I C M}$ and $V_{O C M}$ for $C_{1 a}=C_{l b}$. Clearly, a quite $V_{I C M}$ and $V_{O C M}$ can significantly improve the PSR performance.

$$
\begin{gather*}
\left.P S R_{\text {Classic }}\right|_{\omega \rightarrow 0}=1-\alpha+\gamma  \tag{135}\\
\left.P S R_{d l, C M F B}\right|_{\omega \rightarrow 0}=\frac{C_{1 b}\left(1-\alpha+\gamma_{b}\right)+C_{1 a}\left(1-\alpha+\gamma_{a}\right)}{C_{1 b}+C_{1 a}} \tag{136}
\end{gather*}
$$



Frequency (Hz)
Fig. 67. Effects of supply noise components on the PSR of the dual level CMFB.

As for the high frequency PSR performance, the parasitics and the reduced amplifier gain will have an impact. The high frequency PSR can be found by substituting $z=-1(\omega=\infty)$ in (131) and (132) once all powers of $z$ have been changed to positive integers. The results are given by (137) and (138) respectively. Note that both CMFB circuits have again almost the same performance where $1+\alpha-\gamma$ approaches unity at high frequencies in agreement with Fig. 67; $\gamma$ approaches unity because of the deglitching capacitor $C_{D}$ in Fig. 65.

$$
\begin{gather*}
\left.P S R_{\text {Classic }}\right|_{\omega \rightarrow \infty}=\frac{1-\alpha+\gamma}{\left(1+\frac{1}{A_{C M}}\right)\left(1+\frac{C_{P} C_{1}}{C_{2}\left(1+A_{C M}\right)}\right)}  \tag{137}\\
\left.P S R_{d l, C M F B}\right|_{\omega \rightarrow \infty}=\frac{C_{1 b}\left(1+\alpha-\gamma_{b}\right)-C_{1 a}\left(1+\alpha-\gamma_{a}\right)}{\left(1+\frac{1}{A_{C M}}\right)\left(C_{1 b}-C_{1 a}\right)} \cong \frac{1+\alpha-\gamma_{a, b}}{\left(1+\frac{1}{A_{C M}}\right)} \tag{138}
\end{gather*}
$$

## E. Conclusions

The presented dual level CMFB aids the robust implementation of direct autozeroing offset cancellation techniques under low voltage conditions, and its effectiveness has been demonstrated with a S/H example. The static errors of the dual level CMFB are negligible so long as sufficient amplifier gain is maintained. As for the dynamic performance, speed optimization techniques were discussed, and the PSR was analyzed and shown to be similar to that of the classical CMFB.

## CHAPTER VII

## COMPARATORS

Comparators are functional blocks in many analog circuits, particularly pipeline ADCs. In Chapter III, it was demonstrated that the sub-ADC block of a pipeline cell is predominately based on a group of comparators with a handful of supplemental logic gates. In this chapter we discuss the circuit implementation of comparators with a focus on their uses in pipeline ADCs.

## A. Comparator Architectures

The main function of a comparator is to amplify the difference between two signals to a level large enough to be detected by subsequent digital circuitry in a very short time. The first architectural difference among comparators is the input signal type, which can be either voltage or current in nature, but in the case of pipeline ADCs voltage comparators are the most widely used. Another architectural difference is the method by which the input difference is amplified to reach logic values, which classifies comparators into amplifier-type or latch-type.

## 1. Amplifier-Type Comparators

Amplifiers are most commonly used in a closed loop configuration, but if used in open loop can also function as comparators. However, amplifier-type comparators seldom find use in high speed applications. First, the amplifier gain is strongly related to
its output swing, which degrades significantly near rail-to-rail operation necessary to drive subsequent digital circuits; this may require cascading of several amplifiers to achieve the necessary gain, thereby needlessly increasing the complexity, area and power consumption of the comparator. Second, the amplifier is ultimately limited by its slew rate in charging/discharging its load; to achieve a faster rail-to-rail transition will generally dictate a higher bias current. Therefore, amplifier-type comparators are rarely seen in practical application, apart from slow and low cost designs [90].

## 2. Latch-Type Comparators

At the heart of a latch-type comparator is a pair of back-to-back inverters-a latch-from which this comparator type gets its name. Latch-type comparators are generally faster than amplifier-type comparators due to the positive feedback loop of the latch. To briefly examine how a latch-type comparator functions, consider the conceptual schematic in Fig. 68(a). Focusing on the latch alone, and assuming the following small-signal parameters for the inverters: transconductance $g m_{\text {Latch }}$, conductance $g_{O, \text { Latch }}$ and output capacitance $C_{\text {Latch }}$, we can write the following KCL equations at the output of each inverter.

$$
\begin{align*}
& g m_{\text {Latch }} V_{A}+g_{o, \text { Latch }} V_{\bar{A}}+C_{\text {Latch }} \frac{d V_{\bar{A}}}{d t}=0  \tag{139}\\
& g m_{\text {Latch }} V_{\bar{A}}+g_{o, \text { Latch }} V_{A}+C_{\text {Latch }} \frac{d V_{A}}{d t}=0 \tag{140}
\end{align*}
$$

Equations (139) and (140) can be combined as seen in (141), where $V_{\Delta}=V_{A}-V_{\bar{A}}$. When integrated, (141) results in (142) where $\tau_{\text {Latch }}$ represents the latch time constant. Therefore, to enhance the speed of the latch, one needs to minimize the load capacitance at the nodes $V_{A}$ and $V_{\bar{A}}$ and maximize the transconductance of the inverters; with proper design, decision times in the order of 100s of pico seconds can be achieved.

$$
\begin{align*}
& \left(g m_{\text {Latch }}-g_{o, \text { Latch }}\right) V_{\Delta}-C_{\text {Latch }} \frac{d V_{\Delta}}{d t}=0  \tag{141}\\
& V_{\Delta}=V_{\Delta}(0) e^{\frac{\left(g m_{\text {Latch }}-g_{o, \text { Latch }}\right)}{C_{\text {Latch }}} t}=V_{\Delta}(0) e^{\frac{t}{\tau_{\text {Latch }}}} \tag{142}
\end{align*}
$$


(a)

(b)

Fig. 68. Latch-type comparator; (a) conceptual schematic and (b) time-domain waveforms.

The exponential behavior of the latch described by (142) is triggered by the input pair M1 and M2; according to the values of $V 1$ and $V 2$, a small differential current is
generated which causes an imbalance in the charge held by the output capacitance at $V_{A}$ and $V_{\bar{A}}$ thus triggering the exponential behavior of the latch. The resulting time-domain waveforms of $V_{A}$ and $V_{\bar{A}}$ are shown in Fig. 68(b); beginning at some initial value $V_{A}(0)$, $V_{A}$ and $V_{\bar{A}}$ start drifting exponentially towards opposite supply rails. This exponential behavior eventually dampens as the inverters saturate at either rail due to the conductance $g_{O, L a t c h}$ increasing and transconductance $g m_{\text {Latch }}$ diminishing. Once the latch reaches a stable condition, it will retain that state until reset for a new comparison to take place. Therefore, latch-type comparators are naturally clocked comparators and are commonly followed by an SR latch to preserve their output after reset for post processing while waiting for a new comparison event to take place.

Because of their high speed operation and fairly low power consumption, latchtype comparators are preferred over amplifier-type comparators, and are extensively used in ADCs, particularly in Pipeline and Flash ADCs. The following sections are devoted to the sub-division of latch-type comparators: Static and Dynamic.

## B. Static Comparators

Static comparators continually conduct a fixed and known bias current regardless of the comparator state; reset or latched. There are several topologies that fall under this category, and are easily classified as class $A$ or class $A B$ output similar to the comparator architectures of Fig. 69.

## 1. Class A Output

First, let us consider the class A output comparator in Fig. 69(a) [91]-[93]. When $V_{\text {Latch }}$ is HIGH, M1-M4 pre-amplify the difference $V_{I P}-V_{I N}$ and the result is injected as a differential current via M5-M6 and triggers the latch M7-M8. When $V_{\text {Latch }}$ goes LOW, $S 1$ and $S 2$ pull the comparator outputs to $V_{D D}$, while $S 3$ blocks the current path in the latch. Hence, during either phase the steady state current flowing through the comparator is $I_{B}$. This topology is fairly fast and can achieve a low input offset by optimizing the operating points and dimensions of M1-M4. A simpler class A output comparator topology can be implemented by feeding the output of M1-M2 directly into a PMOS cross-coupled load. However, this approach eliminates the preamplifier and thus makes the comparator slower and more vulnerable to kickback noise as will be demonstrated in later sections.


Fig. 69. Static latch-type comparators; (a) class A and (b) class AB output.

## 2. Class AB Output

Now we consider the class AB output comparator in Fig. 69(b) [94]-[98]. When $V_{\text {Latch }}$ is HIGH, M1-M2 convert the difference $V_{I P}-V_{I N}$ into current, which is directly injected into the output nodes thereby triggering the latch M3-M6. Here, and unlike Fig. 69(a), the latch is implemented in a true push-pull manner and hence is class AB output. When $V_{\text {Latch }}$ goes LOW, $S 1$ shorts the output nodes, while $S 2$ blocks the current path in the latch. The outputs finally settle at roughly $V_{D D}-V_{T(3,4)}$. For low voltage applications, this may not be large enough to be considered logic HIGH, and therefore a reset mechanism similar to that of Fig. 69(a) may be used. As for the power consumption, and similar to the class A output comparator, the steady state current during either phase flowing through the comparator of Fig. 69(b) is $I_{B}$.

An advantage of the class AB output over the class A is the shorter regeneration time in the latch due to a larger transconductance- $\left(g m_{3}+g m_{5}\right)$ for class AB vs. $g m_{7}$ only for class A output-which makes it a faster comparator. However, class AB output comparators may suffer worse input referred offset than class A due to the larger latch transconductance. Nonetheless, because pipeline cells use digital redundancy as in the 1.5 bits/stage, they are fairly immune to comparator offsets, which makes class AB output comparators the more attractive choice.

## C. Dynamic Comparators

The principal drawback of static comparators is the constant use of power regardless of the comparator being latched or reset. While the power consumed per
comparator could be small, pipeline ADCs may have +20 comparators whose total power may amount to a significant value. Dynamic comparators aim to resolve this inefficiency by consuming power only when needed, and like static comparators there are several variants of dynamic comparators, but virtually all have the same basic core as either architecture of Fig. 70 [99]-[101].


Fig. 70. Dynamic latch-type comparators; (a) resistive and (b) differential pair input.

## 1. Resistive Divider Input

We begin by analyzing the operation of the resistive divider dynamic comparator in Fig. 70(a). When $V_{\text {Latch }}$ is LOW, $S 1$ and $S 2$ pull the comparator outputs and thus the gates of M3-M6 to $V_{D D}$, while $S 3$ and $S 4$ ensures that $V_{D S(1-4)}=0$, and hence $V_{G S(3,4)}=$ $V_{D D}$. Note that in this operation mode, the comparator conducts no current. As $V_{\text {Latch }}$ goes HIGH, S1 and $S 2$ release the gates of M3-6 while $S 3$ and $S 4$ pull the drains of M3 and M4 towards $V_{D D}$. At this stage, $M 3$ and $M 4$ enter saturation mode. However, $M 1$ and

M2 are operating in triode and act as voltage controlled resistors. Assuming that M3-M6 are all well-matched, the imbalance of the voltage drop across the $r_{d s}$ of $M 1$ and $M 2$ is amplified by $M 3$ and $M 4$ thereby triggering the latch.

Once the regeneration is complete, one of the comparator outputs is held at $V_{D D}$ while the other is at $G N D$, and the comparator conducts no current. If the $V_{\text {Latch }}$ signal had a $50 \%$ duty cycle, then the comparator conducts current no more than $50 \%$ of the time, which is a significant reduction in power consumption compared to the static latchtype comparators. Naturally, however, the sizing of the devices will dictate the current conducted when the comparator is in the regeneration phase and needs to be limited.

As for its input referred offset, the resistive divider dynamic comparator performance is modest at best. Transistors operating in triode are poorly matched as they are very sensitive to $V_{D S}$ and process variations. Moreover, triode devices M1 and M2 have very small gain-near or smaller than unity—which may amplify any mismatch in M3 and M4 when referred to the input. Nonetheless, because of the inherent digital redundancy in pipeline cells, offsets in the order of 100 mV may be tolerated for 1.5 2.5bits/stage even in low voltage applications, making the resistive divider comparator a viable option for its low power consumption.

## 2. Differential Pair Input

Now consider the differential pair dynamic comparator in Fig. 70(b), neglecting $S 3$ and $S 4$. In the reset phase, when $V_{\text {Latch }}$ is LOW, the outputs are held at $V_{D D}$ and hence $M 3$ and M4 are configured in a diode connection, but conduct no current because $S 3$ is
off. Also, and due to the diode connection, the sources of M3 and M4 and drains of M1 and $M 2$ are held at approximately $V_{D D}-V_{T(3,4)}$. The moment $V_{\text {Latch }}$ turns HIGH, M1-M4 are all in saturation and $M 1$ and $M 2$ act as a differential pair steering current to either output according to their inputs, and triggering the latch.

Compared to the resistive divider comparator, the power consumption of the differential pair is quite similar; both comparators consume current only in the regeneration phase. However, the differential pair input is superior as far as input referred offset is concerned, and its performance is comparable to that of static comparators. This enhanced performance is primarily attributed to $M 1$ and $M 2$ being in saturation when the comparator is latched. Moreover, with the inclusion of $S 3$ and $S 4$, the input referred offset of the differential pair comparator is further enhanced; with S3 and $S 4$ included, only $M 1$ and $M 2$ are in saturation when the comparator is latched and hence become the sole dominant contributors to input offset. There is, however, a tradeoff for using the differential pair comparator over its resistive divider counterpart; the resistive divider comparator has a wider input range.

## D. Kickback Noise

The preceding discussion spanned the operation of several comparator architectures and key comparator design issues such as power consumption and input referred offset. Another aspect to be examined here, and perhaps the most critical for high speed and low voltage comparator design, is kickback noise.

The inputs and outputs of a comparator are not perfectly isolated. Thus, the rail-
to-rail transients seen at the output are likely to leak back to the input of the comparator resulting in kickback noise, which can be quite problematic depending on its magnitude and the circuit configuration driving the comparator. In a pipeline ADC for example, the input to a pipeline cell is applied to the comparators of the sub-ADC and sampled by the cell's S/H. Kickback noise generated by the comparators may corrupt the sampled value resulting in conversion errors. This is demonstrated in Fig. 71, where $\boldsymbol{A}$ represents the amplifier of the previous stage in the $H O L D$ phase, with input and feedback capacitors $C_{I}$ and $C_{F}$ respectively; $C_{L}$ represents the sampling capacitor of the following stage; and Comp is a single-path representation of the comparator where the input and output are coupled through capacitor $C_{C}$.


Fig. 71. Kickback noise modeling in a pipeline cell.

Regarding the rail-to-rail transient at the comparator output as a source $V_{I}$, we can find the effects of the kickback noise coupling on the sampled value of the next pipeline cell by evaluating $V_{O}$ across $C_{L}$; this is expressed by (143), where $\beta_{A}=C_{F} /\left(C_{F}+C_{I}\right)$ is the feedback factor of amplifier $\boldsymbol{A}$. The kickback noise transfer
function of (143) has a zero at $\omega=0$ and a high frequency pole $\omega_{p k}$ given by (144). Since the comparator is aimed to react very fast, the location of $\omega_{p k}$ and the maximum magnitude of the kickback noise transfer function given by (145) for which $\omega \gg \omega_{p k}$, are of greater interest.

$$
\begin{gather*}
\frac{V_{O}}{V_{I}}=\frac{s C_{C}}{\beta_{A} g m_{A}+g_{O A}+s\left[C_{C}+C_{L}+C_{F}\left(1-\beta_{A}\right)\right]}  \tag{143}\\
\omega_{p k}=\frac{\beta_{A} g m_{A}+g_{O A}}{C_{C}+C_{L}+C_{F}\left(1-\beta_{A}\right)} \cong \frac{\beta_{A} g m_{A}}{C_{C}+C_{L}+C_{F}\left(1-\beta_{A}\right)}  \tag{144}\\
\left.\frac{V_{O}}{V_{I}}\right|_{\max }=\left.\frac{V_{O}}{V_{I}}\right|_{\omega \gg \omega_{p k}}=\frac{C_{C}}{C_{C}+C_{L}+C_{F}\left(1-\beta_{A}\right)} \tag{145}
\end{gather*}
$$

Through the examination of the modeling results, we make two observations regarding the reduction of kickback noise. First, $\omega_{p k}$ coincides with the amplifier's bandwidth, $\omega_{-3 d B}$. Intuitively, closed loop wide bandwidth amplifiers such as those used in high speed pipeline ADCs force $V_{O}$ to behave as a low impedance node, which has a natural tendency to dissipate kickback noise quickly. This is only true as long as the frequency of the transients caused by kickback noise is below the amplifier's $\omega_{-3 d B}$, which is maximized when $\beta_{A}=1$. Therefore, a possible approach to minimize kickback noise is to design the amplifier's time constant $\tau_{A}$ driving the comparator to be smaller than that of the latch as expressed by (146), and in so doing maximize the amplifier's immunity against kickback noise.

$$
\begin{equation*}
\tau_{A}=\frac{C_{C}+C_{L}+C_{F}\left(1-\beta_{A}\right)}{\beta_{A} g m_{A}+g_{O A}}<\frac{C_{\text {Latch }}}{g m_{\text {Latch }}-g_{O, \text { Latch }}}=\tau_{\text {Latch }} \tag{146}
\end{equation*}
$$

Nevertheless, relying solely on the condition set by (146) to reduce kickback noise may not be practical as it may require increasing the power consumption of amplifier $\boldsymbol{A}$. This leads to the second observation; the kickback noise magnitude is strongly related to the value of the coupling capacitor, $C_{C}$. Indeed, if $C_{C}$ is reduced, so would be the kickback noise, and that is precisely what almost all kickback noise reduction techniques aim to accomplish.

Before we discuss these techniques, however, let us take a closer look at (145). Note that the values of $C_{L}, C_{F}$ and $\beta_{A}$ are all predefined by the architecture used to implement the pipeline cells. Also, it seems that a smaller $\beta_{A}$, and thus a larger number of bits per stage, would minimize the kickback noise. On the other hand, and due to capacitor scaling, $C_{L}$ and $C_{F}$ become increasingly smaller as the number of bits per stage increases, whereas $C_{C}$ remains unchanged as the same comparator architecture is generally used throughout the ADC; this effectively increases the effects of kickback noise in latter stages of the pipeline ADC . Moreover, a smaller $\beta_{A}$ would reduce the amplifier's natural ability to suppress kickback noise by lowering $\omega_{p k}$ as expressed in (144). Therefore, to maximize the pipeline ADC natural immunity against kickback noise induced conversion errors, the use of smaller number of bits per stage is encouraged, especially when an aggressive capacitor scaling approach is used.

## 1. Kickback Noise Reduction in Static Comparators

In addition to the natural approach of kickback noise reduction outside of the comparator, some design techniques are used to physically limit the value of $C_{C}$ and here we discuss the most relevant to static comparators.

First, we examine the preamplifier approach [37], [81]. The kickback noise leakage through $C_{C}$ can be reduced if a preamplifier is inserted between the previous stage amplifier and the latched comparator; this is similar to Fig. 69(a), where M1-M4 represent the preamplifier and is modeled in Fig. 72.


Fig. 72. Kickback noise modeling in a pipeline cell with a preamp driven comparator.

Naturally, deriving the kickback noise transfer function at $V_{O}$ is more complex than that of Fig. 71, but from an intuitive standpoint, $C g d_{l}$ introduces another zero at $\omega$ $=0$ and $g m_{3}$ introduces another low impedance node in the kickback noise path to dissipate it. It is simple, however, to determine the maximum magnitude of the kickback noise transfer function by evaluating the capacitive path represented in solid lines in Fig. 72 ; the result is given by (147).

$$
\begin{equation*}
\left.\frac{V_{O}}{V_{I}}\right|_{\text {max }, \text { w/ preamp }}=\frac{C g d_{1}}{C g d_{1}+C_{L}+C_{F}\left(1-\beta_{A}\right)} \cdot \frac{C_{C}}{C_{C}+C g s_{3}+\frac{C g d_{1}\left(C_{L}+C_{F}\left(1-\beta_{A}\right)\right)}{C g d_{1}+C_{L}+C_{F}\left(1-\beta_{A}\right)}} \tag{147}
\end{equation*}
$$

Note that (147) is quite similar in form to (145), but there is an added term that is always less than unity. This new term is another tool with which the designer may reduce kickback noise. For example, increasing $C g s_{3}$ directly enhances kickback noise reduction. Nonetheless, the use of preamplifiers to drive the comparators has two main drawbacks. First, the preamplifier adds to the power consumption of the overall comparator. Second, the preamplifier introduces another pole in the signal path ( $\omega_{p} \approx$ $\mathrm{gm}_{3} / \mathrm{Cgs}_{3}$ ), which slows the comparator response. Both drawbacks make the use of preamplifiers less favorable in high speed and low power designs, despite their effectiveness in reducing kickback noise.

Another approach to kickback noise reduction commonly seen in static comparators is the passive cross-coupled capacitive neutralization as seen in Fig. 73. In the absence of a preamplifier, $C g d$ in Fig. 73 represents the kickback noise coupling path, and since $M 1$ and $M 2$ are generally in saturation it is primarily assumed by the gate-drain overlap capacitance, $C g d_{o v}$. With the addition of a cross-coupled pair half the size of $M 1$ and $M 2$, an alternate path for kickback noise is created that counteracts the effect of $C g d$. Thus the superposition effect of $V_{\bar{A}}$ through $C g d_{1}$ and that of $V_{A}$ through $(M 1) / 2$ is ideally zero, which leaves $V_{1}$ unaffected. The same applies to $V_{2}$.

There are, however, some limitations to the use of the cross-coupling neutralization technique in practice as mismatch between of $M 1,2$ and (M1,2)/2 is
unavoidable; physical mismatch in size is a limitation induced by process variations, which leads to imperfect kickback noise cancellation. Moreover, saturation conditions in short channel devices are no longer characterized by the classical channel pinch-off where charge vanishes at the drain [56]. Therefore $C g d_{l, 2}$ can be considerably different from $C g d_{o v}$ provided by $(M 1,2) / 2$ creating an even greater mismatch and consequently only modest kickback noise reduction.


Fig. 73. Passive cross-coupled capacitive neutralization technique.

Finally, an additional technique to reduce the effects of kickback noise is the insertion of cascode devices inline with the differential pair of the comparator as pictured in Fig. 74. Cascode devices are commonly used in amplifiers to enhance their gain by limiting the output induced signal variations across the $r_{d s}$ of the device they are isolating. The same concept can be extended to comparators, but to isolate $C g d$ of M1 and $M 2$ from the output. Thus, the effect of $V_{\bar{A}}$ on the drain of $M 1$ and $V_{A}$ on the drain of $M 2$ is initially reduced by the gains of $M C 1$ and $M C 2$ respectively. Eventually, $M C 1$ and $M C 2$ enter triode and lose their gain as $V_{A}$ and $V_{\bar{A}}$ reach the supply rails. Therefore, it is
crucial to ensure the bulk terminals of $M C 1$ and $M C 2$ are grounded and not shorted to their sources. This guarantees a resistive path, however small, between the drains and sources of $M C 1$ and $M C 2$ that impedes the propagation of kickback noise to $V 1$ and $V 2$. Otherwise, $C d b_{C 1}$ and $C d b_{C 2}$ create an easy high frequency path for the kickback noise to propagate to the inputs of the comparator.


Fig. 74. Kickback noise isolation using cascode devices.

While perhaps the most effective approach to kickback noise reduction, the use of cascode devices shares a drawback with the preamplifier approach, and that is the insertion of an additional pole $\left(\omega_{p} \approx g m_{C l} / C g s_{C l}\right)$ in the signal path which slows the response of the comparator. Moreover, the use of cascode devices may be problematic as voltage supplies continue to shrink. Nonetheless, when used together, as is the common practice, the preceding techniques can significantly reduce the kickback noise of latchtype static comparators.

## 2. Kickback Noise Reduction in Dynamic Comparators

The techniques described in the previous section are not strictly associated with static comparators, but can also be used to with dynamic comparators. However, because of the different mechanisms of operation and propagation of kickback noise in dynamic comparators compared to their static counterparts, these techniques may not be very effective or efficient. For example, the use of a static power consuming preamp with a dynamic comparator is counter productive as far as power saving is concerned. This section begins with a description of how kickback noise is generated in dynamic comparators, followed by a presentation of a commonly used sampling-based kickback noise reduction technique, and finally proposes a new power conscious approach to kickback noise cancellation.

In static comparators, the primary leakage path of kickback noise was through the gate-drain capacitance, $C g d$, of the input pair $M 1$ and $M 2$. This is not always the case in dynamic comparators. For instance, Fig. 75(a) shows an enlarged partial schematic of the differential pair dynamic comparator of Fig. 70(b), including explicit representation of the gate-drain parasitics and key waveforms. Unlike the static comparators where the drain potentials of $M 1$ and $M 2$ begin at the same voltage and end at opposite supply rails, thus generating kickback noise, the drain potentials of $M 1$ and $M 2$ in dynamic comparators begin at the same voltage and end at the same supply rail, which is $G N D$ in this case.

Hence, and from a signal perspective, it seems that $C g d_{1}$ and $C g d_{2}$ do not provide a leakage path for kickback noise as they experience the same input step. However, what
is not seen in Fig. 75(a) is the dependency of $C g d_{1}$ and $C g d_{2}$ on their respective input values, and it is this input voltage dependency that creates a path for kickback noise. This path is strongest when the differential input is at its maximum $\left(C g d_{l} \neq C g d_{2}\right)$, and virtually nonexistent when the differential input is very small $\left(C g d_{1} \approx C g d_{2}\right)$. The resistive divider dynamic comparator is more sensitive to this kickback noise path as M1 and $M 2$ are normally operating in triode, unlike the differential pair comparator where $M 1$ and $M 2$ begin their operation in saturation; suppose a maximum differential input is applied to the resistive divider comparator with $V 1>V 2$, then it is likely that $M 2$ is off while $M 1$ is in deep triode making $C g d_{1} \gg C g d_{2}$ and inducing a significant kickback signal at the inputs.


Fig. 75. Kickback noise coupling in dynamic comparators; (a) through Cgd and (b) through Cgs.

There is also another kickback noise path that is found in dynamic comparators and not static comparators, and that is through $\mathrm{Cgs}_{1}$ and $\mathrm{Cgs}_{2}$. The differential pair
comparator is more vulnerable to this path due to the presence of a latched switch at the sources of M1 and M2 as seen in Fig. 75(b). When $V_{\text {Latch }}$ is LOW, the source potential $V_{S 1,2}$ of $M 1$ and $M 2$ is approximately at $\max \{V 1, V 2\}-V_{T(1,2)}$. As $V_{L a t c h}$ goes HIGH, $V_{S 1,2}$ is pulled to $G N D$ generating a step that is fed back to the inputs through $C g s_{1}$ and $C g s_{2}$. Similar to the kickback noise path through $C d g$ described earlier, this path is strongest when the differential input is at its maximum $\left(C g s_{1} \neq C g s_{2}\right)$, and virtually nonexistent when the differential input is very small $\left(C g s_{1} \approx C g s_{2}\right)$. The resistive divider comparator can too suffer from this kickback noise path but at a much smaller scale; if the routing parasitic to $G N D$ has a sizable resistance $R_{G N D}$, then the inrush current as the comparator is latched can create a step across $R_{G N D}$ leading to kickback noise through $C g s_{1}$ and $C g s_{2}$.

Unfortunately, none of the kickback noise reduction techniques used for static comparators are robust against voltage-dependent capacitors. Therefore, to overcome the effects of kickback noise in dynamic comparator, a sampling based approach is used [102]. This is demonstrated in Fig. 76 for a sub-ADC comparator in a pipeline cell, and the operation is based on a two-phase clock cycle synchronous with that of the pipeline ADC. In the sampling phase, $\Phi_{1}$, the input and reference voltages are sampled on $C_{I N}$ and $C_{R}$ respectively, and in the holding/latching phase, $\Phi_{2}$, the charge is redistributed and applied to M1 and M2 and the comparator is latched. Any kickback noise is now completely isolated from the inputs due to the sample and hold process. Instead, the comparator kickback noise is dumped on $C_{I N}+C_{R}$ at either input and the error is compensated for by the digital redundancy of the pipeline cell. The only error seen at the original inputs, Vin+ and Vin-, is the clock feed-through from the switches.


Fig. 76. Kickback noise reduction using a charge redistribution capacitive input circuit.

It is clear that the sampling-based technique is an effective approach to kickback noise cancellation. However, it is not the most economical. First, the capacitor mismatch between $C_{I N}$ and $C_{R}$ adds directly to the overall mismatch of the comparator, and hence $C_{I N}$ and $C_{R}$ cannot be overly small to limit offset degradation. Second, the use of sampling capacitors adds significantly to the area once the total number of comparators in the ADC is considered. Finally, the timing sequence used in the sampling-based kickback noise cancellation technique adds indirectly to the power consumption.

To qualify the last statement, consider Fig. 77 of a pipeline cell and its corresponding timing scheme. The settling time of the amplifier is expected to be $t_{s, \text { exp }}$ which is equal to the duration of $\Phi_{2}$. However, since the comparator is latched at the beginning of $\Phi_{2}$, there is a delay time $t_{d l}$ until the comparators fully regenerate. Next, a digital propagation delay $t_{d 2}$ is spent before the proper signal is sent to the MUX. Finally, another delay time $t_{d 3}$ is wasted before the proper $V_{D A C}$ value is selected and applied to the amplifier. Therefore, the amplifier ends with only $t_{s, a c t}$ which can be in the order of

100s of pico seconds less than $t_{s, \text { exp }}$. As a result, and to achieve the required settling accuracy in $t_{s, a c t}$, the amplifiers need to be faster and that can only be achieved by increasing their power budget since the loads are predefined by matching and thermal noise requirements and cannot be reduced.


Fig. 77. A pipeline cell and its corresponding timing and delay scheme.

In an effort to reduce the kickback noise of dynamic comparators without excessively compromising the power and area budgets, a replica approach and a different timing scheme are proposed, which are geared towards the differential pair dynamic comparator. First we reexamine the timing scheme as given by Fig. 78 where the main change from Fig. 77 is the separation of $V_{\text {Latch }}$ from $\Phi_{2} ; V_{\text {Latch }}$ now goes HIGH halfway into $\Phi_{1}$ and LOW synchronous to $\Phi_{1}$. This separation of $V_{\text {Latch }}$ from $\Phi_{2}$ provides ample time for comparator regeneration, digital delay propagation and selection of the
proper $V_{D A C}$ before is $\Phi_{2}$ enabled. Thus, the amplifier indeed utilizes all of $\Phi_{2}$ for settling and an increase in its power consumption for these delays as in the case of the sampling approach is avoided.


Fig. 78. Proposed timing scheme for kickback noise reduction.

A concern may arise from the timing scheme of Fig. 78 in regard to the comparator being latched before the MDAC of the previous pipeline stage has completely settled at the end of $\Phi_{1}$. However, since the settling requirement of an MDAC is far stringent than the input requirement of a sub-ADC, latching the comparator halfway through $\Phi_{l}$ will not result in decision errors. Consider for example the first MDAC in a $1.5 \mathrm{bits} /$ stage of a 10 bit ADC. The error in the final settling of the MDAC needs to be within $0.05 \%$ ( $1 / 2 L S B$ of $9+1$ bits), which translates to a minimum of
7.6 time constants. On the other hand, due to the inherent digital redundancy of the subADC and the output being only 2bits, the accuracy of the input to the comparators needs to be within $12.5 \%$ ( $1 / 2 L S B$ of 2 bits), which is only 2 time constants. If indeed the comparators are latched halfway into $\Phi_{1}$, then the MDAC of the previous stage would have settled to within $2.2 \%$ ( 3.8 time constants) and an accurate comparator decision is guaranteed. Moreover, latching the comparators early maximizes the time for the amplifier to naturally recover from kickback noise as discussed at the beginning of section $D$.

In addition to the timing scheme, the proposed approach makes use of a comparator replica to counteract the step seen at the sources of M1 and M2 when the comparator is latched. This is demonstrated in Fig. 79 along with the necessary latch signals. The replica is intended to be an exact copy of the comparator, but only sharing the input connections. To generate the replica latch signal $V_{\text {Rep }}$, an inversion of $V_{\text {Latch }}$ and $\Phi_{l}$ are passed through an AND gate. Hence, the comparator and replica are only latched during the duration of $\Phi_{1}$ to save on power. Neither the rising edge of $V_{\text {Rep }}$ nor the falling edge of $V_{\text {Latch }}$ is of significance as they occur with the rising and falling edges of $\Phi_{1}$. The crossing of $V_{\text {Rep }}$ and $V_{\text {Latch }}$ is where the kickback noise reduction takes place.

The area increase for the replica is disadvantageous but necessary to be able to replicate the dynamics of the kickback noise through $C g s$ of $M 1$ and M2. Moreover, a replica is only needed for the first few pipeline stages. Once the capacitor scaling of stages is ceased and the last pipeline stage is copied down, the resolution becomes rather coarse and the pipeline cells more tolerant to kickback noise, and thus a replica is no
longer necessary. Therefore, the comparator area is not doubled, but perhaps increased by $40 \%$ if a replica is no longer used beyond the fourth pipeline stage based on a $1.5 \mathrm{bits} /$ stage architecture. Finally, the replica does not provide perfect cancellation as the dynamics of the voltage-dependent capacitors are difficult to match precisely. Nonetheless, together with the timing scheme good results can be achieved.


Fig. 79. Proposed kickback noise reduction approach including the core comparator, a replica and the associated timing scheme.

## E. Comparator Implementation and Simulation Results

The comparator architecture used in the pipeline ADC is based on the dynamic differential pair input comparator and the proposed kickback noise reduction technique discussed in the previous section. Here the focus is shifted to the implementation of the comparator used in the pipeline ADC of this work and the simulation results.

The comparator presentation thus far was based on single ended input comparators, but in a pipeline ADC all signals are take differential form. To convert the comparator from single to differential input, one need only modify the differential pair as in Fig. 80. Moreover, since the differential input pair operate in saturation when the comparator is latched, the switching threshold of the comparators is adjusted by scaling the dimension of $M 1$ and $M 2$ as expressed by (148) [99]-[101].


Fig. 80. Transformation of a single-ended input comparator to fully differential.

$$
\begin{equation*}
\left.V_{I N}\right|_{\text {Threshold }}=V_{I N}^{+}-V_{I N}^{-}=\frac{(W / L)_{2}}{(W / L)_{1}}\left(V_{R}^{+}-V_{R}^{-}\right)=\frac{(W / L)_{2}}{(W / L)_{1}} V_{R} \tag{148}
\end{equation*}
$$

For the following simulation results, the setup used a model for a first stage MDAC with 90 dB DC gain, $1.35 \mathrm{GHz} G B W$ and total capacitive load of 2.0 pF to drive the comparator differentially based on Fig. 71. The MDAC is sampling a 10 MHz signal at $200 \mathrm{MS} / \mathrm{s}$ with a 1 Vpp swing. The waveforms in Fig. 81 show the output of the MDAC without a comparator; with a comparator but no kickback noise reduction; and with a comparator and proposed kickback noise reduction.


Fig. 81. The effect of kickback noise on MDAC output.

Note how the magnitude of the kickback noise follows that of the input signal due to the voltage-dependent capacitance that dominates the leakage path as previously discussed. This is fairly clear from the enlarged sections where the magnitude of the glitches is larger for larger inputs and vice versa. Also, while the proposed technique
does not fully eliminate the kickback noise as apparent from the presence of glitches, the overall charge injected from the comparator and replica help bring the final settling value closer to the ideal case. In the case of the comparator alone the final value was $715 \mu \mathrm{~V}$ greater than the ideal case $(0.73 L S B)$, whereas using the proposed kickback noise reduction technique the final value was only $153 \mu \mathrm{~V}(0.16 L S B)$ away.

As for timing, using the proposed scheme and sampling at 200 MHz , the comparator has roughly 1ns to fully regenerate. In Fig. 82, the outputs of the comparator $V_{A}$ and $V_{\bar{A}}$ are captured along with $V_{\text {Latch }}$. The worst case regeneration time occurs for the smallest input and is typically 380 ps, which leaves ample time to account for digital circuitry and MUX delays.


Fig. 82. Comparator regeneration time; (a) outputs $V_{A}$ and $V_{\bar{A}}$ and (b) $V_{\text {Latch }}$.

Finally, the greatest advantage of the dynamic comparators is their low power consumption, and Fig. 83 shows the total transient current of the comparator including the replica used for kickback noise reduction. Notice how the current transient is characterized with three distinctive spikes per sampling period (5ns), which indicate the shoot-through currents as the comparator and replica latches are activated. These spikes, however, decay quickly as the comparator regenerates. The average power consumption of the comparator including the replica is $120 \mu \mathrm{~A}$, which amounts for a total power of 1.8 mA for the whole ADC-approximately $6.9 \%$. This can be further improved if a slight non-overlap is established for $V_{\text {Latch }}$ and $V_{\text {Rep }}$ to drive the PMOS and NMOS devices separately and minimize shoot-through.


Time (s)
Fig. 83. Comparator and replica total current; (a) control signals $V_{\text {Latch }}$ and $V_{\text {Rep }}$ and (b) total transient current.

In summary, through a survey of the different comparator types and analysis of kickback noise behavior, a power conscious technique for kickback noise reduction is proposed for dynamic differential pair input comparators. When examined with a true MDAC, the proposed technique proved useful in reducing the error caused by kickback noise such that the settling error is within $0.16 L S B$. This is achieved through the use of a timing scheme that allows the amplifier to capitalize on its natural kickback noise rejection, and a replica which reduces the kickback noise by superposition of opposite polarity signals. Moreover, the proposed technique is power efficient despite the fact that it relies on a replica approach, as it ensures the maximum allowable settling time for the amplifiers through the elimination of digital circuit and MUX delay intrusions into the HOLD phase of the MDAC.

## CHAPTER VIII

## SIMULATION AND EXPERIMENTAL RESULTS

The discussion in the preceding chapters demonstrated the direct effects of the amplifier performance on that of the pipeline ADC, and the challenges that come with high speed, low voltage and low power design. For that purpose, a new amplifier topology-the recycling folded cascode-was proposed that effectively addresses many of these limitations.

In this chapter, the simulation and experimental results of the research are presented. First, the conventional folded cascode and the recycling folded cascode are examined. Then, a S/H amplifier suitable for a 1 Vpp 10bit Pipeline ADC with up to $200 \mathrm{MS} / \mathrm{s}$, which is built on the recycling folded cascode and employing the dual level CMFB is presented. The prototype amplifiers and the S/H were fabricated on the same die using TSMC $0.18 \mu \mathrm{~m}$ CMOS technology, and the die was packaged in a QFN40 chip. To facilitate the testing, a PCB was designed to accommodate both test setups, and provide easy access to multiple points in the signal path for verification. Fig. 84 shows the bonded die in the open cavity QFN40 package, and Fig. 85 shows the PCB used in the test. Finally, a complete low voltage, low power, 10bit, 160MS/s pipeline ADC is discussed, highlighting the performance with respect to state-of-the-art present implementations.


Fig. 84. Amplifiers and S/H prototype chip.


Fig. 85. Prototype chip characterization PCB.

## A. Recycling Folded Cascode

The simulation results of the prototype amplifiers were presented in the characterization section of Chapter V. Here we examine the experimental results. The prototype amplifiers, FC, RFC1 and RFC2 were fabricated in a closed loop configuration driving the same on-chip load, and biased with $800 \mu \mathrm{~A}$ for the FC and RFC1, and $400 \mu \mathrm{~A}$ for RFC2. Fig. 86 shows the amplifiers and their respective loads as they appear on silicon; the FC, RFC1 and RFC2 measure roughly 4700, 5000, and $3000 \mu \mathrm{~m}^{2}$ respectively. The amplifiers configuration is shown schematically in Fig. 87 with the load and test setup. The internal load $R \| C_{1}$ and $C_{2}$ are $560 \mathrm{k} \Omega \| 2.16 \mathrm{pF}$ and 2.5 pF respectively. As for $C_{3}$, it represents the total parasitic capacitance of the output PCB trace and the oscilloscope probe.


Fig. 86. Enlarged die section showing the FC, RFC1 and RFC2 with their loads.


Fig. 87. Amplifier prototype test setup.

The single-ended setup in Fig. 87 is used to simplify the characterization of several metrics, for which the results are presented in the following sub-sections.

## 1. Gain Bandwidth

The amplifiers are configured in an inverting gain configuration. The closed loop gain expression, $A_{C L}$, is given by (149), where $Z_{l}$ is $R \| C_{I}$ and $A(s)$ is the amplifier gain as a function of frequency. At $s=j G B W$, the amplifier's gain is unity and the closed loop gain magnitude becomes $1 / 3$. Therefore, the $G B W$ is obtained by simply sweeping the frequency of a signal with known amplitude ( 500 mVpp ) and noting the frequency where the gain drops to $1 / 3(-9.5 \mathrm{~dB})$. The experimental results are given in Fig. 88 .

$$
\begin{equation*}
A_{C L}=-\frac{Z_{1}}{2 Z_{1}}\left(\frac{A(s)}{1+\frac{Z_{1}}{2 Z_{1}} A(s)}\right)=-\left(\frac{A(s)}{2+A(s)}\right) \tag{149}
\end{equation*}
$$



Fig. 88. Experimental frequency sweep of $\mathrm{FC}, \mathrm{RFC} 1$ and RFC2 amplifier outputs.

The first observation is that the frequency performance of the FC and RFC2 is virtually identical despite RFC2 using only half the area and half the power of the FC. The frequency at which the gain of the FC and RFC2 amplifiers falls to -9.5 dB is roughly 70 MHz , in disagreement with the simulation results in Chapter V. This is because in the simulations results the parasitic capacitance of the PCB traces and the oscilloscope probes was not taken into account. By accounting for 0.25 pF of PCB trace capacitance and 2 pF for the Tektronix P6205 active FET probe, the simulated load is increased by $60 \%$, which reduces the simulated $G B W(115 \mathrm{MHz})$ of the FC and RFC2 to 72 MHz , in good agreement with the experimental results. As for the RFC1, the frequency range of the Tektronix AFG 3102 was limited to 100 MHz , and hence the response was extrapolated with a -6 dB /octave slope as shown in Fig. 88. Through
extrapolation the $G B W$ of the RFC1 is found to be 110 MHz . However, the GBW of the RFC1 was expected to be twice that of the FC. This discrepancy is the result of the different PCB trace capacitance seen by RFC1. Fig. 89 is a closer look at the PCB traces associated with each amplifier and their differences. Given the width of the traces and thickness of the PCB, the trace capacitance for the FC, RFC1 and RFC2 can be calculated to be $0.18 \mathrm{pF}, 1.39 \mathrm{pF}$ and 0.24 pF respectively. In addition to the 2 pF input capacitance of the Tektronix P6205 active FET probe, the overall capacitive load seen by the FC , RFC 1 and RFC 2 is $5.76 \mathrm{pF}, 6.97 \mathrm{pF}$ and 5.82 pF respectively. The difference in trace capacitance for the FC and RFC2 is negligible when compared to the overall load $(\sim 1 \%)$. However, the trace capacitance for the RFC1 increases the overall load by $21 \%$. When this is taken into account and by normalizing the $G B W$ with respect to the FC capacitive load, the $G B W$ of the RFC1 becomes 133.1 MHz , almost twice (1.9 times) the $G B W$ of the FC.


Fig. 89. PCB trace differences for the FC, RFC1 and RFC2 amplifiers.

An alternative way to measure the GBW is by applying a small step signal and measuring the settling time. Fig. 90 shows the step response of the prototype amplifiers to a $10 \mathrm{MHz}, 100 \mathrm{mV} p \mathrm{step}$ signal. The amplitude of $100 \mathrm{mV} p \mathrm{ensures}$ that the amplifiers will not slew as predicted by (28). The FC and RFC2 traces are closely overlapped indicating similar $G B W$, while the RFC1 clearly demonstrates a higher $G B W$. The settling time of the FC, RFC1 and RFC2 to an accuracy of $1 \%$ is $20.7 \mathrm{~ns}, 13.2 \mathrm{~ns}$ and 20.8 ns respectively. This translates to a GBW of 70.7 MHz for the $\mathrm{FC}, 110.9 \mathrm{MHz}$ for RFC1 and 70.4 MHz for RFC2, in good agreement with the frequency sweep results.


Fig. 90. Small signal step response of the FC, RFC1 and RFC2 amplifiers.

## 2. Slew Rate

The amplifiers are forced to slew if the input is greater than the maximum described by (28), which is roughly 200 mV . The output response of the FC, RFC1 and RFC2 amplifiers to a 5 MHz , 1Vpp input step is shown in Fig. 91. From Fig. 91, the RFC1 has a clearly improved $S R$ over the FC despite the same bias current. Moreover, at half the bias current, the $S R$ of the RFC2 is slightly better than the FC. The average $S R$ of the FC , RFC1 and RFC2 normalized to the FC total capacitive load is $42.15 \mathrm{~V} / \mu \mathrm{s}$, $94.13 \mathrm{~V} / \mu \mathrm{s}$ and $48.12 \mathrm{~V} / \mu \mathrm{s}$ respectively; that is the $S R$ of the RFC 1 is enhanced 2.23 times over the FC for the same bias current, while that of the RFC2 is enhanced 1.14 times over the FC for half the bias current.


Fig. 91. Large signal step response of the FC, RFC1 and RFC2 amplifiers.

## 3. Distortion

The linearity of the amplifiers can be measured through their distortion behavior. A 1 Vpp two tone test centered at 1 MHz and separated by $100 \mathrm{kHz}(500 \mathrm{mV} p \mathrm{at}$ 0.95 MHz and 500 mVpp at 1.05 MHz ) was applied to all amplifiers and the FFT data of the outputs was captured using the Tektronix TDS 5054 and plotted in MATLAB. The results for the FC, RFC1 and RFC2 are given in Fig. 92. The third intermodulation distortion, $I M 3$, is -61.7 dB for the $\mathrm{FC},-66.1 \mathrm{~dB}$ for RFC 1 and -61.6 dB for RFC 2 .

In Chapter IV the amplifier slew rate effects were considered for step inputs. For continuous signals, the $S R$ can indicate the maximum input frequency as given by (150), where the output voltage is a time-varying signal $A_{m} \cos \left(2 \pi f_{i n} t\right)$, with amplitude $A_{m}$ and frequency $f_{\text {in }}$. According to (150) and the average $S R$ results presented earlier, the FC, RFC1 and RFC2 amplifiers can support signals up to $13.4 \mathrm{MHz}, 29.9 \mathrm{MHz}$ and 15.3 MHz respectively without slewing.

$$
\begin{equation*}
S R \geq \frac{\partial \text { Vout }}{\partial t}=\left.2 \pi f_{\text {in }} A_{m} \sin \left(2 \pi f_{\text {in }} t\right)\right|_{\max } \Rightarrow f_{\text {in }} \leq \frac{S R}{2 \pi A_{m}} \tag{150}
\end{equation*}
$$

Therefore, the distortion is solely due to the reduced amplifier gain around 1 MHz . The FC and RFC2 have almost identical GBWs and hence it is not surprising to see them have similar IM3 performance, as they have the same voltage gain around 1 MHz . The RFC1, on the other hand has a wider $G B W$ and hence a higher gain around 1 MHz , which explains the improved IM3 performance. According to the absolute $G B W$
results, the open loop gain at 1 MHz of the FC and RFC 2 amplifiers is 36.6 dB and the gain of the RFC1 amplifier is 40.8 dB . The difference in gains is in good agreement with the difference in IM3 performance for the FC, RFC1 and RFC2.

## 4. Summary

The overall summary of the FC, RFC1 and RFC2 performance is presented in Table 5. Overall, if one intends to better utilize the power, RFC1 proved to be a more efficient amplifier. It provides a higher gain, almost twice the $G B W$, better than twice the average $S R$, a better $I M 3$ and a lower noise and offset-all while utilizing the same power and virtually the same silicon area. On the other hand, for half the power and almost half the area, the same performance of the FC can be replicated using RFC2, on the expense of an increase in noise and input offset.


Fig. 92. Two tone FFT spectrums of the FC, RFC1 and RFC2 for a 1 Vpp signal centered at 1 MHz and separated by 100 kHz .

TABLE 5
Simulation and Experimental Results Performance Summary of the FC, RFC1 and RFC2 Amplifiers

| Parameter | FC | RFC1 | RFC2 |
| :---: | :---: | :---: | :---: |
| Power ( $\mu \mathrm{A}$ ) | 800 | 800 | 400 |
| Area ( $\mu \mathrm{m}^{2}$ ) | $\begin{gathered} 4712.9 \\ (53.8 \times 87.6) \end{gathered}$ | $\begin{gathered} 4958.2 \\ (56.6 \times 87.6) \end{gathered}$ | 3001.8 $(58.4 \times 51.4)$ |
| GBW (MHz) | 70.7 | 110.9 (134.2*) | 70.4 |
| Load $\mathrm{C}_{\mathrm{L}}(\mathrm{pF})$ | 5.76 | 6.97 | 5.82 |
| DC Gain (dB) | $\begin{gathered} 46.02 \\ \left(52.63^{* *}\right) \end{gathered}$ | $\begin{gathered} 53.56 \\ (60.91 * *) \end{gathered}$ | $\begin{gathered} 54.89 \\ \left(59.32^{* *}\right) \end{gathered}$ |
| SR+ (V/ $\mu \mathrm{s}$ ) | 42.75 | 65.13 (78.81*) | 41.09 |
| SR- (V/ $\mu \mathrm{s}$ ) | 41.55 | $\begin{gathered} 90.45 \\ \left(109.44^{*}\right) \end{gathered}$ | 55.14 |
| Average SR (V/ $/ \mathrm{s}$ ) | 42.15 | 77.79 (94.13*) | 48.12 |
| IM3-1Vpp input at $1 \mathrm{MHz}(\mathrm{dB})$ | -61.7 | -66.1 | -61.6 |
| Input integrated noise ( $\mu \mathrm{Vrms}$ ) $(1 \mathrm{~Hz}-100 \mathrm{MHz})^{* *}$ | 53.16 | 48.48 | 69.71 |
| Input offset - $3 \sigma(\mathrm{mV})^{* *}$ | 7.92 | 7.64 | 11.08 |
| Settling Time - 1\% (ns) | 20.7 | 13.2 (10.91*) | 20.8 |

* Normalized to the FC capacitive load
** Simulation results


## B. Sample and Hold

The front end of a pipeline ADC is the $\mathrm{S} / \mathrm{H}$ amplifier. A flip-around $\mathrm{S} / \mathrm{H}$ architecture was designed for a 10 bit 1 Vpp pipeline ADC and fabricated in TSMC $0.18 \mu \mathrm{~m}$ CMOS technology as an application of the recycling folded cascode. The S/H also utilized the low voltage two-stage gain boosting technique discussed in Chapter IV and the dual level CMFB discussed in Chapter VI. An enlarged section of the die where the $\mathrm{S} / \mathrm{H}$ amplifier is placed is shown in Fig. 93, where $C_{S H}$ and $C_{C}$ are the sample and hold and coupling capacitors respectively.


Fig. 93. Enlarged die section showing the $\mathrm{S} / \mathrm{H}$ amplifier and open drain buffer.


Fig. 94. S/H test setup.

The test setup used to characterize the $\mathrm{S} / \mathrm{H}$ is given in Fig. 94, showing all components and their values. As the testing is done at high frequencies, careful choice of components was necessary to improve impedance matching between the equipment used and the PCB. The single-ended input signal was converted to fully differential using the THS4513 wideband low distortion fully differential amplifier. Also, the THS4513 configuration acts as an anti-aliasing filter, with a corner frequency of 96.5 MHz . The clock phases necessary for operation were generated on chip using a self-biased amplifier [103] and the logic shown in Fig. 95, which in simulations guaranteed a $50 \%$ duty cycle $\pm 2 \%$ for different corners and temperatures up to a CLK input of 250 MHz in a 1.8 V domain. The switches were implemented using NMOS low $V_{T}$ devices, which eliminated the use of bootstrapping techniques. The $C L K$ input was provided by either a pure sinusoidal waveform from the Tektronix AFG 3102 up to 100 MHz , or the HP 81110A pulse/pattern generator up to 200 MHz .


Fig. 95. On-chip generation of S/H clock phases.

In practice the $\mathrm{S} / \mathrm{H}$ will drive the input of the first pipeline stage, which is a small capacitive load compared to the bondpad and equipment input parasitics. Therefore, to facilitate the testing, a highly linear open drain buffer was designed to support the drive capability. However, as the $\mathrm{S} / \mathrm{H}$ and the output buffer operate from different supply domains, AC coupling was necessary. Overall, the total capacitance seen at the output of the $\mathrm{S} / \mathrm{H}$ is 2 pF . Assuming all-linear settling, a $G B W$ of 700 MHz is necessary for the $\mathrm{S} / \mathrm{H}$ amplifier to settle to 10 bit accuracy at 200MS/s. However, taking into account that slewing is inevitable, the $\mathrm{S} / \mathrm{H}$ was designed with $G B W$ of 0.935 GHz and more than $60^{\circ}$ of phase margin. The simulated AC response is given in Fig. 96.


Fig. 96. S/H amplifier AC response.

To verify functionality, the $\mathrm{S} / \mathrm{H}$ amplifier was first tested at a $100 \mathrm{MS} / \mathrm{s}$ for various input frequencies up to and beyond the Nyquist rate. For this purpose the Tektronix AFG 3102 dual signal generator was used. This simplified the capture of the signals on the Tektronix TDS 5054 as both signals come from the same source. Figs. 97, 98 and 99, show trace captures on the Tektronix TDS 5054 for an input of 5 MHz , 50 MHz and 75 MHz respectively.


Fig. 97. 5 MHz input and $\mathrm{S} / \mathrm{H}$ output at $100 \mathrm{MS} / \mathrm{s}$.


Fig. 98. 50 MHz input and $\mathrm{S} / \mathrm{H}$ output at $100 \mathrm{MS} / \mathrm{s}$.


Fig. 99. 75 MHz input and $\mathrm{S} / \mathrm{H}$ output at $100 \mathrm{MS} / \mathrm{s}$.

The results shown in Figs. 97-99 confirm functionality of the $\mathrm{S} / \mathrm{H}$ amplifier. More specifically, the functionality of the low voltage gain boosting, the dual level CMFB and the use of low $V_{T}$ devices as switched are verified. One note, however, is the attenuated output level with respect to the input signal. By examining the signal path from the Tektronix AFG 3102 to the Tektronix TDS 5054, the THS4513 configuration contributes an attenuation of 0.95 , the internal coupling an attenuation of 0.5 , the output buffer an attenuation of 0.92 and finally the output matching network with an attenuation of 0.375 . This results in a total attenuation of 0.164 , which is in good agreement with the results seen in Figs. 97-99.

To step up the sampling frequency to $200 \mathrm{MS} / \mathrm{s}$, the HP 81110A pulse/pattern generator was used for $C L K$. For this test, the FFT function of the Tektronix TDS 5054 was used to capture the output data of the $\mathrm{S} / \mathrm{H}$ amplifier and processed in MATLAB to generate the plots. A single tone test was first performed near the Nyquist sampling limit using a 90 MHz full scale ( 1 Vpp ) input. The results are shown in Fig. 100. The spectrum of the $\mathrm{S} / \mathrm{H}$ amplifier shows an $H D 3$ of 57.32 dB . As the spectrum is clear of any other spurs, the $S F D R$ is also 57.32 dB . As for the $S N R$, the noise floor seen in Fig. 100 is dominated by the noise of the equipment used and the output buffer. Nonetheless, assuming a 10bit noise floor, the $E N O B$ can be calculated to give 9.03 bits. As for Fig. 101 and 102 , they show the results of a two tone test with tones of 500 mV pp each centered at 90 MHz and separated by 100 kHz . The spectrum is clear of any spurs, apart from the intermodulation distortion seen in Fig. 102, yielding an IM3 of 51.34dB.


Fig. 100. S/H output spectrum for a $1 \mathrm{Vpp}, 90 \mathrm{MHz}$ input sampled at 200 MHz .


Fig. 101. S/H output spectrum for a 2 tone 1 Vpp input centered at 90 MHz and separated by 100 kHz , sampled at 200 MHz .


Fig. 102. Expanded spectrum of Fig. 101.

## C. Pipeline $A D C$

A 10bit, $1 \mathrm{Vpp}, 160 \mathrm{MS} / \mathrm{s}$ pipeline ADC was designed using the techniques presented in the previous chapters, and laid-out using SMIC $0.18 \mu \mathrm{~m}$ CMOS technology. The general architecture of the pipeline ADC and layout of the whole chip are presented in Fig. 103 and Fig. 104 respectively. The individual building blocks in Fig. 103 are based on those discussed in Chapter III, using the gain boosted RFC amplifier with dual level CMFB and dynamic comparators with kickback noise reduction where applicable. Grey and white blocks operate from 1.2 V and 1.8 V supplies respectively.


Fig. 103. Block diagram of pipeline ADC designed in SMIC $0.18 \mu \mathrm{~m}$ CMOS technology.


Fig. 104. Complete pipeline ADC layout in SMIC $0.18 \mu \mathrm{~m}$ measuring $4 \times 4 \mathrm{~mm}^{2}$ and featuring 4 separate ADCs multiplexed to the LVDS output drivers.

The most challenging block in the pipeline ADC is the amplifier used to implement the MDAC of the first pipeline stage. It needs to settle within $0.1 \%$ (10bit accuracy) in less than 2.8 ns for $160 \mathrm{MS} / \mathrm{s}$ at full scale input ( 1 Vpp ). For that purpose, a 1.45 GHz bandwidth amplifier is designed with $55^{\circ}$ of phase margin and Fig. 105 shows its AC response. To demonstrate functionality of the internal blocks, a small input is applied to the ADC and the output of the S/H, P1 and P2 stages are plotted in Fig. 106. This shows the multiply by 2 function of the pipeline cells and the capability to handle a 1Vpp swing.


Fig. 105. First pipeline stage amplifier AC response.


Fig. 106. Transient simulation of pipeline ADC showing the input signal and outputs of the $\mathrm{S} / \mathrm{H}, \mathrm{P} 1$ and P2 blocks.

Sampling at $160 \mathrm{MS} / \mathrm{s}$, a $F S$ single tone near the Nyquist sample rate at 78 MHz was applied as an input to the pipeline ADC. The FFT plot is given in Fig. 107. The only visible spur is that of the third harmonic, and the HD3 is 62.46dB. Also from Fig. 107, the calculated $S N D R$ is 57.12 which translates to an $E N O B$ of 9.19 . A figure of merit $\left(F o M_{1}\right)$ used to measure the efficiency of an ADC is described by (151), which links the ADC power $P$ to the resolution $E N O B$ and sampling frequency $f_{S}$. The efficiency is quantified as $\mathrm{pJ} /$ conversion-step. However, to incorporate the added speed advantage of newer CMOS technologies over older ones in the comparison, a modified version of (151) given in (152) can be used. In (152), the transit frequency $f_{T, \text { other }}$ of other CMOS implementations is normalized to that of the device under test, $f_{T, D U T}$. Thus, the resulting $\mathrm{FoM}_{2}$ is more design based than technology based.


Fig. 107. 1k FFT ADC spectrum for $f_{I N}=78 \mathrm{MHz}$ at $160 \mathrm{MS} / \mathrm{s}$.

$$
\begin{gather*}
F o M_{1}=\frac{P}{2^{E N O B} f_{S}}  \tag{151}\\
\text { FoM }_{2}=\frac{P}{2^{\text {ENOB }} f_{S}} \cdot \frac{f_{T, \text { Other }}}{f_{T, D U T}} \tag{152}
\end{gather*}
$$

A comparison of the pipeline ADC presented here with recent state-of-the-art implementations of comparable resolution in literature is summarized in Table 6.

TABLE 6
Performance Summary of Recent State-of-the-Art Pipeline ADCs

| Parameter | [13] | [15] | [16] | [19] | [23] | [26] | This <br> Work* |
| :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: |
| CMOS <br> Technology | $\begin{gathered} 0.18 / 0.35 \\ \mu \mathrm{~m} \end{gathered}$ | 90nm | 130nm | 130 nm | $\begin{gathered} 0.18 / 0.35 \\ \mu \mathrm{~m} \end{gathered}$ | 90nm | $0.18 \mu \mathrm{~m}$ |
| Resolution | 10bits | 10bits | 10bits | 10bits | 10bits | 10bits | 10bits |
| $f_{S}[\mathrm{MHz}]$ | 170 | 200 | 400 | 100 | 210 | 205 | 160 |
| Supply [V] | 3.3 | 1.2 | 1.2 | 3 | 1.8 | 1.2 | 1.2 |
| Vin [Vpp] | 2 | 0.8 | 1.2 | 2 | 1.5 | 1 | 1 |
| Power [mW] | 180 | 55 | 160 | 72 | 140 | 40 | 42 |
| Area $\left[\mathrm{mm}^{2}\right]$ | 0.85 | 1.26 | 4.2 | 0.54 | 1.5 | 1 | 1.1 |
| SNDR [dB] | 57.8 | 53.6 | 53.7 | 56.3 | 59.3 | 53.9 | 57.12 |
| SFDR [dBc] | 70.3 | 66.5 | 60.9 | 67.5 | 85.9 | 61.8 | 62.46 |
| ENOB | 9.31 | 8.61 | 8.63 | 9.06 | 9.56 | 8.66 | 9.19 |
| $\mathrm{FoM}_{1}$ [pJ/Conv-Step] | 1.67 | 0.71 | 1.01 | 1.35 | 0.88 | 0.48 | 0.45 |
| $\mathrm{FoM}_{2}$ [pJ/Conv-Step] | 1.67 | 1.97 | 1.82 | 2.43 | 0.88 | 1.35 | 0.45 |

[^3]From Table 6, the pipeline ADC presented in this work demonstrates the highest potential efficiency. Considering the technology of implementation, the presented work is twice as efficient as a similar $0.18 \mu \mathrm{~m}$ CMOS implementation, but using $27 \%$ smaller area. The presented work also shows a slightly better efficiency than similar 90 nm CMOS implementations with only $10 \%$ increase in area.

## CHAPTER IX

## CONCLUSIONS

The pipeline ADC is a popular architecture that finds use in a wide range of applications with resolutions up to 14 bits and +100 MHz sampling speeds. The current approach to its design puts a lot of emphasis on the architectural level, rather than finding new techniques to enhance the performance on the transistor design level. In order to propose such new techniques, a thorough study of the pipeline ADC behavior was undertaken, highlighting the main limitations posed by the individual building blocks. This study concludes that the majority of errors are attributed to the performance metrics of the amplifier used in the front-end S/H and MDACs.

A Recycling Folded Cascode (RFC) amplifier was proposed as a replacement to the conventional folded cascode to remedy or reduce many of its limitations. It has been demonstrated, both in theory and experimental results, that the RFC can achieve twice the gain bandwidth, 8-10 additional gain and at least twice the slew rate as the conventional folded cascode, without adding noise, degrading offset or using additional area or power. Moreover, the recycling folded cascode was confirmed to be robust low voltage environments.

To tackle the amplifier offset effects, the direct auto-zeroing offset cancellation scheme is optimized for low voltage environments using a dual level CMFB circuit, and it was shown that amplifier differential offsets up to 50 mV are effectively cancelled. Together with the RFC, the dual level CMFB was used to implement a fully-differential
flip-around sample and hold amplifier driving a single-ended load of 1.4 pF and using only 2.6 mA , and at $200 \mathrm{MS} / \mathrm{s}$ better than 9 bit linearity is achieved.

Finally the kickback noise of dynamic comparators was addressed using a power conscious technique that replicates the noise with the opposite polarity such that the overall effect is reduced.

Together, these techniques were used in the design of a non-calibrated 1 Vpp 10bit 160MS/s pipeline ADC in SMIC $0.18 \mu \mathrm{~m}$ CMOS technology. The ADC is composed of a simple architecture; a front-end $\mathrm{S} / \mathrm{H}$, eight $1.5 \mathrm{bits} /$ stage pipeline cells and a 2bit flash. The ADC uses an area of $1.1 \mathrm{~mm}^{2}$ and consumes 42 mW in its analog core, and with a near Nyquist-rate full scale signal, 9.2 ENOB is achieved. Compared to recent state-of-the-art implementations in the 100-200MS/s range, the presented ADC design uses the least power per conversion rated at $0.45 \mathrm{pJ} /$ conversion-step.

## REFERENCES

[1] R. Brewer, J. Gorbold, P. Hurrel, C. Lyden, R. Maurino, et al., "A 100dB SNR 2.5MS/s output data rate $\Delta \Sigma$ ADC," ISSCC Dig. Tech. Papers, vol. 1, pp. 172591, Feb 2005.
[2] P. Morrow, M. Chamarro, C. Lyden, P. Ventura, A. Abo, et al., "A $0.18 \mu \mathrm{~m}$ 102dB-SNR mixed CT SC audio-band $\Delta \Sigma$ ADC," ISSCC Dig. Tech. Papers, vol. 1, pp. 178-592, Feb 2005.
[3] R. Schreier, N. Abaskharoun, H. Shibata, I. Mehr, S. Rose, et al., "A 375-mW quadrature bandpass $\Delta \Sigma$ ADC with $8.5-\mathrm{MHz} \mathrm{BW}$ and $90-\mathrm{dB}$ DR at 44 MHz ," IEEE J. Solid-State Circuits, vol. 41, no. 12, pp. 2632-2640, Dec 2006.
[4] G. Mitteregger, C. Ebner, S. Mechnig, T. Blon, C. Holuigue, et al., "A 20-mW 640-MHz CMOS continuous-time $\Sigma \Delta$ ADC with $20-\mathrm{MHz}$ signal bandwidth, $80-$ dB dynamic range and 12-bit ENOB," IEEE J. Solid-State Circuits, vol. 41, no. 12, pp. 2641-2649, Dec 2006.
[5] T. Chang, L. Dung, J. Guo and K. Yang, "A 2.5-V 14-bit, 180-mW cascaded $\Sigma \Delta$ ADC for ADSL2+ application," IEEE J. Solid-State Circuits, vol. 42, no. 11, pp. 2357-2368, Nov 2007.
[6] Y. Yang, T. Sculley and J. Abraham, "A single-die 124dB stereo audio deltasigma ADC with 111dB THD," IEEE J. Solid-State Circuits, vol. 43, no. 7, pp. 1657-1665, July 2008.
[7] S. Park, Y. Palaskas and M. P. Flynn, "A 4-GS/s 4-bit flash ADC in $0.18-\mu \mathrm{m}$

CMOS," IEEE J. Solid-State Circuits, vol. 42, no. 9, pp. 1865-1872, Sep 2007.
[8] H. Yu and M. F. Chang, "A 1-V 1.25-GS/s 8-bit self-calibrated flash ADC in 90nm digital CMOS," IEEE Trans. Circuits and Systems-II: Express Briefs, vol. 55, no. 7, pp. 668-672, July 2008.
[9] A. Ismail and M. Elmasry, "A 6-bit 1.6-GS/s low-power wideband flash ADC in $0.13-\mu \mathrm{m}$ CMOS technology," IEEE J. Solid-State Circuits, vol. 43, no. 9, pp. 1982-1990, Sep 2008.
[10] K. Deguchi, N. Suwa, M. Ito, T. Kumamoto and T. Miki, "A 6-bit 3.5-GS/s 0.9V 98-mW flash ADC in 90-nm CMOS," IEEE J. Solid-State Circuits, vol. 43, no. 10, pp. 2303-2310, Oct 2008.
[11] Z. Wang and M. F. Chang, "A 600-MSPS 8-bit CMOS ADC using distributed track-and-hold with complementary resistor/capacitor averaging," IEEE Trans. Circuits and Systems-I: Regular Papers, vol. 55, no. 11, pp. 3621-3627, Dec 2008.
[12] T. N. Andersen, B. Hernes, A. Briskemyr, F. Telsto, J. Bjornsen, et al., "A costefficient high-speed 12-bit pipeline ADC in $0.18-\mu \mathrm{m}$ digital CMOS," IEEE J. Solid-State Circuits, vol. 40, no. 7, pp. 1506-1513, July 2005.
[13] J. Li, G. Manganaro, M. Courcy, B. Min, L. Tomasi, et al., "A 10b 170MS/s CMOS pipelined ADC featuring 84dB SFDR without calibration," VLSI Circuits Symp. Dig. Tech. Papers, pp. 226-227, 2006.
[14] K. Iizuka, H. Matsui, M. Ueda and M. Daito, "A 14-bit digitally self-calibrated pipelined ADC with adaptive bias optimization for arbitrary speeds up to

40MS/s," IEEE J. Solid-State Circuits, vol. 41, no. 4, pp. 883-890, Apr 2006.
[15] D. Kurose, T. Ito, T. Ueno, T. Yamaji and T. Itakura, "55-mW 200-MSPS 10-bit pipeline ADC for wireless receivers," IEEE J. Solid-State Circuits, vol. 41, no. 7, pp. 1589-1595, July 2006.
[16] S.-C. Lee, K. Kim, J. Kwon, J. Kim and S.-H. Lee, "A 10-bit 400-MS/s 160-mW $0.13-\mu \mathrm{m}$ CMOS dual-channel pipeline ADC without channel mismatch calibration," IEEE J. Solid-State Circuits, vol. 41, no. 7, pp. 1596-1605, July 2006.
[17] A. M. A. Ali, C. Dillon, R. Sneed, A. S. Morgan, S. Bradsley, et al, " A 14-bit $125 \mathrm{MS} / \mathrm{s}$ IF/RF sampling pipeline ADC with 100 dB SFDR and 50 fs jitter," IEEE J. Solid-State Circuits, vol. 41, no. 8, pp. 1846-1855, Aug 2006.
[18] K. Gulati, M. S. Peng, A. Pulincherry, C. E. Munoz, M. Lugin, et al, "A highly integrated CMOS analog baseband transceiver with 180 MSPS 13-bit pipelined CMOS ADC and dual 12-bit DACs," IEEE J. Solid-State Circuits, vol. 41, no. 8, pp. 1856-1866, Aug 2006.
[19] T. Oh, S. Yoo, K. Moon and J. Kim, "A 3.0V 72mW 10b 100 MSample/s Nyquist-rate CMOS pipelined ADC in $0.54 \mathrm{~mm}^{2}$, , IEEE ISCAS Proc., pp. 10271030, 2006.
[20] S. K. Gupta, M. A. Inerfield and J. Wang, "A 1-GS/s 11-bit ADC with 55-dB SNDR, $250-\mathrm{mW}$ power realized by a high bandwidth scalable time-interleaved architecture," IEEE J. Solid-State Circuits, vol. 41, no. 12, pp. 2650-2657, Dec 2006.
[21] S. M. Chen and R. W. Brodersen, "A 6-bit 600MS/s 5.3-mW asynchronous ADC in 0.13- $\mu \mathrm{m}$ CMOS," IEEE J. Solid-State Circuits, vol. 41, no. 12, pp. 2669-2680, Dec 2006.
[22] B. P. Ginsburg and A. P. Chandrakasan, "500-MS/s 5-bit ADC in $65-\mathrm{nm}$ CMOS with split capacitor array DAC," IEEE J. Solid-State Circuits, vol. 42, no. 4, pp. 739-748, Apr 2007.
[23] J. Li, R. Leboeuf, M. Courcy and G. Manganaro, "A 1.8V 10b 210MS/s CMOS pipelined ADC featuring 86dB SFDR without calibration," IEEE CICC Proc., pp. 317-320, Sep 2007.
[24] Z. Lee, C. Wang and J. Wu, "A CMOS 15-bit 125MS/s time-interleaved ADC with digital background calibration," IEEE J. Solid-State Circuits, vol. 42, no. 10, pp. 2149-2160, Oct 2007.
[25] L. Brooks and H. Lee, "A zero-crossing-based 8-bit $200 \mathrm{MS} / \mathrm{s}$ pipelined ADC," IEEE J. Solid-State Circuits, vol. 42, no. 12, pp. 2677-2687, Dec 2007.
[26] S. Lee, Y. Jeon, J. Kwon and J. Kim, "A 10-bit 205-MS/s $1.0-\mathrm{mm}^{2} 90-\mathrm{nm}$ CMOS pipeline ADC for flat panel display applications," IEEE J. Solid-State Circuits, vol. 42, no. 12, pp. 2688-2695, Dec 2007.
[27] Y. Shu and B. Song, "A 15-bit linear 20-MS/s pipelined ADC digitally calibrated with signal-dependent dithering," IEEE J. Solid-State Circuits, vol. 43, no. 2, pp. 342-350, Feb 2008.
[28] S. Jiang, M. Do, K. Yeo and W. Lim, "An 8-bit 200-MSample/s pipelined ADC with mixed-mode front-end S/H circuit," IEEE Trans. Circuits and Systems-I:

Regular Papers, vol. 55, no. 6, pp. 1430-1440, July 2008.
[29] T. Liechti, A. Tajalli, O. C. Akgun, Z. Toprak and Y. Leblebici, "A 1.8V 12-bit 230-MS/s pipeline ADC in $0.18 \mu \mathrm{~m}$ CMOS technology," IEEE APCCAS Proc., pp. 21-24, Dec 2008.
[30] H. Nyquist, "Certain topics in telegraph transmission theory," Trans. AIEE, vol. 47, pp. 617-644, Apr 1928.
[31] C. E. Shannon, "Communication in the presence of noise," Proc. Institute of Radio Engineers, vol. 37, no.1, pp. 10-21, Jan 1949.
[32] J. G. Proakis and D. G. Manolakis, Digital Signal Processing, 4th ed., Upper Saddle River, NJ: Pearson, 2007.
[33] J. C. Candy and G. C. Temes, Oversampling Delta-Sigma Data Converters, New York, NY: IEEE Press, 1992.
[34] W. Kester (ed.), The Data Conversion Handbook, Burlington, MA: Elsevier, 2005.
[35] F. Maloberti, Data Converters, Dordrecht, The Netherlands: Springer, 2007.
[36] P. M. Aziz, H. V. Sorensen and J. V. Spiegel, "An overview of sigma-delta converters: how a 1-bit ADC achieves more than 16-bit resolution," Signal Processing Magazine, IEEE, vol. 13, no. 1, pp. 61-84, Jan 1996.
[37] B. Razavi, Principles of Data Conversion System Design, New York, NY: IEEE Press, 1995.
[38] S. H. Lewis and P. R. Gray, "A pipelined 5-Msample/s 9-bit analog-to-digital converter," IEEE J. Solid-State Circuits, vol. SC-22, no. 6, pp. 954-961, Dec
1987.
[39] B. Ginetti, P. G. A. Jespers and A. Vandemeulebroecke, "A CMOS 13-b cyclic RSD A/D converter," IEEE J. Solid-State Circuits, vol. 27, no. 7, pp. 957-964, July 1992.
[40] Y. A. Haque, R. Gregorian, R. W. Blasco, R. A. Mao and W. E. Nicholson, "A two-chip PCM voice CODEC with filters," IEEE J. Solid-State Circuits, vol. 14, no. 6, pp. 961-969, Dec 1979.
[41] C.C. Enz, and G.C. Temes, "Circuit techniques for reducing the effects of opamp imperfections: autozeroing, correlated double sampling, and chopper stabilization," IEEE Proceedings, vol. 84, no. 11, pp. 1584-1614, Nov 1996.
[42] A. M. Abo and P. R. Gray, "A 1.5 V, 10 bit, 14.3 MS/s CMOS pipeline analog-to-digital converter," IEEE J. Solid-State Circuits, vol. 34, no. 5, pp. 599-606, May 1999.
[43] J. Goes, J. C. Vital and J. E. Franca, "Systematic design for optimization of highspeed self-calibrated pipelined A/D converters," IEEE Trans. Circuits and Systems-II: Analog and Digital Signal Processing, vol. 45, no. 12, pp. 15131526, Dec 1998.
[44] C. T. Motchenbacher and J. A. Connelly, Low-Noise Electronics System Design, New York, NY: John Wiley \& Sons, 1993.
[45] M. Y. Azizi, A. Saeedfar, H. Z. Hoseini and O. Shoaei, "Thermal noise analysis of multi-bit SC gain-stages for low-voltage high-resolution pipeline ADC design," ISSCS Dig. Tech. Papers, vol. 2, pp. 581-584, July 2003.
[46] D. W. Cline and P. R. Gray, "A power optimized 13-b 5 Msamples/s pipelined analog-to-digital converter in $1.2 \mu \mathrm{~m}$ CMOS," IEEE J. Solid-State Circuits, vol. 31, no. 3, pp. 294-303, Mar 1996.
[47] H. Shichman and D. A. Hodges, "Modeling and simulation of insulated-gate field-effect transistors switching circuits," IEEE J. Solid-State Circuits, vol. SC3, pp. 285-289, Sep 1968.
[48] X. Xi, M. Dunga, J. He, W. Liu, K. M. Cao, et al., BSIM4.3.0 MOSFET ModelUser's Manual, Berkeley, CA: University of California, Berkeley, 2003.
[49] B. Razavi, Design of Analog CMOS Integrated Circuits, New York, NY: McGraw-Hill, 2001.
[50] E. H. Nordholt, Design of High-Performance Negative-Feedback Amplifiers, Amsterdam, The Netherlands: Elsevier, 1983.
[51] S. Rosenstark, Feedback Amplifier Principles, New York, NY: Macmillan, 1986.
[52] Y. B. Klamath, R. G. Meyer, and P. R. Gray, "Relationship between frequency response and settling time of operational amplifiers," IEEE J. Solid-State Circuits, vol. 9, pp. 347-352, Dec 1974.
[53] B. K. Thandri and J. Silva-Martinez, "A robust feed-forward compensation scheme for multistage operational transconductance amplifiers with no miller capacitors," IEEE J. Solid-State Circuits, vol. 38, no.2, pp.237-243, Feb 2003.
[54] L. A. Williams, III and B. A. Wooley, "A third-order sigma-delta modulator with extended dynamic range," IEEE J. Solid-State Circuits, vol. 29, no. 3, pp. 193202, Mar 1994.
[55] A. V. Ziel, Noise in Solid State Devices and Circuits, New York, NY: John Wiley \& Sons, 1986.
[56] Y. Tsividis, Operation and Modeling of the MOS Transistor, 2nd ed., New York, NY: Oxford University Press, 1999.
[57] W. Liu, MOSFET Models for SPICE Simulation including BSIM3v3 and BSIM4, New York, NY: John Wiley \& Sons, 2001.
[58] H. C. Casey, Devices for Integrated Circuits: Silicon and III-V Compound Semiconductors, New York, NY: John Wiley \& Sons, 1999.
[59] U. Cilingiroglu, Systematic Analysis of Bipolar and MOS Transistors, Norwood, MA: Artech House, 1993.
[60] A. S. Roy, Noise and Small Signal Modeling of Nanoscale MOSFETS, Doctoral Dissertation, Ecole Polytechnique Fédérale de Lausanne, Switzerland, 2007.
[61] E. Simoen and C. Claeys, "On the ficker noise in submicron silicon MOSFETs," Elsevier J. Solid-State Electronics, vol. 43, pp. 865-882, June 1999.
[62] K. W. Chew, K. S. Yeo and S. F. Chu, "Effect of technology scaling on the 1/f noise of deep submicron PMOS transistors," Elsevier J. Solid-State Electronics, vol. 48, pp. 1101-1109, Mar 2004.
M. J. M. Pelgrom, A. C. J. Duinmaijer and A. P. G. Welbers, "Matching properties of MOS transistors," IEEE J. Solid-State Circuits, vol. 24, no. 5, pp. 1433-1440, Oct 1989.
[64] J. Bastos, M. Steyaert, R. Roovers, P. Kinget, W. Sansen, et al., "Mismatch characterization of small size MOS transistors," in Proc. IEEE Int. Conf.

Microelectronic Test Structures, pp. 271-276, Mar 1995.
[65] L. Portmann, C. Lallement, and F. Krummenacher, "A high density integrated test matrix of MOS transistors for matching study," in Proc. IEEE Int. Conf. Microelectronic Test Structures, pp. 19-24, Mar 1998.
[66] U. Gruenebaum, J. Oehm, and K. Schumacher, "Mismatch modeling and simulation—A comprehensive approach," Analog Integrated Circuits and Signal Processing, vol. 29, no. 3, pp. 165-171, Dec 2001.
[67] P. R. Kinget, "Device mismatch and tradeoffs in the design of analog circuits," IEEE J. Solid-State Circuits, vol. 40, no. 6, pp. 1212-1224, June 2005.
[68] R. Adams, et al., "A 113dB SNR oversampling DAC with segmented noiseshaped scrambling," IEEE J. Solid-State Circuits, vol. 33, no. 12, pp. 1871-1878, Dec 1998.
[69] K. Nguyen, R. Adams, K. Sweetland, and H. Chen, "A 106-dB SNR Hybrid Oversampling Analog to Digital Converter for Digital Audio," IEEE J. SolidState Circuits, vol. 40, no. 12, pp. 2408-2415, Dec 2005.
[70] P. Y. Wu, V. S. -L. Cheung and H. C. Luong, "A 1V 100MS/s 8bit CMOS switched-opamp pipelined ADC using loading-free architecture," IEEE J. SolidState Circuits, vol. 42, no. 4, pp. 730-738, Apr 2007.
[71] K. Lee, S. Kwon and F. Maloberti, "A power efficient two-channel timeinterleaved $\Sigma \Delta$ modulator for broadband applications," IEEE J. of Solid-State Circuits, vol. 42, no. 6, pp. 1206-1215, June 2007.
[72] J. Shen and P. R. Kinget, " 0.5 V 8bit $10 \mathrm{MS} / \mathrm{s}$ pipeline ADC in 90 nm CMOS,"

IEEE J. Solid-State Circuits, vol. 43, no. 4, pp. 787-795, Apr 2008.
[73] I. Ahmed and D. A. Johns, "An 11bit 45MS/s pipelined ADC with rapid calibration of DAC errors in a multibit pipeline stage," IEEE J. Solid-State Circuits, vol. 43, no. 7, pp. 1626-1637, July 2008.
[74] R. G. H. Eschauzier and J. H. Huijsing, Frequency Compensation Techniques for Low-Power Operational Amplifiers, Dordrecht, The Netherlands: Kluwer, 1995.
[75] K. Bult and G. Geelen, "A fast-settling CMOS opamp for SC circuits with 90-dB DC gain," IEEE J. Solid-State Circuits, vol. 25, no. 6, pp. 1397-1384, Dec 1990.
[76] K. Nakamura and L. R. Carley, "An enhanced fully differential folded-cascode op amp," IEEE J. Solid-State Circuits, vol. 27, no.4, pp. 563-568, Apr 1992.
[77] J. Adut, J. Silva-Martinez and M. Rocha-Perez, "A 10.7MHz sixth-order SC ladder filter in $0.35 \mu \mathrm{~m}$ CMOS technology," IEEE Trans. Circuits and SystemsI: Regular Papers, vol. 53, no. 8, pp. 1625-1635, Aug 2006.
[78] J. Roh, "High-gain class-AB OTA with low quiescent current," Springer J. Analog Integrated Circuits and Signal Processing, vol. 47, no.2, pp. 225-228, May 2006.
[79] L. Yao, M. Steyaert and W. Sansen, "A 1-V $140 \mu$ W 88-dB audio sigma-delta modulator in 90-nm CMOS," IEEE J. Solid-State Circuits, vol. 39, no. 11, pp. 1809-1818, Nov 2004.
[80] R. Assaad and J. Silva-Martinez, "Enhancing general performance of folded cascode amplifier by recycling current," IEE Electronics Letters, vol. 43, no. 23, Nov 2007.
[81] D. Johns and K. Martin, Analog Integrated Circuits Design, New York, NY: John Wiley \& Sons, 1997.
[82] J. E. Duque-Carrillo, "Control of the common-mode components in CMOS continuous-time fully differential signal processing," Kluwer J. Analog Integrated Circuits and Signal Processing, vol. 4, no. 2, pp. 131-140, Sep 1993.
[83] D. Senderowicz, S. F. Dreyer, J. H. Huggins, C. F. Rahim and C. A. Laber, "A family of differential NMOS analog circuits for a PCM codec filter chip," IEEE J. Solid-State Circuits, vol. SC-17, no. 6, pp. 1014-1023, Dec 1982.
[84] O. Choksi and L. R. Carley, "Analysis of switched-capacitor common-mood feedback circuit," IEEE Trans. Circuits and Systems-II: Analog and Digital Signal Processing, vol. 50, no. 12, pp. 906-917, Dec 2003.
[85] B. Hernes, A. Briskemyr, T. N. Andersen, F. Telste, T. E. Bonnerud, et al., "A 1.2V 220MS/s 10b pipeline ADC implemented in $0.13 \mu \mathrm{~m}$ digital CMOS," ISSCC Dig. Tech. Papers, pp. 256-258, Feb 2004.
[86] G. Geelen, E. Paulus, D. Simanjuntak, H. Pastoor and R. Verlinden, "A 90nm CMOS 1.2 V 10 b power and speed programmable pipelined ADC with 0.5pJ/Conversion-Step," ISSCC Dig. Tech. Papers, pp. 782-791, Feb 2006.
[87] M. Boulemnakher, E. Andre, J. Roux and F. Paillardet, "A 1.2V 4.5mW 10b 100MS/s pipeline ADC in a 65nm CMOS," ISSCC Dig. Tech. Papers, pp. 250252, Feb. 2008.
[88] D. Garrity and P. Rakers, "Common-mode output sensing circuit," U.S. Patent No. 5894284, Apr 13, 1999.
[89] D. Hernandez-Garduno and J. Silva-Martinez, "A continuous-time commonmode feedback for high-speed switched-capacitor networks," IEEE J. Solid-State Circuits, vol. 40, no. 8, pp. 1610-1617, Aug 2005.
[90] M. Gustavsson, J. J. Wikner and N. N. Tan, CMOS Data Converters for Communications, Dordrecht, The Netherlands: Kluwer, 2000.
[91] H. L. Fiedler, B. Hoefflinger, W. Demmer and P. Draheim, "A 5-bit building block for 20 MHz A/D converters," IEEE J. Solid-State Circuits, vol. SC-16, no. 3, pp. 151-155, June 1981.
[92] J. Robert, G. C. Temes, V. Valencic, R. Dessoulavy and P. Deval, "A 16-bit lowvoltage CMOS A/D converter," IEEE J. Solid-State Circuits, vol. SC-22, no. 2, pp. 157-163, Apr 1987.
[93] B.-S. Song, S.-H. Lee, and M. F. Tompsett, "A 10-b 15-MHz CMOS recycling two-step A/D converter," IEEE J. Solid-State Circuits, vol. 25, no. 6, pp. 13281338, Dec 1990.
[94] A. Yukawa, "A CMOS 8-bit high-speed A/D converter IC," IEEE J. Solid-State Circuits, vol. SC-20, no. 3, pp. 775-779, June 1985.
[95] S. Sutarja and P. Gray, "A pipelined 13-bit, 250-ks/s, 5-V analog-to-digital converter," IEEE J. Solid-State Circuits, vol. 23, no. 6, pp. 1316-1323, Dec 1988.
[96] G. Yin, F. Eynde, and W. Sansen, "A high-speed CMOS comparator with 8-b resolution," IEEE J. Solid-State Circuits, vol. 27, no. 2, pp. 208-211, Feb 1992.
[97] P. Amaral, J. Goes, N. Paulino and A. Steiger-Garcao, "An improved low-
voltage low-power CMOS comparator to be used in high-speed pipeline ADCs," IEEE ISCAS Proc., vol. 5, pp. 141-144, May 2002.
[98] K. Uyttenhove and M. Steyaert, "A 1.8 V 6-bit 1.3-GHz flash ADC in $0.25 \mu \mathrm{~m}$ CMOS," IEEE J. Solid-State Circuits, vol. 38, no. 7, pp. 1115-1122, July 2003.
[99] T. Kobayashi, K. Nogami, T. Shirotori, and Y. Fujimoto, "A current controlled latch sense amplifier and a static power-saving input buffer for low-power architecture," IEEE J. Solid-State Circuits, vol. 28, no. 4, pp. 523-527, Apr 1993.
[100] T. Cho and P. Gray, "A $10 \mathrm{~b}, 20 \mathrm{Msample} / \mathrm{s}, 35 \mathrm{~mW}$ pipeline A/D converter," IEEE J. Solid-State Circuits, vol. 30, no. 3, pp. 166-172, Mar 1995.
[101] W. Song, H. Choi, S. Kwak, and B. Song, "A 10-b 20-Msample/s low power CMOS ADC," IEEE J. Solid-State Circuits, vol. 30, no. 5, pp. 514-521, May 1995.
[102] M. Waltari and K. A. I. Halonen, "1-V 9-bit pipelined switched-opamp ADC," IEEE J. Solid-State Circuits, vol. 36, no. 1, pp. 129-134, Jan 2001.
[103] M. Bazes, "Two novel fully complementary self-biased CMOS differential amplifiers," IEEE J. Solid-State Circuits, vol. 26, no. 2, pp. 165-168, Feb 1991.

## APPENDIX A

## AMPLIFIER SETTLING PERFORMANCE UNDER SLEWING CONDITIONS

Consider the switched-capacitor amplifier given in Fig. A-1. Under no slewing conditions, $\operatorname{Vout}(t)$ is expressed by (A-1) in $\Phi_{2}$ where the time constant $\tau=1 /(\beta G B W)$ and $\beta=C_{S H} /\left(C_{S H}+C_{P}\right)$. Given sufficient time, the final value of Vout( t ) is given by (A2). However, as the amplifier is limited by its slew rate (SR) when tracking fast transients, the settling no longer follows the exponential behavior.


Fig. A-1. A switched-capacitor amplifier.

$$
\begin{gather*}
\operatorname{Vout}(t)=\operatorname{Vin}\left(1+\frac{1}{\beta A}\right)^{-1}\left(1-e^{-\frac{t}{\tau}}\right)=\alpha \operatorname{Vin}\left(1-e^{-t \beta G B W}\right)  \tag{A-1}\\
\operatorname{Vout}(\infty) \equiv \overline{\operatorname{Vout}}=\alpha \operatorname{Vin} \tag{A-2}
\end{gather*}
$$

An amplifier settling with $S R$ limitation is depicted by Fig. A-2. The settling can be divided into two regions, $S R$-limited and $G B W$-limited, and if given sufficient time,
the final value will be limited by the amplifier DC gain only.


Fig. A-2. Amplifier settling with SR limitation.

The $S R$ is defined as the rate of change of $\operatorname{Vout}(t)$ with respect to time as given by (A-3), where $I_{S R}$ is the maximum current the amplifier can provide to charge/discharge its effective load $C_{\text {Leff. }}$. Since the value given by (A-3) is a linear quantity, $\operatorname{Vout}(t)$ can be expressed by (A-4) in the $S R$-limited portion of the amplifier settling.

$$
\begin{gather*}
S R=\frac{\partial \operatorname{Vout}(t)}{\partial t}=\frac{I_{S R}}{C_{\text {Leff }}}=\frac{I_{S R}}{C_{L}+\beta C_{P}}  \tag{A-3}\\
\operatorname{Vout}(t)=S R \cdot t, \quad t \leq t_{\text {slew }} \tag{A-4}
\end{gather*}
$$

As the amplifier enters the $G B W$-limited region, the remainder of the input step takes an exponential settling behavior and can be described by (A-5).

$$
\begin{align*}
\operatorname{Vout}(t) & =S R \cdot t_{\text {slew }}+\left(\overline{\text { Vout }}-S R \cdot t_{\text {slew }}\right)\left(1-e^{-\frac{\left(t-t_{\text {sew }}\right)}{\tau}}\right)  \tag{A-5}\\
& =\overline{\text { Vout }}-\left(\overline{\text { Vout }}-S R \cdot t_{\text {slew }}\right) e^{-\frac{\left(t-t_{\text {sew }}\right)}{\tau}}, \quad t \geq t_{\text {slew }}
\end{align*}
$$

Note that (A-4) and (A-5) satisfy the boundary value condition between the $S R$ and $G B W$-limited settling regions at $t=t_{\text {slew }}$, but they also need to preserve continuity at the same boundary as described by (A-6). This is evaluated in (A-7), which defines $t_{\text {slew }}$ such that continuity is preserved.

$$
\begin{gather*}
\left.\frac{\partial \operatorname{Vout}(t)_{(A-4)}}{\partial t}\right|_{t=t_{\text {slew }}}=\left.\frac{\partial \operatorname{Vout}(t)_{(A-5)}}{\partial t}\right|_{t=t_{\text {slew }}}  \tag{A-6}\\
S R=\frac{1}{\tau}\left(\overline{\operatorname{Vout}}-S R \cdot t_{\text {slew }}\right) \Rightarrow t_{\text {slew }}=\frac{\overline{\operatorname{Vout}}}{S R}-\tau \tag{A-7}
\end{gather*}
$$

Using (A-2) and (A-7) we can find the maximum input step Vin such that no slewing will occur as shown by (A-8). However, $G B W$ can be expressed by (A-9), where the transconductance of the input differential pair $g m=I_{T} / V_{G S T}, I_{T}$ is the total tail current of the differential pair and $V_{G S T}$ is the overdrive voltage $V_{G S}-V_{T}$. Moreover, for many amplifiers the current $I_{S R}$ is equivalent to the differential pair tail current $I_{T}$. Therefore, the maximum input step Vin for no slewing conditions can be expressed by (A-10).

$$
\begin{equation*}
t_{\text {slew }}=\frac{\alpha V i n}{S R}-\tau \leq 0 \Rightarrow \operatorname{Vin} \leq \frac{\tau \cdot S R}{\alpha}=\frac{I_{S R}}{\alpha \beta G B W C_{L e f f}} \tag{A-8}
\end{equation*}
$$

$$
\begin{align*}
& G B W=\frac{g m}{C_{\text {Leff }}}=\frac{I_{T}}{V_{G S T} C_{\text {Leff }}}  \tag{A-9}\\
& \operatorname{Vin} \leq \frac{I_{S R}}{\alpha \beta G B W C_{L e f f}}=\frac{V_{G S T}}{\alpha \beta} \tag{A-10}
\end{align*}
$$

The result in (A-10) is significant as it demonstrates that amplifier slewing is inevitable in low voltage applications as $V_{G S T}$ is limited to $100-200 \mathrm{mV}$ and $\alpha \beta$ is roughly unity, whereas input steps can be as large as 500 mV , especially in Nyquist-rate ADCs. Hence it is important to consider the settling time needed to attain a given accuracy in the presence of slewing and not solely rely on the $G B W$ calculation given by (A-1).

We can rewrite $\operatorname{Vout}(t)$ as in (A-11) by substituting $t_{\text {slew }}$ from (A-7) in (A-5) and using $\tau=1 /(\beta G B W)$. Then, for a given absolute settling error $\varepsilon_{S}$, we can find the settling time $t_{S}$ as given by (A-12). By subtracting $t_{\text {slew }}$ from (A-12) we get the time spent in the $G B W$-limited region, $t_{l i n}$, given in (A-13).

$$
\begin{align*}
& \operatorname{Vout}(t)=\overline{\operatorname{Vout}}-\frac{S R}{\beta G B W} e^{-\left[1+\beta G B W\left(t-\frac{\overline{\text { out }}}{S R}\right)\right]}, \quad t \geq t_{\text {slew }}  \tag{A-11}\\
& \varepsilon_{S}=\frac{S R}{\beta G B W} e^{-\left[1+\beta G B W\left(t_{s}-\frac{\overline{V o u t}}{S R}\right)\right]} \Rightarrow t_{S}=\frac{\overline{\operatorname{Vout}}}{S R}+\frac{1}{\beta G B W}\left[\ln \left(\frac{S R}{\varepsilon_{S} \beta G B W}\right)-1\right]  \tag{A-12}\\
& t_{\text {lin }}=\frac{1}{\beta G B W} \ln \left(\frac{S R}{\varepsilon_{S} \beta G B W}\right) \tag{A-13}
\end{align*}
$$

On the other hand, and as it is the case with switched-capacitor circuits, the final
settling value is determined at half the clock period $T_{S} / 2$, which is then used to determine the requirements of the $G B W$ and $S R$ of the amplifier according to a desired final error. For this purpose we can express $\operatorname{Vout}\left(T_{S} / 2\right)$ in terms of $T_{S} / 2$ for the different settling regions as shown in (A-14). From the previous discussion, most amplifiers in low voltage applications will fall under the second expression in (A-14).

$$
\operatorname{Vout}\left(\frac{T_{S}}{2}\right)= \begin{cases}S R \frac{T s}{2}, & t_{\text {slew }} \geq \frac{T_{S}}{2}  \tag{A-14}\\ \overline{\operatorname{Vout}}-\frac{S R}{\beta G B W} e^{-\left[1+\beta G B W\left(\frac{T_{S}}{2}-\frac{\overline{\text { Vout }}}{S R}\right)\right]}, & t_{\text {slew }} \leq \frac{T_{S}}{2} \\ \overline{\operatorname{Vout}}\left(1-e^{-\beta G B W \frac{T_{s}}{2}}\right), & t_{\text {slew }} \leq 0\end{cases}
$$

In ADCs the settling error is generally expressed in terms of bit resolutions. So for $N$ bits desired accuracy, the error can be expressed by (A-15), but a direct analytical solution for either $S R$ or $G B W$ is not possible.

$$
\begin{equation*}
\varepsilon_{S}=\frac{S R}{\beta G B W} e^{-\left[1+\beta G B W\left(\frac{T_{S}}{2}-\frac{\overline{\text { Vout }}}{S R}\right)\right]}<\frac{\overline{\text { Vout }}}{2^{N}} \tag{A-15}
\end{equation*}
$$

This difficulty is alleviated, however, by introducing a parameter $\delta$, which represents the portion of $\overline{\text { Vout }}$ that is covered in the $S R$-limited region as depicted in Fig. A-2, and can be used to substitute for $\beta G B W$ as shown in (A-16). Using (A-16) with (A-
15), and after some algebra, an expression for the minimum $S R$ necessary for $N$ bits accuracy is reached, which is given in (A-17). The result of (A-17) can be applied to (A16) to find a similar expression for $G B W$, which is given in (A-18).

$$
\begin{gather*}
t_{\text {slew }}=\frac{\overline{\text { Vout }}}{S R}-\frac{1}{\beta G B W}=\delta \frac{\overline{\text { Vout }}}{S R} \Rightarrow \beta G B W=\frac{S R}{\overline{\operatorname{Vout}(1-\delta)}}  \tag{A-16}\\
S R>\overline{\text { Vout }} \frac{2}{T_{S}}\left[\delta+(1-\delta) \ln \left(2^{N r}(1-\delta)\right)\right]  \tag{A-17}\\
G B W_{\text {w/ slew }}>\frac{2}{\beta T_{S}}\left[\frac{\delta}{(1-\delta)}+\ln \left(2^{N r}(1-\delta)\right)\right] \tag{A-18}
\end{gather*}
$$

If we considered the $G B W$ needed for $N$ bits accuracy in the absence of $S R$ limitation as given by the third expression of (A-14), the result would be (A-19). By normalizing (A-18) with respect to (A-19) we get (A-20).

$$
\begin{gather*}
G B W>\frac{2}{\beta T_{S}} \ln \left(2^{N r}(1-\delta)\right)  \tag{A-19}\\
\frac{G B W_{w / \text { slew }}}{G B W}=\frac{\delta+(1-\delta) \ln \left(2^{N r}(1-\delta)\right)}{(1-\delta) \ln \left(2^{N r}\right)} \tag{A-20}
\end{gather*}
$$

Moreover, by rearranging (A-17) to express the $T_{\varsigma} / 2$ and using the first half of (A-16), we can normalize $t_{\text {slew }}$ with respect to $T_{S} / 2$ as in (A-21).

$$
\begin{equation*}
\frac{t_{\text {slew }}}{T_{s} / 2}=\frac{\delta}{\delta+(1-\delta) \ln \left(2^{N}(1-\delta)\right)} \tag{A-21}
\end{equation*}
$$

Expressions (A-20) and (A-21) are plotted for $N=6,8$, and 10 with respect to $\delta$ in Fig. A-3, to demonstrate the effect of $S R$ on the $G B W$ requirement of the amplifier. For example, if while slewing the amplifier covers 0.7 Vout , and a settling accuracy of 8 bits is required, then $t_{\text {slew }}$ constitutes about $35 \%$ of the settling time, and the $G B W_{w / s l e w}$ needs to be increased by at least $20 \%$ over the original $G B W$ where the amplifier $S R$ was not accounted for.


Fig. A-3. Effect of amplifier $S R$ on $G B W$ requirement.

## APPENDIX B

## MDAC NOISE ANALYSIS WITH ACTIVE SAMPLING

Switches and amplifiers are the fundamental contributors to the noise seen at the output of an MDAC. All noise sources present for both the sampling and holding phases in an $M$ bit/stage MDAC are shown in Fig. B-1. Here, the power spectral density of the amplifier's input referred noise is denoted by $v_{n, A}^{2}$, whereas the subscripts $f b$, and 0 through $m$, where $m=2^{M}-1$, represent the feedback and input switches and capacitors respectively. The forthcoming analysis will aim to find the total noise at the output to see how the switches and amplifier affect the MDAC noise performance.


Fig. B-1. Noise sources present in an M bit/stage MDAC for (a) sampling phase and (b) holding phase.

Beginning with the sampling phase, the integrated noise on each capacitor is examined. Referring to Fig. B-1(a), and using $R_{0} . . R_{m}=R_{s w}$, and $C_{0} . . C_{m}=C_{u}$, the circuit can be described by (B-1)-(B-3). Here, $A(s)$ is the amplifier transfer function and is defined by $\omega_{u} / s$ for simplicity, where $\omega_{u}$ is the amplifier unity gain frequency.

$$
\begin{gather*}
\frac{1}{R_{s w}}\left[v_{x}-\left(v_{n, \text { out }}+v_{n, f b}\right)\right]+\frac{s C_{u}}{1+s R_{s w} C_{u}}\left[\sum_{i=0}^{m}\left(v_{x}-v_{n, i}\right)\right]=0  \tag{B-1}\\
v_{n, \text { out }}=A(s)\left(v_{n, A}-v_{x}\right) \cong \frac{\omega_{u}}{s}\left(v_{n, A}-v_{x}\right)  \tag{B-2}\\
v_{n, C_{0}}=\frac{v_{x}-v_{n, 0}}{\left(1+s R_{s w} C_{u}\right)} \tag{B-3}
\end{gather*}
$$

By substituting (B-1) and (B-2) into (B-3), the noise power stored on $C_{0}$ can be expressed as shown in (B-4), where $H_{l}(s), H_{2}(s), H_{3}(s)$ and $H_{4}(s)$ are given by (B-5), (B6 ), (B-7) and (B-8) respectively.

$$
\begin{gather*}
v_{n, C_{0}}^{2}=\left|H_{1}(s)\right|^{2} \cdot v_{n, 0}^{2}+\left|H_{2}(s)\right|^{2} \cdot v_{n, A}^{2}+\left|H_{3}(s)\right|^{2} \cdot v_{n, f b}^{2}+\left|H_{4}(s)\right|^{2} \cdot \sum_{i=1}^{m} v_{n, i}^{2}  \tag{B-4}\\
H_{1}(s)=-\frac{1+s\left(\frac{1}{\omega_{u}}+R_{s w} C_{u}\right)+s^{2} \frac{R_{s w} C_{u}}{\omega_{u}}(m+1)}{\left(1+s R_{s w} C_{u}\left(1+s\left(\frac{1}{\omega_{u}}+R_{s w} C_{u}\right)+s^{2} \frac{R_{s w} C_{u}}{\omega_{u}}(m+2)\right)\right.}  \tag{B-5}\\
H_{2}(s)=\frac{1}{1+s\left(\frac{1}{\omega_{u}}+R_{s w} C_{u}\right)+s^{2} \frac{R_{s w} C_{u}}{\omega_{u}}(m+2)}  \tag{B-6}\\
H_{3}(s)=\frac{s}{1+s\left(\frac{1}{\omega_{u}}+R_{s w} C_{u}\right)+s^{2} \frac{R_{s w} C_{u}}{\omega_{u}}(m+2)} \tag{B-7}
\end{gather*}
$$

$$
\begin{equation*}
H_{4}(s)=\frac{s^{2} \frac{R_{s w} C_{u}}{\omega_{u}}}{\left(1+s R_{s w} C_{u}\right)\left(1+s\left(\frac{1}{\omega_{u}}+R_{s w} C_{u}\right)+s^{2} \frac{R_{s w} C_{u}}{\omega_{u}}(m+2)\right)} \tag{B-8}
\end{equation*}
$$

Next, the square magnitude of $H_{1}(s), H_{2}(s), H_{3}(s)$ and $H_{4}(s)$ is to be integrated over the frequency range $\{0, \infty\}$ to find the total noise power. A closed form solution for these integrals, however, is not easily achieved due to the complex roots of the second order transfer function in the numerator of (B-5) and denominators of (B-5)-(B-8). To alleviate this difficulty, we can drop the term $(m+1)$ and $(m+2)$ from (B-5)-(B-8) thereby transforming (B-5)-(B-8) to (B-9)-(B-12) respectively.

$$
\begin{gather*}
H_{1}(s)=-\frac{1}{\left(1+s R_{s w} C_{u}\right)}  \tag{B-9}\\
H_{2}(s)=\frac{1}{\left(1+s R_{s w} C_{u}\right)\left(1+\frac{s}{\omega_{u}}\right)}  \tag{B-10}\\
H_{3}(s)=\frac{\frac{s}{\omega_{u}}}{\left(1+s R_{s w} C_{u}\right)\left(1+\frac{s}{\omega_{u}}\right)}  \tag{B-11}\\
H_{4}(s)=\frac{s^{2} \frac{R_{s w} C_{u}}{\omega_{u}}}{\left(1+s R_{s w} C_{u}\right)^{2}\left(1+\frac{s}{\omega_{u}}\right)} \tag{B-12}
\end{gather*}
$$

Naturally, the simplification introduces an error, which is clearly visible in Fig. (B-2), where (B-5)-(B-8) are plotted for $\mathrm{m}=1,2,3$, along with (B-9)-(B-12); the values for $R_{s w}, C_{u}$ and $\omega_{u}$ used in the plots were $100 \Omega, 1 \mathrm{pF}$ and $5 \mathrm{Grad} / \mathrm{s}$ respectively.


Fig. B-2. The magnitude plots of the original and simplified transfer function; (a) $H_{l}(s)$, (b) $H_{2}(s)$, (c) $H_{3}(s)$ and (d) $H_{4}(s)$.

While, this shows that (B-9)-(B-12) will yield an over estimate for the integrated noise, the resulting closed form integrals make for a more useable hand calculations tool
that inherently leaves a comfortable margin for design error. The final integral values are now given in (B-13)-(B-16).

$$
\begin{gather*}
\int_{0}^{\infty}\left|H_{1}(s)\right|^{2} \cdot d f=\frac{1}{4 R_{s w} C_{u}}  \tag{B-13}\\
\int_{0}^{\infty}\left|H_{2}(s)\right|^{2} \cdot d f=\frac{1}{4 R_{s w} C_{u}\left(1+\frac{1}{\omega_{u} R_{s w} C_{u}}\right)}  \tag{B-14}\\
\int_{0}^{\infty}\left|H_{3}(s)\right|^{2} \cdot d f=\frac{1}{4 R_{s w} C_{u}\left(1+\omega_{u} R_{s w} C_{u}\right)}  \tag{B-15}\\
\int_{0}^{\infty}\left|H_{4}(s)\right|^{2} \cdot d f=\frac{1}{8 R_{s w} C_{u}\left(1+\frac{\omega_{u}^{2} R_{s w}^{2} C_{u}^{2}}{1+\omega_{u} R_{s w} C_{u}}\right)} \tag{B-16}
\end{gather*}
$$

The computed integrals of (B-13)-(B-16) can now be substituted in (B-4) to get the total noise power sampled on $C_{0}$, which is given by ( $\mathrm{B}-17$ ).

$$
\begin{align*}
v_{n, C_{0}}^{2}= & \frac{v_{n, 0}^{2}}{4 R_{s w} C_{u}}+\frac{v_{n, A}^{2}}{4 R_{s w} C_{u}\left(1+\frac{1}{\omega_{u} R_{s w} C_{u}}\right)}+\frac{v_{n, f b}^{2}}{4 R_{s w} C_{u}\left(1+\omega_{u} R_{s w} C_{u}\right)} \\
& +\frac{\sum_{i=1}^{m} v_{n, i}}{4 R_{s w} C_{u}\left(1+\frac{\omega_{u}^{2} R_{s w}^{2} C_{u}^{2}}{1+\omega_{u} R_{s w} C_{u}}\right)} \tag{B-17}
\end{align*}
$$

The thermal noise power spectral density of a resistor is defined by (B-18) [44],
while the input referred noise of an amplifier can be modeled by (B-19) [49], where $g m$ is the input pair transconductance, $\gamma$ is a process dependent excess noise co-efficient and $\eta$ is the amplifier architecture noise factor.

$$
\begin{gather*}
v_{n, s w}^{2}=4 k_{B} T R_{s w} \cdot \Delta f  \tag{B-18}\\
v_{n, A}^{2}=\frac{4 k_{B} T \gamma(1+\eta)}{g m} \cdot \Delta f \tag{B-19}
\end{gather*}
$$

Using (B-18) and (B-19) in (B-17) yields the final form of $v_{n, c_{0}}^{2}$, which is the same for any $v_{n, C_{i}}^{2}$ as the characteristic equations (B-1)-(B-3) apply to any capacitor in Fig. B-1(a). The final result is given in (B-20). During the holding phase, Fig. B-1(b), all the noise power stored on the capacitors $C_{0} . . C_{m}$ is transferred to the now feedback capacitor $C_{0}$. Therefore the sampling phase component of the output noise can be expressed by (B-21).

$$
\begin{gather*}
\left.v_{n, C_{i}}^{2}\right|_{\Phi_{S}}=\frac{k_{B} T}{C_{u}}\left[1+\frac{1}{1+\omega_{u} R_{s w} C_{u}}+\frac{m}{2\left(1+\frac{\omega_{u}^{2} R_{s w}^{2} C_{u}^{2}}{1+\omega_{u} R_{s w} C_{u}}\right)}+\frac{\gamma(1+\eta)}{g m R_{s w}\left(1+\frac{1}{\omega_{u} R_{s w} C_{u}}\right)}\right]  \tag{B-20}\\
v_{n, \text { out }\left.\right|_{\Phi_{S}}}^{2}=\frac{Q_{n, \text { tot }}^{2}}{C_{0}^{2}}=\left.\sum_{0}^{m} v_{n, C_{i}}^{2}\right|_{\Phi_{S}}=\left.(m+1) \cdot v_{n, C_{i}}^{2}\right|_{\Phi_{S}} \tag{B-21}
\end{gather*}
$$

As for the holding phase component of the output noise, it can be evaluated using

Fig. B-1(b), which is characterized by (B-22) and (B-23).

$$
\begin{align*}
& v_{x}-\left(v_{n, \text { out }}+v_{n, f b}\right)+\sum_{i=0}^{m}\left(v_{x}-v_{n, i}\right)=0  \tag{B-22}\\
& v_{n, \text { out }}=A(s)\left(v_{n, A}-v_{x}\right) \cong \frac{\omega_{u}}{s}\left(v_{n, A}-v_{x}\right) \tag{B-23}
\end{align*}
$$

Using (B-22) and (B-23) to eliminate $v_{x}$, the output noise power in the holding phase, $\left.v_{n, \text { out }}^{2}\right|_{\Phi_{H}}$, can be expressed by (B-24), where $H_{5}(s)$ is given by (B-25).

$$
\begin{gather*}
\left.v_{n, \text { out }}^{2}\right|_{\boldsymbol{\Phi}_{H}}=\left|H_{5}(s)\right|^{2} \cdot\left[(m+1) v_{n, A}^{2}+\sum_{i=1}^{m} v_{n, i}^{2}\right]  \tag{B-24}\\
H_{5}(s)=\frac{1}{1+\frac{s(m+1)}{\omega_{u}}} \tag{B-25}
\end{gather*}
$$

For the final form of the holding phase output noise, the square magnitude of $H_{5}(s)$ is integrated over the frequency range $\{0, \infty\}$, and (B-18) and (B-19) are substituted for $v_{n, i}^{2}$ and $v_{n, A}^{2}$ respectively. This results in (B-26), where $\omega_{u}$ is substituted with $g m / C_{L}$, and $C_{L}$ being the effective loading capacitance.

$$
\begin{equation*}
\left.v_{n, o u t}^{2}\right|_{\Phi_{H}}=\frac{k_{B} T}{C_{L}}\left[g m R_{s w}+\gamma(1+\eta)(m+1)\right] \tag{B-26}
\end{equation*}
$$

The total noise seen at the output of the MDAC is the summation of (B-21) and (B-26), and is given in (B-27) where $\alpha_{n}$ is described by (B-28). Note that (B-21) is doubled for a fully differential implementation as well as the effect of the switch resistance in (B-26). Also, it is common practice to refer this noise to the input of the pipeline cell for a meaningful comparison with the input signal. This is accomplished by simply normalizing the output noise to the square of the MDAC gain $(m+1)$, and is demonstrated by (B-29).

$$
\begin{gather*}
\left.v_{n, \text { out }}^{2}\right|_{M D A C}=\frac{k_{B} T}{C_{L}}\left[2 g m R_{s w}+\gamma(1+\eta)(m+1)\right]+2(m+1) \frac{k_{B} T}{C_{u}}\left(1+\alpha_{n}\right)  \tag{B-27}\\
\alpha_{n}=\frac{1}{1+\omega_{u} R_{s w} C_{u}}+\frac{m}{2\left(1+\frac{\omega_{u}^{2} R_{s w}^{2} C_{u}^{2}}{1+\omega_{u} R_{s w} C_{u}}\right)}+\frac{\gamma(1+\eta)}{g m R_{s w}\left(1+\frac{1}{\omega_{u} R_{s w} C_{u}}\right)}  \tag{B-28}\\
\left.v_{n, \text { in }}^{2}\right|_{M D A C}=\frac{k_{B} T}{(m+1) C_{L}}\left[\gamma(1+\eta)+\frac{2 g m R_{s w}}{(m+1)}\right]+2 \frac{k_{B} T}{(m+1) C_{u}}\left(1+\alpha_{n}\right) \tag{B-29}
\end{gather*}
$$

In conclusion, the noise contributions of the switches and amplifier to the MDAC noise are limited by the unit capacitance, $C_{u}$, used in the MDAC and the effective loading capacitance, $C_{L}$, seen by the amplifier. Moreover, the use of multi-bit/stage pipeline cells has an added filtering effect on the total noise. However, this added benefit of lower noise is a direct trade-off with the speed of the pipeline ADC.

## APPENDIX C

## OPERATIONAL TRANSCONDUCTANCE AMPLIFIER PARAMETERS

Referring to the OTAs in Fig. 46, the following is a summary of key performance metrics expressions, based on the following assumptions: all amplifiers are driving a capacitive load $C_{L}$, cascode devices are treated as source degenerated devices for noise and offset calculations and hence are neglected due to their small transconductance, and finally $M$ denotes the transistor aspect ratio $W / L$.
A. Telescopic OTA (TL)

$$
\begin{gather*}
G m_{T L}=g m_{1}  \tag{C-1}\\
A(0)=G m_{T L} R_{O U T} \cong g m_{1} \cdot\left(g m_{3} r_{d s 1} r_{d s 3} \| g m_{5} r_{d s 5} r_{d s 7}\right)  \tag{C-2}\\
P M=180-\tan ^{-1}\left(\frac{G B W}{\omega_{P 1}}\right)-\tan ^{-1}\left(\frac{G B W}{\omega_{P 2}}\right)  \tag{C-3}\\
G B W=\frac{g m_{1}}{C_{L}}, \quad \omega_{P 1}=\frac{1}{R_{\text {OUT }} C_{L}}, \quad \omega_{P 2} \cong \frac{g m_{3}}{C_{g s 3}}  \tag{C-4}\\
S R=\frac{I_{T}}{C_{L}}  \tag{C-5}\\
\eta=\frac{G m_{T L}}{I_{\text {total }}}=\frac{g m_{1}}{I_{T}}  \tag{C-6}\\
\overline{v_{i T}^{2}}=\frac{8 k_{\mathrm{B}} T \gamma}{g m_{1}}\left(1+\frac{g m_{3}}{g m_{1}}\right) \cdot \Delta f=\frac{8 k_{\mathrm{B}} T \gamma}{\sqrt{2 \mu_{P} C_{o x} I_{D 1} M_{1}}}\left(1+\sqrt{\frac{\mu_{N}}{\mu_{P}} \frac{M_{3}}{M_{1}}}\right) \cdot \Delta f \tag{C-7}
\end{gather*}
$$

$$
\begin{gather*}
\overline{v_{i y}^{2}}=\frac{2 I_{D}}{g m_{1}^{2} C_{o x} f}\left(\frac{K_{F P}}{L_{1}^{2}}+\frac{K_{F N}}{L_{7}^{2}}\right) \cdot \Delta f=\frac{K_{F P}}{\mu_{P} W_{1} L_{1} C_{o x}^{2} f}\left[1+\frac{K_{F N}}{K_{F P}}\left(\frac{L_{1}}{L_{7}}\right)^{2}\right] \cdot \Delta f  \tag{C-8}\\
\left.\left.\overline{v_{i}^{2}}\right|_{f / f}\right|_{\min } \Rightarrow L_{1}=L_{7} \sqrt{\frac{K_{F P}}{K_{F N}}}  \tag{C-9}\\
\sigma^{2}\left(V_{O S}\right)=2 \frac{A_{V_{T P}}^{2}}{W_{1} L_{1}}\left[1+\frac{\mu_{N}}{\mu_{P}}\left(\frac{A_{V_{T N}}}{A_{V_{T P}}}\right)^{2}\left(\frac{L_{1}}{L_{7}}\right)^{2}\right]  \tag{C-10}\\
\left.\sigma^{2}\left(V_{O S}\right)\right|_{\min } \Rightarrow L_{1}=L_{3} \sqrt{\frac{\mu_{P}}{\mu_{N}}} \cdot \frac{A_{V_{T P}}}{A_{V_{T N}}} \tag{C-11}
\end{gather*}
$$

## B. Current Mirror OTA (CM)

$$
\begin{gather*}
G m_{C M}=K \cdot g m_{1}  \tag{C-12}\\
A(0)=G m_{C M} R_{O U T} \cong K \cdot g m_{1} \cdot\left(g m_{7} r_{d s 7} r_{d s 5} \| g m_{9} r_{d s 9} r_{d s 11}\right)  \tag{C-13}\\
P M=180-\tan ^{-1}\left(\frac{G B W}{\omega_{P 1}}\right)-\tan ^{-1}\left(\frac{G B W}{\omega_{P 2}}\right)-\tan ^{-1}\left(\frac{G B W}{\omega_{P 3}}\right)  \tag{C-14}\\
G B W=\frac{K \cdot g m_{1}}{C_{L}}, \quad \omega_{P 1}=\frac{1}{R_{O U T} C_{L}}, \quad \omega_{P 2} \cong \frac{g m_{7}}{C_{g s 7}}, \quad \omega_{P 2} \cong \frac{g m_{3}}{C_{g s 3}(1+K)}  \tag{C-15}\\
S R=\frac{K \cdot I_{T}}{C_{L}}  \tag{C-16}\\
\eta=\frac{G m_{C M}}{I_{\text {total }}}=\frac{K \cdot g m_{1}}{(K+1) I_{T}}  \tag{C-17}\\
\overline{v_{i T}^{2}}=\frac{8 k_{\mathrm{B}} T \gamma}{g m_{1}}\left(1+\frac{g m_{3}}{g m_{1}}+\frac{g m_{5}}{K^{2} g m_{1}}+\frac{g m_{11}}{K^{2} g m_{1}}\right) \cdot \Delta f \\
=\frac{8 k_{\mathrm{B}} T \gamma}{\sqrt{2 \mu_{P} C_{o x} I_{D 1} M_{1}}}\left(1+\sqrt{\frac{\mu_{N}}{\mu_{P}} \frac{M_{3}}{M_{1}}}+\frac{1}{K^{2}} \sqrt{\frac{\mu_{N}}{\mu_{P}} \frac{M_{5}}{M_{1}}}+\frac{1}{K^{2}} \sqrt{\frac{M_{11}}{M_{1}}}\right) \cdot \Delta f \tag{C-18}
\end{gather*}
$$

$$
\begin{align*}
& \overline{v_{i}^{2}} y_{f}^{2}=\frac{2 I_{D}}{g m_{1}^{2} C_{o x} f}\left(K^{2} \frac{K_{F P}}{L_{1}^{2}}+K^{2} \frac{K_{F N}}{L_{3}^{2}}+K \frac{K_{F N}}{L_{5}^{2}}+K \frac{K_{F P}}{L_{11}^{2}}\right) \cdot \Delta f \\
&=\frac{K_{F P}}{\mu_{P} W_{1} L_{1} C_{o x}^{2} f}\left[1+\frac{K_{F N}}{K_{F P}}\left(\frac{L_{1}}{L_{3}}\right)^{2}+\frac{1}{K} \frac{K_{F N}}{K_{F P}}\left(\frac{L_{1}}{L_{5}}\right)^{2}+\frac{1}{K}\left(\frac{L_{1}}{L_{11}}\right)^{2}\right] \cdot \Delta f  \tag{C-19}\\
&\left.\overline{v_{i 1 / f}^{2}}\right|_{\min } \Rightarrow L_{1}=\left[\frac{K_{F N}}{K_{F P}}\left(\frac{1}{L_{3}}\right)^{2}+\frac{1}{K} \frac{K_{F N}}{K_{F P}}\left(\frac{1}{L_{5}}\right)^{2}+\frac{1}{K}\left(\frac{1}{L_{11}}\right)^{2}\right]^{-1 / 2}  \tag{C-20}\\
& \sigma^{2}\left(V_{O S}\right)= 2 \frac{A_{V_{T P}}^{2}}{W_{1} L_{1}}\left[1+\frac{\mu_{N}}{\mu_{P}}\left(\frac{A_{V_{T N}}}{A_{V_{T P}}}\right)^{2}\left(\frac{L_{1}}{L_{3}}\right)^{2}+\frac{1}{K} \frac{\mu_{N}}{\mu_{P}}\left(\frac{\left.\left.A_{V_{T V}}^{A_{V_{T P}}}\right)^{2}\left(\frac{L_{1}}{L_{5}}\right)^{2}+\frac{1}{K}\left(\frac{L_{1}}{L_{11}}\right)^{2}\right]}{\left.\sigma^{2}\left(V_{O S}\right)\right|_{\min }} \Rightarrow L_{1}=\left[\frac{\mu_{N}}{\mu_{P}}\left(\frac{A_{V_{T N}}}{A_{V_{T P}}}\right)^{2}\left(\frac{1}{L_{3}}\right)^{2}+\frac{1}{K} \frac{\mu_{N}}{\mu_{P}}\left(\frac{A_{V_{T N}}}{A_{V_{T P}}}\right)^{2}\left(\frac{1}{L_{5}}\right)^{2}+\frac{1}{K}\left(\frac{1}{L_{11}}\right)^{2}\right]^{-1 / 2}\right.\right. \tag{C-21}
\end{align*}
$$

C. Folded Cascode OTA (FC)

$$
\begin{gather*}
G m_{F C}=g m_{1}  \tag{C-23}\\
A(0)=G m_{F C} R_{O U T} \cong g m_{1} \cdot\left(g m_{5} r_{d s 5}\left(r_{d s 1} \mid \| r_{d s 3}\right) \| g m_{7} r_{d s 7} r_{d s 9}\right)  \tag{C-24}\\
P M=180-\tan ^{-1}\left(\frac{G B W}{\omega_{P 1}}\right)-\tan ^{-1}\left(\frac{G B W}{\omega_{P 2}}\right)  \tag{C-25}\\
G B W=\frac{K \cdot g m_{1}}{C_{L}}, \quad \omega_{P 1}=\frac{1}{R_{O U T} C_{L}}, \quad \omega_{P 2} \cong \frac{g m_{5}}{C_{g s 5}}  \tag{C-26}\\
S R=\frac{I_{T}}{C_{L}}  \tag{C-27}\\
\eta=\frac{G m_{F C}}{I_{\text {total }}}=\frac{g m_{1}}{2 I_{T}} \tag{C-28}
\end{gather*}
$$

$$
\begin{align*}
& \overline{v_{i T}^{2}}=\frac{8 k_{\mathrm{B}} T \gamma}{g m_{1}}\left(1+\frac{g m_{3}}{g m_{1}}+\frac{g m_{9}}{g m_{1}}\right) \cdot \Delta f=\frac{8 k_{\mathrm{B}} T \gamma}{\sqrt{2 \mu_{P} C_{o x} I_{D 1} M_{1}}}\left(1+\sqrt{2 \frac{\mu_{N}}{\mu_{P}} \frac{M_{3}}{M_{1}}}+\sqrt{\frac{M_{9}}{M_{1}}}\right) \cdot \Delta f  \tag{C-29}\\
& \overline{v_{i \not / f}^{2}}=\frac{2 I_{D}}{g m_{1}^{2} C_{o x} f}\left(\frac{K_{F P}}{L_{1}^{2}}+\frac{2 K_{F N}}{L_{3}^{2}}+\frac{K_{F N}}{L_{9}^{2}}\right) \cdot \Delta f \\
& =\frac{K_{F P}}{\mu_{P} W_{1} L_{1} C_{o x}^{2} f}\left[1+2 \frac{K_{F N}}{K_{F P}}\left(\frac{L_{1}}{L_{3}}\right)^{2}+\left(\frac{L_{1}}{L_{9}}\right)^{2}\right] \cdot \Delta f  \tag{C-30}\\
& \left.\overline{v_{i \nmid f}^{2}}\right|_{\text {min }} \Rightarrow L_{1}=\left[2 \frac{K_{F N}}{K_{F P}}\left(\frac{1}{L_{3}}\right)^{2}+\left(\frac{1}{L_{9}}\right)^{2}\right]^{-1 / 2}  \tag{C-31}\\
& \sigma^{2}\left(V_{O S}\right)=2 \frac{A_{V_{T P}}^{2}}{W_{1} L_{1}}\left[1+2 \frac{\mu_{N}}{\mu_{P}}\left(\frac{A_{V_{T X}}}{A_{V_{T P}}}\right)^{2}\left(\frac{L_{1}}{L_{3}}\right)^{2}+\left(\frac{L_{1}}{L_{9}}\right)^{2}\right]  \tag{C-32}\\
& \left.\sigma^{2}\left(V_{O S}\right)\right|_{\text {min }} \Rightarrow L_{1}=\left[2 \frac{\mu_{N}}{\mu_{P}}\left(\frac{A_{V_{T N}}}{A_{V_{T P}}}\right)^{2}\left(\frac{1}{L_{3}}\right)^{2}+\left(\frac{1}{L_{9}}\right)^{2}\right]^{-1 / 2} \tag{C-33}
\end{align*}
$$

## VITA

Rida Shawky Assaad received his Bachelor of Science degree in electrical engineering from Texas A\&M University in May 2002. He joined the Analog and Mixed Signal Center at Texas A\&M University in August 2002 and received his Ph.D. degree in electrical engineering in December 2009. During 2005 he was an assistant lecturer at the Electrical and Computer Engineering Department where he taught the introductory course to electronics. He was an intern IC designer with Mobile Integrated Solutions at Texas Instruments, Dallas, in the fall of 2007 and 2008. His research interests include the design and optimization of base-band amplifiers and high resolution analog-to-digital converters.

Mr. Assaad can be reached at the Analog and Mixed Signal Center, Weisenbaker Engineering Research Center, Texas A\&M University, College Station, TX 77843-3128. His email address is: rida@tamu.edu.


[^0]:    This dissertation follows the style of IEEE Journal of Solid-State Circuits.

[^1]:    * Part of the data reported in this chapter is reprinted with permission from "The recycling folded cascode: a general enhancement of the folded cascode amplifier," by R. S. Assaad and J. Silva-Martinez, 2009. Journal of Solid-State Circuits, vol. 44, no. 9, pp. 2535-2542, Copyright 2009 by IEEE. This material is posted here with permission of the IEEE. Such permission of the IEEE does not in any way imply IEEE endorsement of any of Texas A\&M University's products or services. Internal or personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution must be obtained from the IEEE by writing to pubs-permissions@ieee.org. By choosing to view this material, you agree to all provisions of the copyright laws protecting it.

[^2]:    * Part of the data reported in this chapter is reprinted with permission from "Optimization of direct autozeroing offset cancellation in low voltage applications using dual level CMFB," by R. Assaad and J. SilvaMartinez, 2009. Electronics Letters, vol. 45, no. 16, pp. 809-811, Copyright 2009 by IET.

[^3]:    * Simulation Results

