## KEY FRONT-END CIRCUITS IN MILLIMETER-WAVE SILICON-BASED WIRELESS TRANSMITTERS FOR PHASED-ARRAY APPLICATIONS

A Dissertation Presented to The Academic Faculty

by

Tso-Wei Li

In Partial Fulfillment
of the Requirements for the Degree
Doctor of Philosophy in the
School of Electrical and Computer Engineering

Georgia Institute of Technology December 2020

COPYRIGHT© 2020 BY TSO-WEI LI

## KEY FRONT-END CIRCUITS IN MILLIMETER-WAVE SILICON-BASED WIRELESS TRANSMITTERS FOR PHASED-ARRAY APPLICATIONS

#### Approved by:

Dr. Hua Wang, Advisor School of Electrical and Computer Engineering Georgia Institute of Technology

Dr. Gee-Kung Chang School of Electrical and Computer Engineering Georgia Institute of Technology

Dr. Farrokh Ayazi School of Electrical and Computer Engineering Georgia Institute of Technology Dr. John D Cressler School of Electrical and Computer Engineering Georgia Institute of Technology

Dr. Hossein Miri Lavasani Department of Electrical, Computer and Systems Engineering Case Western Reserve University

Date Approved: December 3, 2020



## **ACKNOWLEDGEMENTS**

I want to thank my advisor, Dr. Hua Wang, for providing feedback and brainstorm throughout my works in my Ph.D. career. It is also my pleasure to join one of the best labs in the RF/millimeter-wave integrated circuit field in the world, including cutting-edge equipment and intelligent people. I truly appreciate my family for their unlimited support; I cannot obtain my Ph.D. degree without them.

# TABLE OF CONTENTS

| ACKNOWLEDGEMENTS                                                   | iv        |
|--------------------------------------------------------------------|-----------|
| LIST OF TABLES                                                     | vii       |
| LIST OF FIGURES                                                    | viii      |
| SUMMARY                                                            | XIV       |
| CHAPTER 1. INTRODUCTION                                            | 1         |
| CHAPTER 2. HARMONICALLY-TUNED PA OPERATION                         | 7         |
| 2.1 Introduction                                                   | 7         |
| 2.1.1 Conventional Class-F PA operation                            | 7         |
| 2.1.2 Conventional Class-F-1 PA operation                          | 9         |
| 2.1.3 Continuous-mode Class-F and Class-F-1 PA                     | 12        |
| 2.1.4 Proposed Continuous-mode PA Output Network                   | 15        |
| 2.2.1 Continuous-Mode Class-F-1 PA Output Network                  | 15        |
| 2.2.2 Continuous-mode Hybrid Class-F/F-1 PA Output Network         | 20        |
| 2.1.5 Continuous-Mode Harmonically-Tuned PA Implementation         | 25        |
| 2.3.1 Design 1: A Two-Stage Continuous-mode Class-F-1 PA           | 25        |
| 2.3.2 Design 2: A One-Stage Continuous-mode Class-F/F-1 PA         | 25        |
| 2.3.3 Design 3: A Two-Stage Continuous-mode Class-F/F-1 PA         | 26        |
| 2.1.6 Measurement Results                                          | 28        |
| 2.1.7 Chapter Summary                                              | 40        |
| CHAPTER 3. A POWER AMPLIFIER WITH NMOS/PMOS NONLINEAITY            |           |
| CANCELLATION SCHEME                                                | 41        |
| 3.1 Introduction                                                   | 41        |
| 3.2 MOSFET AM-AM and AM-PM Distortion Analysis                     | 44        |
| 3.3 Proposed NMOS/PMOS AM-PM Nonlinearity Cancellation Scheme      | 51        |
| 3.3.1 Concept of doubly NMOS/PMOS nonlinearity cancellation scheme | 51        |
| 3.3.2 PA output network NMOS/PMOS Linearity Improvement Scheme     | 53        |
| 3.3.3 Series-Parallel DAT Output Power Combiner                    | 53        |
|                                                                    | <b>56</b> |
| 3.4 PA Implementation 3.5 Measurement Results                      | 58        |
| 3.6 Chapter Summary                                                | 68        |
| 5.0 Chapter Summary                                                | UO        |
| CHAPTER 4. A MILLIMETER-WAVE FULLY INTEGRATED PASSIVE              |           |
| REFLECTION-TYPE PHASE SHIFTER WITH                                 |           |
| TRANSFORMER-BASED MULTI-RESONANCE LOADS FO                         | R         |
| 360° PHASE SHIFTING                                                | 69        |
| 4.1 Introduction                                                   | 69        |
| 4.2 Reported RTPS Topologies                                       | 69        |
| 4.3 RTPS Design Challenge                                          | 71        |

| 4.4 I        | Reflective Load Designs in RTPS                            | 71  |
|--------------|------------------------------------------------------------|-----|
| 4.4.1        | Capacitive load (CL)                                       | 73  |
| 4.4.2        | Series L-C resonant load (SLC)                             | 75  |
| 4.4.3        | Parallel L-C Resonant Load (PLC)                           | 77  |
| 4.4.4        | CLC $\pi$ -resonator load with one tunable capacitor       | 79  |
| 4.4.5        | CLC $\pi$ -resonator load with two tunable capacitors      | 82  |
| 4.4.6        | Transformer-based multi-resonance reflective load          | 87  |
| <b>4.5</b> T | <b>Transformer-based 90° Coupler Design</b>                | 93  |
| 4.6          | Circuit Implementation and Simulation of the Proposed RTPS | 96  |
| 4.7 N        | Aeasurement Results                                        | 101 |
| 4.8          | Chapter Summary                                            | 109 |
| <b>4.9</b> A | appendix                                                   | 109 |
| 4.9.1        | Capacitive load (CL)                                       | 110 |
| 4.9.2        | Series L-C resonant load (SLC)                             | 112 |
| 4.9.3        | Parallel L-C resonant load (PLC)                           | 113 |
| 4.9.4        | CLC $\pi$ -resonant load with one-side capacitive tuning)  | 116 |
| 4.9.5        | CLC $\pi$ -resonant load with two-side capacitive tuning   | 116 |
| 4.9.6        | Transformer-based multi-resonance reflective load          | 118 |
| REFER        | ENCES                                                      | 122 |

# LIST OF TABLES

| Table 2.1 | – PA Load Impedance Behaviors for Different Operations                                                                                          | 14  |
|-----------|-------------------------------------------------------------------------------------------------------------------------------------------------|-----|
| Table 2.2 | - CW Performance Comparison with State-of-the art Silicon-based Mm-Wave Power Amplifier at Related Frequency                                    | 39  |
| Table 2.3 | <ul> <li>Modulation Performance Comparison With State-of-the art</li> <li>Silicon-based Mm-Wave Power Amplifier at Related Frequency</li> </ul> | 40  |
| Table 3.1 | <ul> <li>CW Performance Comparison with The State-Of-The-Art<br/>Silicon-Based Mm-Wav E PA at Related Frequency</li> </ul>                      | 67  |
| Table 3.2 | <ul> <li>Modulation Comparison with The State-Of-The-Art Silicon-<br/>Based Mm-Wave PA at Related Frequency</li> </ul>                          | 68  |
| Table 4.1 | <ul> <li>Comparison table of maximum phase shift for different reflective<br/>loads in an RTPS</li> </ul>                                       | 93  |
| Table 4.2 | Performance comparison of reported fully integrated RTPS designs in silicon                                                                     | 108 |

## LIST OF FIGURES

| Figure 1.1 | <ul> <li>PAE comparison with state-of-the-art mm-Wave silicon-based PAs in (a) SiGe and (b) CMOS process (20-50 GHz) [26].</li> </ul>                                                                                                                                                                                                                                                                                                                                                                                                      |    |  |  |
|------------|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|----|--|--|
| Figure 1.2 | <ul> <li>Average Efficiency vs Average Output Power for SiGe and CMOS</li> <li>PAs in modulation with 64QAM (20-50 GHz) [26].</li> </ul>                                                                                                                                                                                                                                                                                                                                                                                                   | 6  |  |  |
| Figure 2.1 | <ul> <li>Theoretical current and voltage waveforms composed of the<br/>fundamental, 2nd- and 3rd-harmonics for (a) conventional/continuous-<br/>mode Class-F and (b) conventional/continuous-mode Class-F-1<br/>operations.</li> </ul>                                                                                                                                                                                                                                                                                                     | 8  |  |  |
| Figure 2.2 | <ul> <li>Fundamental, 2nd- and 3rd-harmonic load impedances for (a) continuous-mode Class-F and (b) continuous-mode Class-F-1 PA operations.</li> </ul>                                                                                                                                                                                                                                                                                                                                                                                    | 11 |  |  |
| Figure 2.3 | – EM and schematic of (a) continuous-mode Class-F-1 PA and (b) continuous-mode hybrid Class-F/-F-1 PA.                                                                                                                                                                                                                                                                                                                                                                                                                                     | 17 |  |  |
| Figure 2.4 | – Simplified of (a) differential-mode and (b) common-mode half circuits of the continuous-mode Class-F-1 PA output network at (c) the fundamental frequency (d) the 3rd-harmonic of lower band, and (e) the 2nd-harmonic frequencies, respectively.                                                                                                                                                                                                                                                                                        | 18 |  |  |
| Figure 2.5 | – Trajectories of the half-circuit load impedance at fundamental, 2nd-and 3rd-order harmonic frequencies (characteristic impedance Z0 = $50\Omega$ ).                                                                                                                                                                                                                                                                                                                                                                                      | 19 |  |  |
| Figure 2.6 | – Simplified half circuits of the (a) differential-mode, (b) common-mode of the continuous-mode hybrid Class-F/F-1 PA output network at (c) the fundamental frequency ( $\omega L \le \omega \le \omega H$ ) and (d) the fundamental equivalent circuit, (e) the 3rd-order harmonic of lower band ( $\omega = 3\omega L$ ), (f) the 3rd-order harmonic of higher band ( $\omega = 3\omega H$ ), (g) the 2nd-order harmonic of lower band ( $\omega = 2\omega L$ ), and (h) the 2nd-order harmonic of higher band ( $\omega = 2\omega H$ ). | 23 |  |  |
| Figure 2.7 | – (a) Trajectories of half-circuit load impedance at fundamental, 2nd-and 3rd-order harmonic frequencies (Z0 =50 $\Omega$ ), and (b) impedance response.                                                                                                                                                                                                                                                                                                                                                                                   | 24 |  |  |
| Figure 2.8 | – Schematic of (a) two-stage continuous-mode Class-F-1 PA, (b) one-and (c) two-stage continuous-mode hybrid Class-F/F-1 PAs.                                                                                                                                                                                                                                                                                                                                                                                                               | 27 |  |  |
| Figure 2.9 | - Chin microphotograph of two-stage continuous-mode Class-F-1 PA                                                                                                                                                                                                                                                                                                                                                                                                                                                                           | 29 |  |  |

| Figure 2.10 | - (a) measured CW large-signal performance at 28.5 GHz, (b) measured CW large-signal performance vs. frequency and (c) measured small-signal S-parameter of two-stage continuous-mode Class-F-1 PA. |    |  |  |  |  |
|-------------|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|----|--|--|--|--|
| Figure 2.11 | - (a) 64-QAM constellation, (b) 256-QAM constellation, (c) 64-QAM spectrum and (d) 256-QAM spectrum at fcarrier of 28.5 GHz of Design 3.                                                            |    |  |  |  |  |
| Figure 2.12 | <ul> <li>Chip microphotograph of one-stage continuous-mode hybrid Class-F/F-1 PA.</li> </ul>                                                                                                        |    |  |  |  |  |
| Figure 2.13 | One-stage continuous-mode hybrid Class-F/F-1 PA CW large-signal measurement at (a) 28 GHz, (b) 37 GHz and (c) 39 GHz, (d) Psat/PAE vs. frequency, and (e) small-signal S-parameter.                 |    |  |  |  |  |
| Figure 2.14 | <ul> <li>Chip microphotograph of two-stage continuous-mode hybrid Class-F/F-1 PA.</li> </ul>                                                                                                        | 36 |  |  |  |  |
| Figure 2.15 | - Two-stage continuous-mode hybrid Class-F/F-1 PA CW large-signal measurement at (a) 28 GHz, (b) 37 GHz and (c) 39 GHz, (d) Psat/PAE vs. frequency, and (e) small-signal S-parameter.               | 37 |  |  |  |  |
| Figure 2.16 | <ul> <li>Two-stage continuous-mode hybrid Class-F/F-1 PA modulation<br/>measurement results (0.5 GSym/s) at (a) 24 GHz, (b) 28 GHz, (c) 37<br/>GHz, (d) 39 GHz respectively.</li> </ul>             | 38 |  |  |  |  |
| Figure 3.1  | – A neutralized PA and its simplified single-ended half circuit.                                                                                                                                    | 49 |  |  |  |  |
| Figure 3.2  | – The extracted gate-to-source Cgs, gate-to-drain Cgd and drain-to-source <i>Cds</i> , and transconductance gm and drain-to-source conductance <i>gds</i> in (a) NMOS and (b) PMOS transistor.      | 50 |  |  |  |  |
| Figure 3.3  | - The AM-PM behaviors of NMOS and PMOS PAs.                                                                                                                                                         | 51 |  |  |  |  |
| Figure 3.4  | <ul> <li>The concept of the proposed doubly nonlinearity cancellation<br/>scheme.</li> </ul>                                                                                                        | 52 |  |  |  |  |
| Figure 3.5  | – The schematic of the hybrid NMOS/PMOS nonlinearity cancellation scheme.                                                                                                                           | 54 |  |  |  |  |
| Figure 3.6  | <ul> <li>The 3D EM model of the hybrid NMOS/PMOS nonlinearity<br/>cancellation PA with 4-way series-parallel DAT.</li> </ul>                                                                        |    |  |  |  |  |
| Figure 3.7  | <ul> <li>The simulated passive efficiency of the proposed series-parallel<br/>DAT.</li> </ul>                                                                                                       | 55 |  |  |  |  |

| Figure 3.8  | – The schematic of the proposed PA.                                                                                                                                                                                                                                                                                                                                                                 | 57 |  |  |  |  |
|-------------|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|----|--|--|--|--|
| Figure 3.9  | - The chip microphotographic.                                                                                                                                                                                                                                                                                                                                                                       | 57 |  |  |  |  |
| Figure 3.10 | <ul> <li>Measured CW performance vs. the gate voltages, e.g., (a)AM-AM distortion, (b) AM-PM distortion, (c) Psat, (d) OP1dB (e) Gp, (f) PAEmax, (g) PAEP1dB, and (h) PAE6dB_PBO</li> </ul>                                                                                                                                                                                                         |    |  |  |  |  |
| Figure 3.11 | – (a) the measured CW large-signal performance at 55GHzfor High-Linearity-Mode (HLM), (b) the measured CW large-signal performance at 55GHz for High-Efficiency-Mode (HEM), (c) the AM-AM distortion, (d) the AM-PM distortion, (f) the delivery power, (g) the efficiency and (h) The S-parameters.                                                                                                | 62 |  |  |  |  |
| Figure 3.12 | – The 5G NR 64-QAM CP-OFDM modulation results with 1-CC FR2 400MHz.                                                                                                                                                                                                                                                                                                                                 | 63 |  |  |  |  |
| Figure 3.13 | - The 5G NR 64-QAM CP-OFDM modulation results with 1-CC FR2 400MHz.                                                                                                                                                                                                                                                                                                                                 | 64 |  |  |  |  |
| Figure 3.14 | $^{\rm -}$ The 5G NR 64-QAM CP-OFDM modulation results with 2-CC FR2 400MHz.                                                                                                                                                                                                                                                                                                                        | 64 |  |  |  |  |
| Figure 3.15 | <ul> <li>The 5G NR 64-QAM CP-OFDM modulation results with 4-CC FR2 100MHz.</li> </ul>                                                                                                                                                                                                                                                                                                               | 65 |  |  |  |  |
| Figure 3.16 | <ul> <li>The 5G NR 64-QAM CP-OFDM modulation results with 4-CC FR2 200MHz.</li> </ul>                                                                                                                                                                                                                                                                                                               | 65 |  |  |  |  |
| Figure 3.17 | – The measured 5G NR 64-QAM CP-OFDM 1-CC FR2 400MHz modulation results, e.g., (a) Pavg, (b) PAEavg and (c) ACPR with various VDD values (VDD = 0.9, 1.1, 1.1 and 1.2 V).                                                                                                                                                                                                                            | 66 |  |  |  |  |
| Figure 4.1  | – A single-ended reflection-type phase shifter (RTPS).                                                                                                                                                                                                                                                                                                                                              | 73 |  |  |  |  |
| Figure 4.2  | – (a) The load impedance tuning range for a lossless capacitive load (CL) by tuning C, and (b) the simulated phase shifting range as the capacitor load tuning range $\alpha$ varies from 1 to its maximum value for different $\alpha$ max (e.g., $\alpha$ max=2, 3, or 4). Assume that the optimum Cmin value, i.e., $C^{min}=1/(\sqrt{\alpha_{max}}Z_0\omega_0)$ is used in this CL RTPS design. | 75 |  |  |  |  |
| Figure 4.3  | – (a) The load impedance trajectory for a lossless parallel L-C load (PLC) by tuning Cp, and (b) the simulated phase shifting range for different maximum capacitor tuning range αmax=Cpmax/Cpmin, e.g., αmax=2, 3, or 4. Assume the optimum Cpmin and Lp values are used in this PLC RTPS design.                                                                                                  | 77 |  |  |  |  |

| Figure 4.4  | – (a) The load impedance trajectory for a lossless parallel L-C load (PLC) by tuning Cp, and (b) the simulated phase shifting range for different maximum capacitor tuning range $\alpha$ max=Cpmax/Cpmin, e.g., $\alpha$ max=2, 3, or 4. Assume the optimum Cpmin and Lp values are used in this PLC RTPS design.                                                                                              | 79 |
|-------------|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|----|
| Figure 4.5  | – The load impedance tuning range for a lossless CLC $\pi$ -resonator load with one tunable capacitor (CLC-1).                                                                                                                                                                                                                                                                                                  | 81 |
| Figure 4.6  | – The simulated CLC-1 phase shifting range versus the capacitor tuning α and capacitor ratio β (i.e., $\beta$ =C3/C2min) for a lossless CLC-1 RTPS load. The phase shifting range is referenced to zero for α =1, i.e., no capacitive tuning. It covers (a) $0 \le \beta \le 3$ and (b) $0 \le \beta \le 20$ .                                                                                                  | 82 |
| Figure 4.7  | – The load impedance tuning range for a lossless CLC $\pi$ -resonator load with two tunable capacitors (CLC-2).                                                                                                                                                                                                                                                                                                 | 85 |
| Figure 4.8  | – The simulated CLC-2 phase shifting range versus the two capacitor tuning $\alpha 2$ and $\alpha 3$ at different capacitor ratios $\beta$ (i.e., $\beta$ =C3min/C2min) for a lossless CLC-2 RTPS load with (a) $\beta$ =0, (b) $\beta$ =1, (c) $\beta$ =3, and (d) $\beta$ =10. The $\alpha$ max is 3 and the $\Delta\theta$ is referenced to zero for $\alpha 2$ = $\alpha 3$ =1, i.e., no capacitive tuning. | 86 |
| Figure 4.9  | <ul> <li>The proposed transformer multi-resonance load in an RTPS with (a) single-ended schematic and (b) simplified schematic.</li> </ul>                                                                                                                                                                                                                                                                      | 91 |
| Figure 4.10 | - The load impedance tuning range for the proposed transformer-based multi-resonance load by varying both Ct and Cv'. All the passive components are assumed to be lossless for simplicity.                                                                                                                                                                                                                     | 92 |
| Figure 4.11 | – The simulated phase shift of our proposed transformer-based multi-resonance load by independently varying Cv and Ct with a maximum capacitive tuning range $\alpha tmax = \alpha vmax = 3$ . (a) Ctmin is obtained from equation (24), and (b) Ctmin is 1.5 times of the value obtained from equation (24).                                                                                                   | 92 |
| Figure 4.12 | $-$ A transformer-based $90^\circ$ coupler with (a) the schematic and (b) 3D EM model in the proof-of-concept 60GHz RTPS design.                                                                                                                                                                                                                                                                                | 94 |
| Figure 4.13 | <ul> <li>(a) Simulated amplitudes for the Thru, Cpl, and Iso ports and the return loss at the In port.</li> <li>(b) Simulated phases for the Thru and Cpl ports.</li> </ul>                                                                                                                                                                                                                                     | 95 |
| Figure 4.14 | - Our proposed RTPS design with (a) the schematic, (b) the layout and EM model, (c) the chip microphotograph, and (d) the EM model of the proposed load.                                                                                                                                                                                                                                                        | 98 |

| Figure 4.15 | 5 – The simulated complex IL at 62GHz for the proposed transformer-based RTPS. The simulation includes 3D EM modeling of the passive structures and device parasitic extraction.                                                                              |     |  |  |  |
|-------------|---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|-----|--|--|--|
| Figure 4.16 | <ul> <li>The simulated capacitance vs. the control voltage of (a) Vcv and (b)</li> <li>Vct.</li> </ul>                                                                                                                                                        | 99  |  |  |  |
| Figure 4.17 | - The simulated P1dB and IIP3 with (a) Vcv=0.45V and Vct=0.45V Vcv=0.45V and Vct=0.45V, and (b) Vcv=-1V and Vct=-1V.                                                                                                                                          |     |  |  |  |
| Figure 4.18 | <ul> <li>Measured RTPS complex IL at 62GHz for the test chip 1. Minimum IL and constant IL contours (10dB, 11dB, and 12dB) are highlighted.</li> </ul>                                                                                                        |     |  |  |  |
| Figure 4.19 | 19 — Measured complex IL of three independent RTPS test chips at<br>different frequencies. The constant IL contours (IL=10/11/12dB) are<br>highlighted.                                                                                                       |     |  |  |  |
| Figure 4.20 | - Measured RTPS minimum IL vs. phase shift settings at various frequencies for 3 test chips, e.g., (a) Test chip 1, (b)Test chip 2 and (c)Test chip 3.                                                                                                        | 105 |  |  |  |
| Figure 4.21 | - FoM vs. frequency for our RTPS at different frequencies.                                                                                                                                                                                                    | 105 |  |  |  |
| Figure 4.22 | – Measured IL variation vs. 360° phase shifts for different constant IL contours (10, 11 and 12dB) at 62GHz for 3 test chips, e.g., (a) Test chip 1, (b)Test chip 2 and (c)Test chip 3.                                                                       | 106 |  |  |  |
| Figure 4.23 | <ul> <li>Measured return loss vs. 2 independent control voltages (i.e., Vcv and Vct) of three independent RTPS chip samples at 58, 60, 62 and 64GHz for Test chip 1.</li> </ul>                                                                               | 107 |  |  |  |
| Figure 4.24 | <ul> <li>Measured return loss vs. 2 independent control voltages (i.e., Vcv and Vct) of three independent RTPS chip samples at 58, 60, 62 and 64GHz for Test chip 2.</li> </ul>                                                                               | 107 |  |  |  |
| Figure 4.25 | <ul> <li>Measured return loss vs. 2 independent control voltages (i.e., Vcv and Vct) of three independent RTPS chip samples at 58, 60, 62 and 64GHz for Test chip 3.</li> </ul>                                                                               | 108 |  |  |  |
| Figure 4.26 | – (a) A lossy CL model, (b) the maximum phase shift $\Delta\theta$ max vs. $\alpha$ , (c) the load reflection coefficient $ \Gamma $ vs. $\alpha$ , and (d) the load impedance trajectory with different QCmax.                                               | 111 |  |  |  |
| Figure 4.27 | – (a) A lossy SLC model, (b) the load reflection coefficient $ \Gamma $ as $\alpha$ varies from 1 to $\alpha$ max, (c) the $\Delta\theta$ max versus different quality factors, and (d) the load impedance trajectory on Smith chart with the different QCmax | 113 |  |  |  |

- Figure 4.28 (a) A lossy PLC model, (b) the load reflection coefficient  $|\Gamma|$  as  $\alpha$  115 varies from 1 to  $\alpha$ max, (c) the  $\Delta\theta$ max versus different quality factors, and (d) the load impedance trajectory on Smith chart with the different  $OC^{max}$ .
- Figure 4.29 (a) A lossy CLC-1 model, (b) the load impedance trajectory on Smith 117 chart with different quality factors for  $\alpha$ max=3 and  $\beta$ =1, (c)  $\beta$ =2 and (d)  $\beta$ =3.
- Figure 4.30 (a) A lossy CLC-2 model and (b) the lossy CLC-2 load impedance 118 trajectory on Smith chart with the different quality factors for  $\alpha$ max=3 and  $\beta$ =1.
- Figure 4.31 (a) A lossy proposed transformer-based multi-resonance reflective 119 model and (b) the load impedance trajectory on Smith chart with the different quality factors for αmax=3.

#### **SUMMARY**

Millimeter-wave (mm-Wave) phased arrays have been widely used in numerous wireless systems to perform beam forming and spatial filtering that can enhance the equivalent isotropically radiated power (EIRP) for the transmitter (TX). As well as it can minimize the interferences and further increase signal-to-noise ratio (SNR) for the receiver (RX). Regarding the existing phased-array architectures, an mm-Wave transmitter includes several building blocks to perform the desired delivered power and phases for wireless communication.

Power amplifier (PA) is the most important building block. It needs to offer several advantages, e.g., high efficiency, broadband operation and high linearity. With the recent escalation of interest in 5G wireless communication technologies, mm-Wave transceivers at the 5G frequency bands (e.g., 28 GHz, 37 GHz, 39 GHz, and 60 GHz) have become an important topic in both academia and industry. Thus, PA design is a critical obstacle due to the challenges associated with implementing wideband, highly efficient and highly linear PAs at mm-Wave frequencies. However, there exists several fundamental challenges for mm-Wave PA in silicon due to the lower transistor cut-off frequency ( $f_{max}$ ) compared to compound III-V processes, lower breakdown voltage and lossy silicon substrate, which causes high design complexity and difficulty. In this dissertation, we present several PA design innovations to address the aforementioned challenges.

Additionally, phase shifter (PS) also plays a key role in a phased-array system, since it governs the beam forming quality and steering capabilities. A high-performance phase

shifter should achieve a low insertion loss, a wide phase shifting range, dense phase shift angles, and good input/output matching.

For PA design, first, an mm-Wave continuous-mode harmonically-tuned PA is proposed to provide an instantaneously broadband operation, high PA efficiency and an ultra-compact size at 28.5GHz. Based on the harmonically-tuned technique, a continuous-mode Class-F<sup>-1</sup> PA can be realized and present a power-added efficiency and saturated power (P<sub>sat</sub>). Moreover, we combine both continuous-mode Class-F and Class-F<sup>-1</sup> PA operation to realize a continuous-mode hybrid Class-F/F<sup>-1</sup> PA to provide a broader operation bandwidth and also maintain high PA efficiency. Importantly, these continuous-mode harmonically-tuned operation delivers output power and drain efficiency equivalent to that of the standard narrow band Class-F and Class-F<sup>-1</sup> PA operations.

Secondly, an mm-Wave PA with a power combiner-type N-/P-MOS amplitude-to-phase (AM-PM) distortion cancellation scheme is proposed to provide a <1° AM-PM distortion and maintain high PAE at V-band range. Although multiple AM-PM distortion cancellation schemes have been reported, they usually require additional elements or circuits. However, elements or circuits induce extra losses to downgrade the PA performance even they maintain high PA linearity. Our proposed technique not only improve PA linearity but provide high PAE.

For phase shifter design, we present an mm-Wave fully differential transformer-based passive reflection-type phase shifter (RTPS) capable of performing full span 360° continuous phase shift from 58 to 64 GHz. It consists of two transformer-based 90° couplers and two transformer-based multi-resonance reflective loads to provide 360° phase

shift with low loss and ultra-compact chip size. Our proof-of-concept design is implemented in a standard 130-nm BiCMOS process with a core area of 480μm×340μm. It achieves a wide phase shifting range of 367° and a low insertion loss IL (3.7dB<|IL|<10.2dB) at62 GHz and maintains a full span 360° phase shifting range from58 to 64 GHz. Moreover, it supports360° phase shifting with a constant IL, i.e., |IL|=10, 11, 12 dB, at an IL variation of less than 0.74 dB at 62 GHz. To the best of our knowledge, this design achieves a first-ever full span 360° phase shifting (up to 367°),the lowest IL, the smallest IL variation, and the best figure-of-merit of 37.1°/dB among reported 60 GHz fully integrated RTPS in silicon.

#### CHAPTER 1. INTRODUCTION

With the rapid growth of the fifth generation (5G) communication, multiple non-contiguous millimeter-wave (mm-Wave) frequency bands (e.g., 24, 28, 37 and 39 GHz) are being allocated in different countries and regions. For example, the 5G frequency spectrums are allocated as, 27.5-28.35 GHz and 37-40 GHz (USA), 24.25-27.5 GHz and 31.8-33.4 GHz (Europe), and 24.25-27.5 GHz and 37-42.5 GHz (China). The frequency bands will extend to V-band (40-75 GHz) or even higher in the future. An ultra-broadband mm-Wave TX that can cover all these potential 5G bands will enable frequency diversity and international roaming as well as supporting wideband Multiple-Input-Multiple-Output (MIMO) wireless technology with ultra-compact elements by eliminating the need for assembling several single-band TXs. Thus, regarding future commercial the 5G products, it can release design difficulty and save manufacture cost significantly for targeting multiple frequency bands.

PA plays a crucial role in TX for mobile devices since it consumes a majority of the DC power, shortening usage/standby times and transmitting ranges. Thus, the efficiency of a mm-Wave PA is the key metric to perform. Additionally, 5G mm-Wave systems are expected to support wideband spectrum-efficient complex modulation schemes (e.g., 64- and 256-QAM) to achieve Gb/s link throughput revolution. These complex modulation schemes, however, often come with high-density constellations that demand stringent linearity to provide higher spectral efficiency, i.e. large-signal AM-AM and AM-PM distortion, on the PAs. Thus, the 5G communication system can support large and fragmented spectrum, dynamic spectrum access, and short packet transmissions with

loose synchronization requirements. Therefore, in order to support future wideband MIMOs, desired mm-Wave silicon-based PA solutions should offer wide carrier bandwidth for multiple the future 5G frequency bands, high efficiency for longer usage/standby times, high linearity for complex modulation schemes, sufficient output power for required transmitting ranges, and compact size for cost reduction simultaneously [1]-[25].

Figure 1.1 shows continuous-wave (CW) performance comparisons with reported mm-Wave PAs in SiGe and CMOS process from 20 GHz to 50 GHz. According to [26], both SiGe and CMOS PAs exhibit upper performance envelopes for peak PAE vs. P<sub>sat</sub>, showing "device limited regime" in the low/medium P<sub>sat</sub> region determined by the intrinsic power device efficiency and "circuit/combiner limited regime" in the medium/high P<sub>sat</sub> region governed by the combiner efficiency. Thus, these upper performance envelopes are big challenges for PA designers.

Figure 1.2 shows the plot of average PAE (PAE<sub>avg</sub>) vs average output power ( $P_{avg}$ ) for SiGe and CMOS PA in modulation with 64QAM from 20 GHz to 50 GHz. There are two major groups of  $P_{avg}$ , e.g.,  $P_{avg}$ =5-7dBm with PAE<sub>avg</sub>=~10% and  $P_{avg}$ =9-11dBm PAE<sub>avg</sub>=~10-20% respectively. Note that this plot does not distinguish modulation speeds (i.e., data-rate or symbol-rate) of the reported PA designs. For future 5G wireless communication, the proposed data-rate is at least 0.8 Gb/s at mm-Wave frequency bands.

Basic linear PAs, e.g., Class-A and Class-AB PAs, offer design simplicity and good linearity [27]-[31]. However, their simple "all-short" output harmonic terminations fundamentally limit the peak efficiency. On other hand, mm-Wave time-domain switching

PAs, e.g., Class-E PAs, show high peak efficiency but limited linearity [27][28][32]-[34]. They cannot support complex modulations without major digital pre-distortion (DPD) computation, while DPD at Gb/s usually requires substantial power and complexity for future 5G communication. To solve this classic efficiency-linearity challenge, there are multiple reported advanced PA architectures, such as Doherty PAs [7][31] and Outphasing PAs [2][27]. They can boost the back-off efficiency and maintain high linearity for mm-Wave 5G applications. However, the former often requires large area, while the latter also demand extensive DPD to increase design and implementation complexity.

According to the reported PA design recently, a promising alternative solution is the overdriven linear PAs with harmonically-tuned impedance terminations to address the classic efficiency-linearity challenge. Multiple recent designs show that Class-AB, Class-J, Class-F and inverse Class-F (Class-F<sup>-1</sup>) harmonically-tuned terminations on linear PAs can boost their peak efficiency and still preserve high linearity. The reported harmonicallytuned terminations usually consist manifold L-C resonant tanks to provide the desired harmonic impedances. However, these designs either have limited bandwidth due to narrowband harmonic terminations, require area-consuming passive networks, constraining their use in broadband MIMO systems, or increase design and implementation complexity [35]-[39]. To address these PA issues, we present continuous-mode harmonically-tuned PA to achieve wide bandwidth, high efficiency and compact formfactor together, offering future 5G PA solutions. Moreover, we proposed a V-band PA employing our NMOS/PMOS AM-PM distortion cancellation technique to further improve the linearity. Also, this PA design includes a series-parallel distributed-active transformer (DAT), performing low loss and proving the PA optimal load.

The applications of mm-Wave phased array systems include ultra-high data-rate transmission, emerging 5G communication, and radar and imaging systems [127]-[141]. Phase shifters (PS) play a key role in a phased array system, since it governs the beam forming quality and steering capabilities. A high-performance phase shifter should achieve a low insertion loss (IL), a wide phase shifting range, dense phase shift angles, and good input/output matching. We present a millimeter-wave fully differential transformer-based passive reflection-type phase shifter (RTPS) capable of performing full span 360° continuous phase shift from 58GHz to 64GHz.

Taking full advantages of the increasing transistor speed, my Ph.D. research focuses on exploring new silicon-based PA topologies and circuit techniques to achieve state-of-the-art performance for various emerging applications at mm-Wave. Through my Ph.D. career at Georgia Tech GEMS Lab, I did research on the design challenges in mm-Wave PA, and devoted my research efforts in implementing energy efficient, broadband and high linearity using advanced silicon technologies. The major contributions of my dissertation are listed below.

- We propose a mm-Wave silicon-based harmonically-tuned PA designs by using continuous-mode Class-F<sup>-1</sup> and continuous-mode hybrid Class-F/F<sup>-1</sup> PA operations at 5G bands.
- We propose a mm-Wave silicon-based V-band PA with the proposed N-/P-MOS AM-PM distortion cancellation scheme to achieve <1° AM-PM distortion and PAE of 35%.
- 3. We propose a millimeter-wave fully differential transformer-based passive

reflection-type phase shifter which performs full span 360° continuous phase shift from 58GHz to 64GHz with low insertion loss or constant insertion loss.

The remainder of this dissertation is organized as follows.

In this dissertation, the design details of the mm-Wave silicon-based harmonically-tuned Continuous-mode PA designs are discussed in Chapter 2. The proposed power-combining-based NMOS/PMOS AM-PM distortion cancellation scheme are demonstrated in Chapter 3. Later, Chapter 4 shows the design and implementation of the proposed RTPS. Finally, Chapter 5 summarizes this dissertation.



Figure 1.1 - PAE comparison with state-of-the-art mm-Wave silicon-based PAs in (a) SiGe and (b) CMOS process (20-50 GHz) [26].



Figure 1.2 – Average Efficiency vs Average Output Power for SiGe and CMOS PAs in modulation with 64QAM (20-50 GHz) [26].

#### CHAPTER 2. HARMONICALLY-TUNED PA OPERATION

The basic continuous-mode PA operation and our proposed continuous-mode harmonically-tuned are discussed in this chapter. The implementation and measurement results are also presented.

#### 2.1. Introduction

The device output voltage and current waveforms are essential for optimizing the device-level PA performance, e.g., output power, efficiency, or linearity. A harmonically-tuned PA with finite harmonic terminations (e.g., only the fundamental, 2nd-, and 3rd-order harmonics) is designed to achieve high efficiency by loading the proper terminations at its fundamental and harmonic frequencies. In the time domain, proper harmonic terminations shape the voltage and current waveforms on the power transistor drain/collector terminal to minimize the overlap between the current and voltage, boosting the efficiency. In practice, higher order harmonics (>3rd-order) provide limited contributions and are difficult to generate and terminate [40]-[46].

#### 2.1.1. Conventional Class-F PA operation

The conventional Class-F PA output network provides high impedance terminations at odd harmonics and low impedance terminations at even harmonics [40]. Thus, the voltage waveform  $v_F$  on drain/collector terminal behaves as a square waveform and can be expressed (including DC-/1<sup>st</sup>-/3<sup>rd</sup>-term) as

$$v_F(\theta) = V_{DC} - v_{F1}\cos\theta + v_{F3}\cos3\theta, \tag{2-1}$$

$$v_{F1}/V_{DC} = 2/\sqrt{3}, v_{F3}/V_{DC} = 1/3\sqrt{3},$$

where  $V_{DC}$  represents the dc supply voltage, and  $V_{F,1}$  and  $V_{F,3}$  are the voltage swing at fundamental and 3<sup>rd</sup>-harmonic frequencies respectively. Furthermore, the device current behaves as a half-sine waveform (including DC/1<sup>st</sup>/2<sup>nd</sup> term) as

$$i_F(\theta) = I_{DC} + i_{F,1} \cos \theta + i_{F,2} \cos 2\theta,$$
 (2-2)  
 $i_{F,1}/I_{DC} = \pi/2, i_{F,2}/I_{DC} = 2/3,$ 

where  $I_{DC}$  is the DC current, and  $I_{F,1}$  and  $I_{F,2}$  are the current swing at fundamental and  $2^{\text{nd}}$ -harmonic frequencies, respectively. The normalized time-domain current and voltage waveforms are shown in Figure 2.1 (i.e., the blue solid line represents the voltage and the red solid line represents current).



Figure 2.1 – Theoretical current and voltage waveforms composed of the fundamental, 2nd- and 3rd-harmonics for (a) conventional/continuous-mode Class-F and (b) conventional/continuous-mode Class-F<sup>-1</sup> operations.

## 2.1.2. Conventional Class-F<sup>-1</sup> PA operation

The conventional Class-F<sup>-1</sup> PA is a dual of the Class-F PA by exchanging the current and voltage waveforms. Namely, the conventional Class-F<sup>-1</sup> PA generates a half-sinusoidal voltage waveform and a square current waveform with high impedance for even order harmonic output impedance and low impedance for odd order harmonic output impedance. The voltage waveform  $v_{IF}$  can be written as

$$v_{IF}(\theta) = V_{DC} + v_{IF,1} \cos \theta + v_{IF,2} \cos 2\theta,$$
 (2-3)

$$v_{IF,1}/V_{DC} = \sqrt{2}, v_{IF,2}/V_{DC} = 1/2,$$

where  $V_{IF,1}$  and  $V_{IF,2}$  are the voltage swing at fundamental and 3<sup>rd</sup>-order harmonic frequencies, respectively. Thus, the current on a transistor behaves as a square waveform and it can be expressed as

$$i_{IF}(\theta) = I_{DC} - i_{IF,1}\cos\theta + i_{IF,3}\cos3\theta \tag{2-4}$$

$$i_{IF,1}/I_{DC} = 1.162, i_{IF,3}/I_{DC} = 0.162,$$

where  $I_{IF,1}$  and  $I_{IF,3}$  are the current swing at fundamental and 3<sup>rd</sup>-order harmonic frequencies, respectively [44]. The normalized time-domain current and voltage waveforms are shown in Figure 2.1(b) (i.e., the blue solid line represents voltage and the red solid line represents current respectively).

Moreover, the PA efficiency  $\eta$ , (i.e., drain efficiency for MOSFET and collector for Bipolar) can be expressed as,

$$\eta = \frac{P_{delivery}}{P_{DC}} = \frac{1}{2} \left( \frac{v_{F,1} \text{ or } v_{IF,1}}{V_{DD}} \right) \left( \frac{i_{F,1} \text{ or } i_{IF,1}}{I_{DC}} \right) = \eta_{\text{max}} \left( 1 - \frac{V_k}{V_{DC}} \right), \tag{2-5}$$

where  $\eta_{\text{max}}$  represents the maximum PA efficiency,  $V_k$  is the knee voltage which represents a minimum limit on the swing across the transistor. An ideal Class-F PA with fundamental,  $2^{\text{nd}}$ -and  $3^{\text{rd}}$ - harmonics can achieve an  $\eta_{\text{max}}$  of 90.6 % while a Class F<sup>-1</sup> PA can achieve an  $\eta_{\text{max}}$  of 81.6 %, respectively [27]. To maintain high PA efficiency, conventional Class-F and Class-F<sup>-1</sup> PA output networks need to provide multiple accurate harmonic impedance terminations, resulting in narrow carrier bandwidths [27][37][38].



Figure 2.2 – Fundamental, 2nd- and 3rd-harmonic load impedances for (a) continuous-mode Class-F and (b) continuous-mode Class-F-1 PA operations.

## 2.1.3. Continuous-mode Class-F and Class-F-1 PA

Continuous-mode harmonically tuned PAs generalize the optimum harmonic termination conditions and thus substantially expand the carrier frequency range. In reference [41], the author introduces a continuous-mode PA, alleviating the precise harmonic requirements of the conventional ones by offering multiple impedance terminations that can be dynamically distributed over the desired operation bandwidth, while preserving the desired output power and efficiency. Given by [41], for the continuous-mode Class-F PA operation, the voltage waveform in (2-1) is extended by multiplying an additional defining term, shown as

$$v_{CF}(\theta) = (V_{DC} - v_{F1}\cos\theta + v_{F3}\cos3\theta) \times (1 - \gamma\sin\theta). \tag{2-6}$$

The first bracket of (2-6) is the voltage waveform formulation for the conventional Class-F as expressed in (2-1) with  $\gamma$ =0. The last bracket in (6) is a defining term  $(1-\gamma\sin\theta)$  that performs a new design space. Thus, the parameter  $\gamma$  varies between -1 and 1 (i.e., -1 $\leq \gamma \leq 1$ ), forming a family of voltage waveforms that provide multiple solutions Figure 2.2(a), gray dot lines for  $\gamma$ <0 and purple dash lines for  $\gamma$ >0) to maintain constant power and efficiency. As result, each value of corresponding to the particular PA output fundamental,  $2^{\text{nd}}$ - and  $3^{\text{rd}}$ -order harmonic impedances can be expressed as [44],

$$Z_{CF,1} = R_{opt} \frac{2}{\sqrt{3}} + j\gamma R_{opt}, Z_{CF,2} = jR_{opt} \frac{7\sqrt{3}\pi}{24} \gamma, Z_{CF,3} = \infty,$$
 (2-7)

where  $R_{opt}$  is the optimum impedance of the standard Class-B operation with all harmonics short-circuited. The impedance trajectories can be presented on the Smith chart (Figure 2.2a).

On other hand, for the continuous-mode Class-F<sup>-1</sup> PA operation, the current waveform in (2-2) can be also extended by multiplying an additional defining term, expressed as

$$i_{CIF}(\theta) = (I_{DC} - i_{IF,1}\cos\theta + i_{IF,3}\cos3\theta) \times (1 - \xi\sin\theta). \tag{8}$$

The first bracket of (2-8) is the conventional voltage waveform formulation for the conventional Class-F<sup>-1</sup> operation as expressed in (2-2) with  $\xi$ =0. The last bracket of (2-8) is also a defining term  $(1-\xi\sin\theta)$ , offering a new design space. The parameter  $\xi$  varies between -1 and 1 (i.e., -1 $\leq$  $\xi$  $\leq$ 1), forming a family of current waveforms that provide multiple solutions Figure 2.2(b), magenta dot lines for  $\xi$ <0 and dark-yellow dash lines for  $\xi$ >0) to maintain constant delivery power and efficiency. Each value of corresponding to the continuous-mode PA output fundamental, 2<sup>nd</sup>-order and 3<sup>rd</sup>-order harmonic impedances can be expressed as the following [44],

$$Y_{CIF,1} = G_{opt} \sqrt{2} i_{IF,1} + j G_{opt} \sqrt{2} i_{DC} \xi, \tag{9}$$

$$Y_{CIF,2} = jG_{opt} 2(i_{IF,1} + i_{IF,3})\xi, Y_{CIF,3} = \infty,$$

where  $G_{opt}$  (=1/ $R_{opt}$ ) is as the optimum admittance. The impedance trajectories can be shown on the Smith chart (Figure 2.2b). These continuous-mode PA operations can be realized over the desired operation bandwidth by applying the required harmonic

impedances for the different  $\gamma$  or  $\xi$  values. Additionally, these continuous-mode PAs can deliver output power and efficiency almost equivalent to that of the conventional PAs [41]- [44].

The PA load impeance (e.g.,  $Z_L$ ) behaviours for different PA operation are summarized in Table II. It is obvious that the coninuoious-mode PA operation (i.e.,  $\gamma \neq 0$  and  $\xi \neq 0$ .) causes fundamental impedance (e.g., Capacitive/Inductive or Inductive/Capacitive) and  $2^{nd}$ -harmonic impedance (e.g., Inductive/Capacitive or Capacitive/Inductive) out of phase. The  $3^{rd}$ -harmonic impedances stay high or low.

**Table 2.1 – PA Load Impedance Behaviors for Different Operations** 

| Operation            |             | Fundamental<br>Impedance              |                         | 2 <sup>nd</sup> harmonic<br>Impedance |                         | 3 <sup>rd</sup> harmonic<br>Impedance |         |
|----------------------|-------------|---------------------------------------|-------------------------|---------------------------------------|-------------------------|---------------------------------------|---------|
|                      |             | $ Z_L $                               | $Z_{L}$                 | $ Z_L $                               | $Z_{L}$                 | $ Z_L $                               | $Z_{L}$ |
| Class-F              | <b>γ</b> =0 | $R_{opt,F}$                           | 0                       | Low                                   | 0                       | High                                  | 0       |
| Class-F-1            | ξ=0         | $R_{opt,IF}$                          | 0                       | High                                  | 0                       | Low                                   | 0       |
| Continuous-Class-F   | γ>0<br>γ<0  | $R_{\text{opt,F}}$ $R_{\text{opt,F}}$ | Capacitive<br>Inductive | Low<br>Low                            | Inductive<br>Capacitive | High                                  | either  |
| Continuous-Class-F-1 | ξ>0<br>ξ<0  | $R_{opt,IF}$ $R_{opt,IF}$             | Inductive<br>Capacitive | High<br>High                          | Capacitive Inductive    | Low                                   | either  |

#### 2.1.4. Proposed Continuous-mode PA Output Network

Most existing continuous-mode and harmonically-tuned PA output networks require multiple passive components and transmission lines for multi-resonance tuning, inevitably increasing the network complexity, losses, and size. Our PA output network exploits and uses parasitic elements in one on-chip transformer to achieve continuous-mode harmonic tuning at both differentials- and common-mode with substantial network simplification and area-saving (Figure 2.3).

#### 2.2.1. Continuous-Mode Class-F<sup>-1</sup> PA Output Network

The proposed differential continuous-mode Class-F<sup>-1</sup> PA output network is as shown in Figure 2.3(a). It consists of one 1:1 transformer and three harmonic tuning capacitors ( $2\times C_{d1}$  and  $C_{c1}$ ). This structure utilizes two symmetrically embedded branches  $L_{d1}$  inside the transformer for the  $3^{rd}$ -order harmonic impedance tuning in differential-mode, and two extended branches  $L_{c1}$  and  $L_{c11}$  for the  $2^{nd}$ -order harmonic impedance tuning in common-mode respectively. Figure 2.4 shows the simplified differential and common-mode half-circuits at the fundamental,  $2^{nd}$ - and  $3^{rd}$ -order harmonic frequencies respectively.  $L_{dm1}/L_{cm1}$  and  $L_{dm2}/L_{cm2}$  are the differential/common-mode half-circuit inductances of the transformer, and the output leads are absorbed into the transformer secondary coil (Figure 2.4a-b). Moreover,  $L_{m1}$  and  $L_{k1}$  are the magnetizing and leakage inductances of the transformer in the differential-mode half-circuit (Figure 2.4d).

In the differential mode, the center-tap of the transformer is virtual ground, so  $L_{c1}/L_{c11}$  and  $C_{c1}$  do not affect the fundamental and  $3^{rd}$ -order harmonic impedances.  $C_{d1}$ - $L_{d1}$ - $L_{dm2}$ 

form a multi-resonance tank  $Z_1$  with high-frequency resonance. At the fundamental frequency, the series network  $C_{d1}$ - $L_{d1}$  behaves like a small capacitor, resulting in a high impedance to the transformer. Thus, the transformer performs matching with the PA output capacitance  $C_{out}$  and provides the desired fundamental load impedance to the PA (Figure 2.4c). At the  $3^{rd}$ -order harmonic frequency,  $C_{d1}$ - $L_{d1}$  is slightly below its series resonance, which shorts out  $L_{dm2}$  and forms a series resonance of  $C_{d1}$ - $L_{d1}$ - $L_{m1}$ - $L_{k1}$  to provide a desired low impedance. In the common-mode half-circuit, the network of  $C_{c1}/2$ ,  $2 \times L_{c1}$ , and  $2 \times L_{c11}$  forms a multi-resonance tank  $Z_2$  (Figure 2.4b). At the  $2^{nd}$ -order harmonic frequency,  $Z_2$  provides a high impedance, resulting in the remaining series tank of  $C_{d1}$ - $L_{d1}$  as a capacitor. Thus, the  $2^{nd}$ -order harmonic impedance is dominated by  $C_{out}$ ,  $L_{cm1}$  and the effective capacitance due to series  $C_{d1}$ - $L_{d1}$ , which achieve desired  $2^{nd}$ -order harmonic impedance (Figure 2.4e).

The trajectories of half-circuit load impedance at fundamental,  $2^{nd}$ - and  $3^{rd}$ -order harmonics with the PA output capacitance  $C_{out}$  are shown on the Smith Chart in Figure 2.5. The fundamental load impedance is mostly inductive for lower frequency  $(0 \le \xi \le 1)$  and capacitive for higher frequency  $(-1 \le \xi \le 0)$ , and vice versa for the  $2^{nd}$ -order harmonic impedance. The fundamental and the  $2^{nd}$ -order harmonic impedances of the upper operation bandwidth follow the constant conductance circles, while the  $3^{rd}$ -order-harmonic impedance is kept low. These aspects verify that the PA achieves continuous-mode Class- $F^{-1}$  harmonic terminations for its fundamental,  $2^{nd}$ -, and  $3^{rd}$ -order impedances [41]-[44].



Figure 2.3 - EM and schematic of (a) continuous-mode Class-F<sup>-1</sup> PA and (b) continuous-mode hybrid Class-F/-F<sup>-1</sup> PA.



Figure 2.4 – Simplified of (a) differential-mode and (b) common-mode half circuits of the continuous-mode Class- $F^{-1}$ PA output network at (c) the fundamental frequency (d) the 3rd-harmonic of lower band, and (e) the 2nd-harmonic frequencies, respectively.



Figure 2.5 – Trajectories of the half-circuit load impedance at fundamental,  $2^{nd}$ - and  $3^{rd}$ -order harmonic frequencies (characteristic impedance  $Z_0$  =50 $\Omega$ ).

## 2.2.2. Continuous-mode Hybrid Class-F/F-1 PA Output Network

This continuous-mode hybrid Class-F/F<sup>-1</sup> PA output network combines continuous-mode hybrid Class-F and F<sup>-1</sup> PA operations together to further extend the bandwidth. Compared to the continuous-mode Class-F<sup>-1</sup> PA output network, the two matching capacitors (i.e.,  $2\times C_L$ ) can facilitate the fundamental operation bandwidth extension, and the longer branches  $L_{c2}$  can provide a larger inductance for  $2^{nd}$ -order harmonic impedance, as shown in Figure 2.3(b). It also consists of one 1:1 transformer, three harmonic tuning capacitors (i.e.,  $2\times C_{d2}$  and  $C_{c2}$ ), and two matching capacitors (i.e.,  $2\times C_L$ ) to realize hybrid Class-F and Class-F<sup>-1</sup> operations at the lower ( $\omega_L$ ) and higher frequency ( $\omega_H$ ) bands, respectively, as shown in Figure 2.3(b). The PA output harmonic termination network is explained in Figure 2.6(a)-(h). Here,  $L_{dm3}/L_{cm3}$  and  $L_{dm4}/L_{cm4}$  represent the differential-/common-mode half-circuit inductances of the transformer, and the output leads are absorbed into the secondary coil.  $C_{d2}$ - $L_{d2}$ - $L_{dm4}$  forms a multi-resonance tank  $Z_3$  (Figure 2.6a). In the common-mode half-circuit, the network of  $C_{c2}$ /2,  $2\times L_{c2}$ , and  $2\times L_{c22}$  forms a multi-resonance tank  $Z_4$  (Figure 2.6b).

At fundamental operation frequencies ( $\omega_L \le \omega \le \omega_H$ ), the series network  $C_{d2}$ - $L_{d2}$  behaves as a small capacitor (Figure 2.6c) to provide a high impedance. So, this high impedance branch can be ignored. Thus,  $Z_{L,diff}$  can be converted to a simplified model as shown in Figure 2.6(d).  $k^2 \times L_p$  and  $(1-k^2) \times L_p$  are the magnetization and leakage inductances respectively of the transformer in the half-circuit differential-mode (Figure 2.6d). The equivalent inductance  $L_p$  is roughly equal to  $L_{dm3}$  and  $L_{dm4}$ . Thus, Figure 2.6(d) forms the

matching network with the PA output capacitance C<sub>out</sub> and provides PA the desired fundamental load impedance.

At the  $3^{rd}$ -order harmonic of the higher band ( $\omega$ =3 $\omega_H$ ), the series network  $C_{d2}$ -L $_{d2}$  impedance is slightly below its series resonance, which shorts out L $_{dm4}$  and forms a series resonance of  $C_{d2}$ -L $_{d2}$ -L $_{m3}$ -L $_{k2}$  to offer a low load impedance (Figure 2.6e). In this case, L $_{m2}$  and L $_{k2}$  represent the magnetization and leakage inductances of coil L $_{dm3}$  of the transformer in the half-circuit differential-mode. Also, at the  $3^{rd}$ -order harmonic of the lower band ( $\omega$ =3 $\omega_L$ ), Z $_{L,diff}$  sees a high impedance by L $_{dm3}$  and Z $_3$  in parallel with  $C_{out}$ . At the 2 $^{nd}$ -order harmonic of the higher band ( $\omega$ =2 $\omega_H$ ), Z $_4$  provides a high impedance, while the remaining  $C_{d2}$ -L $_{d2}$  series tank behaves as a capacitor (Figure 2.6g). Therefore, the 2 $^{nd}$ -order harmonic impedance Z $_{L,com}$  is dominated by  $C_{out}$ , L $_{cm3}$  and the effective capacitance due to series  $C_{d2}$ -L $_{d2}$  branch, achieving the desired continuous-mode 2 $^{nd}$ -order harmonic impedance.

Additionally, at the  $2^{nd}$ -order harmonic of the lower band ( $\omega$ = $2\omega_L$ ),  $Z_2$  becomes inductive. Moreover, the series network  $C_{d2}$ - $L_{d2}$  remains capacitive. Therefore,  $Z_{L,com}$  can present a low overall impedance. The trajectories of the half-circuit load impedance at fundamental,  $2^{nd}$ - and  $3^{rd}$ -order harmonics with the absorbed PA output capacitance  $C_{out}$  are shown on the Smith Chart in Figure 2.7(a).

The fundamental load impedance  $(\omega_L \le \omega \le \omega_H)$  is inductive, while the  $2^{nd}$ -order harmonic load impedances  $(\omega = 2\omega_L)$  or  $\omega = 2\omega_H$  are capacitive and provide  $-1 \le \gamma < 0$  for continuous-mode class-F PA operation and  $-1 \le \xi < 0$  for continuous-mode Class-F<sup>-1</sup> PA operation, respectively. Compared with the fundamental, the  $3^{rd}$ -oder harmonic load impedance is low for lower band  $(\omega = 3\omega_L)$  while it is high for higher band  $(\omega = 3\omega_H)$ . The

load impedance trajectories demonstrate the continuous-mode hybrid Class-F/F<sup>-1</sup> PA. The impedance responses of each harmonic over frequency are shown in Figure 2.7(b).

The design procedure of the PA output network is starting from building a  $4^{th}$ -order matching network using passive lumped elements (e.g., inductors and capacitors) at fundamental frequency, like Figure 2.6(d). The designed values of the passive lumped elements are depended on the load-pull simulation. For the harmonic-thing, the importance is that the required lumped inductors and capacitors are necessary arranged properly in the differential and common modes to realize the  $2^{nd}$ - and  $3^{rd}$ -order harmonic impedance terminations for the continuous-mode operation, respectively. Here, the values of the inductors and capacitors are determined by the continuous-mode design equations, as discussed in Section II. Then, a transformer with the routing traces and pads (i.e., GSGSG pads) is to replace the lumped elements. As a result, the transformer needs to be well optimized and simulated to determine the inductances and capacitances (e.g.,  $C_{c1}$ ,  $C_{c2}$ ,  $C_{d1}$ ,  $C_{d2}$ ,  $C_{d1}$ ,  $C_{d2}$ ,  $C_{d1}$ ,  $C_{d2}$ ,  $C_{d1}$ ,  $C_{d2}$ , and so on).



Figure 2.6 – Simplified half circuits of the (a) differential-mode, (b) common-mode of the continuous-mode hybrid Class-F/F<sup>-1</sup> PA output network at (c) the fundamental frequency ( $\omega_L \le \omega \le \omega_H$ ) and (d) the fundamental equivalent circuit, (e) the 3<sup>rd</sup>-order harmonic of lower band ( $\omega = 3\omega_L$ ), (f) the 3<sup>rd</sup>-order harmonic of higher band ( $\omega = 3\omega_H$ ), (g) the 2<sup>nd</sup>-order harmonic of lower band ( $\omega = 2\omega_L$ ), and (h) the 2<sup>nd</sup>-order harmonic of higher band ( $\omega = 2\omega_H$ ).



Figure 2.7 – (a) Trajectories of half-circuit load impedance at fundamental,  $2^{nd}$ -and  $3^{rd}$ -order harmonic frequencies ( $Z_0 = 50\Omega$ ), and (b) impedance response.

### 2.1.5. Continuous-Mode Harmonically-Tuned PA Implementation

## 2.3.1. Design 1: A Two-Stage Continuous-mode Class-F<sup>-1</sup> PA

Figure 2.8(a) shows the schematic of the two-stage continuous-mode Class-F<sup>-1</sup>PA design. It is composed of a driver (DR) stage and a PA stage followed by the proposed differential transformer-based harmonically-tuned PA output network, implemented in GlobalFoundries 0.13 µm SiGe BiCMOS process. The transformer-based harmonicallytuned PA output network has been fully analyzed in the previous section. The transistor size of DR (i.e.,  $Q_1$  and  $Q_2$ ) is 21  $\mu$ m/120 nm and the transistor size of PA (i.e.,  $Q_3$  and  $Q_4$ ) is 32  $\mu$ m/120 nm, respectively. Two series input resistors R<sub>b</sub> are 10  $\Omega$  for both DR and PA stages to improve stability. Both PA and DR stages utilize neutralization capacitors  $(C_{N DR}=30 \text{ fF} \text{ and } C_{N PA}=40 \text{ fF})$  to improve power gain, reverse isolation and stability. The input and inter-stage matching networks are realized by two 1-to-1 transformers respectively, and two input capacitors (C<sub>in1</sub>=128 fF) and two inter-stage capacitors (C<sub>int</sub>=55 fF). The DR stage is biased (V<sub>B</sub> DR) at 0.84 V and DC supply voltage (V<sub>CC</sub> DR) is 0.9 V, and the PA stage is biased (V<sub>B PA</sub>) at 0.83 V and DC supply voltage (V<sub>CC PA</sub>) is 1.9 V, respectively. To enhance PA linearity, a harmonic trap network (i.e.,  $R_{s1}=R_{s11}=25~\Omega$  and  $C_{s1}=C_{s11}=200$  fF) is added at the PA input to provide a low 2<sup>nd</sup>-order harmonic source impedance. The DC and peak current of the PA stage are 16 mA and 48 mA, respectively.

## 2.3.2. Design 2: A One-Stage Continuous-mode Class-F/F<sup>-1</sup> PA

Figure 2.8(b) shows the schematic of the one-stage differential hybrid continuous-mode Class-F/F<sup>-1</sup> PA design. It is composed of only the PA stage followed by the proposed differential transformer-based PA output network, implemented in a Globalfoundries 45nm

CMOS SOI process. The PA is realized using a cascode topology with identical sizes  $(W/L=6\times30~\mu\text{m}/40~\text{nm})$  for  $M_1$ ,  $M_2$ ,  $M_3$ , and  $M_4$  that are biased at  $V_G=0.3~\text{V}$ ,  $V_{cas}=1.3~\text{V}$ , and  $V_{DD}=2~\text{V}$ . The capacitive neutralization scheme  $(C_n=55~\text{fF})$  is used for the bottom transistor pair  $(M_1/M_2)$ . The input matching network is composed of a 1:1 transformer with parallel capacitors  $C_{in}=160~\text{fF}$  and a parallel resistor  $R_g=170~\Omega$ . The DC and peak current of the PA stage is 24 mA and 65/70/68~mA at 28/37/39~GHz, respectively.

### 2.3.3. Design 3: A Two-Stage Continuous-mode Class-F/F<sup>-1</sup> PA

Figure 2.8(c) shows the schematic of the two-stage differential hybrid continuous-mode Class-F/F<sup>-1</sup> PA design, consisting of a DR stage and a PA stage. This design is an extended version of the one-stage differential hybrid continuous-mode Class-F/F<sup>-1</sup> PA. Thus, both designs cooperate the identical continuous-mode hybrid Class-F/F<sup>-1</sup> PA output networks. The DR stage is realized using a CS topology with identical sizes (W/L=6×30  $\mu$ m/40 nm) for M<sub>5</sub> and M<sub>6</sub>. It is biased at V<sub>G\_DR</sub>=0.32 V, and V<sub>DD\_DR</sub>=0.8 V. In addition, the PA stage is also realized using a cascode topology with identical sizes (W/L=6×30  $\mu$ m/40 nm) for M<sub>7</sub>, M<sub>8</sub>, M<sub>9</sub>, and M<sub>10</sub> that are biased at V<sub>G\_PA</sub>=0.32 V, V<sub>cas\_PA</sub>=1.4 V, and V<sub>DD\_PA</sub>=2 V. The neutralization capacitors (C<sub>n\_DR</sub>) for DR stage are 55 fF and the neutralization capacitors (C<sub>n\_PA</sub>) for PA stage are 55 fF, respectively. The input and interstage matching networks of this two-stage PA design are also realized by two 1:1 transformer, and two input capacitors (C<sub>in2</sub>=140 fF). The DC and peak current of the PA stage is 25 mA and 70/80/87 mA at 28/37/39 GHz, respectively.



Figure 2.8 – Schematic of (a) two-stage continuous-mode Class-F<sup>-1</sup> PA, (b) one- and (c) two-stage continuous-mode hybrid Class-F/F<sup>-1</sup> PAs.

#### 2.1.6. Measurement Results

## 2.4.1. Design 1: A Two-Stage Continuous-Mode Class-F-1 PA

The first PA design occupies a 0.91×0.32 mm<sup>2</sup> core area excluding pads (Figure 2.9a). All designs are measured by direct probing. The CW large-signal and small-signal S-parameter measurements are shown in Figure 2.10. At 28.5 GHz, this PA achieves P<sub>sat</sub> of 17 dBm and P<sub>1dB</sub> of 15.2 dBm, the power gain (G<sub>p</sub>) of 20 dB and peak PAE (PAE<sub>max</sub>) of 43.5 % (Figure 2.10a). Figure 2.10(c) shows 20.3 dB of peak S<sub>21</sub> for the measured smallsignal S-parament. The measured P<sub>sat</sub> is from 16.4 to 17.4 dBm from 19 to 29.5 GHz, achieving a 43.3 % P<sub>sat</sub> 1-dB (BW<sub>1dB</sub>) bandwidth (Figure 2.10b). The measured PAE includes the loss of the DR stage, PA stage and the output network. This PA is first measured using 64-QAM signals at 1.5 GSym/s (9 Gb/s) and 3 GSym/s (18 Gb/s) at the carrier frequency (f<sub>carrier</sub>) of 28.5 GHz (Figure 2.11a). Without DPD, the measured EVM is below <-25 dB for all data rates. At 3 GSym/s, the EVM is -25 dB with average output power (P<sub>avg</sub>) of 9.8 dBm and average PAE (PAE<sub>avg</sub>) of 18.4 %. Next, this PA is measured using 256-QAM signals at 0.8 GSym/s (6.4 Gb/s) and 1GSym/s (8 Gb/s) at f<sub>carrier</sub> of 28.5 GHz (Figure 2.11b). The EVM is kept below -30 dB for all data rates. At 1 GSym/s, the EVM is -30 dB with P<sub>avg</sub> of 8.7 dBm and PAE<sub>avg</sub> of 16.3 %. Noted that the roll-off factor  $(\alpha)$  of the raised-cosine shaped filter is 0.35, the same setting as the following modulation measurements.



Figure 2.9 – Chip microphotograph of two-stage continuous-mode Class-F-1 PA



Figure 2.10 - (a) measured CW large-signal performance at 28.5 GHz, (b) measured CW large-signal performance vs. frequency and (c) measured small-signal S-parameter of two-stage continuous-mode Class-F-1 PA.



Figure 2.11 – (a) 64-QAM constellation, (b) 256-QAM constellation, (c) 64-QAM spectrum and (d) 256-QAM spectrum at f<sub>carrier</sub> of 28.5 GHz of Design 3.

## 2.4.2. Design 2: A One-Stage Continuous-mode Class-F/F<sup>-1</sup> PA

The second PA design occupies a  $0.55\times0.25~\text{mm}^2$  core area, as shown in Figure 2.12. Figure 2.13 shows the measured CW large-signal performance at 28, 37 and 39 GHz respectively. At 28 GHz, this proposed PA design achieves  $P_{\text{sat}}$  of 18.6 dBm,  $PAE_{\text{max}}$  of 45.7 % and  $G_p$  of 11.4 dB. At 37 GHz, the PA demonstrates  $P_{\text{sat}}$  of 18.6 dBm,  $PAE_{\text{max}}$  of 40.2 % and  $G_p$  of 10.7 dB. At 39 GHz, the PA achieves  $P_{\text{sat}}$  of 18.5 dBm,  $PAE_{\text{max}}$  of 41.2 % and  $G_p$  of 10.5 dB. This PA achieves high efficiency (i.e.,  $PAE_{\text{max}} \ge 40$  %) and delivers almost constant  $P_{\text{sat}}$  at all the potential 5G bands (28/37/39 GHz). The  $P_{\text{sat}}$  1-dB bandwidth is 54.3 % from 23.5 GHz to 41 GHz and the peak PAE is 46 % at 29 GHz, as shown in Figure 2.13(c). Also, it maintains over 30 % PAE from 25.5 GHz to 41 GHz (46.6 %). Figure 2.13(d) also demonstrates that the Class-F mode operates around 28 GHz, while Class-F-1 mode operates around 38 GHz. A mode transition is clearly shown around 35 GHz, which matches our analysis in Section III. In Figure 2.13(e), the measured small-signal S-parameter shows that 3-dB bandwidth is 51 % (25.9-43.7 GHz.)



Figure 2.12 – Chip microphotograph of one-stage continuous-mode hybrid Class-F/F<sup>-1</sup> PA.



Figure 2.13 – One-stage continuous-mode hybrid Class- $F/F^{-1}$  PA CW large-signal measurement at (a) 28 GHz, (b) 37 GHz and (c) 39 GHz, (d) Psat/PAE vs. frequency, and (e) small-signal S-parameter.

## 2.4.3. Design 3: A Two-Stage Continuous-mode Class-F/-F<sup>-1</sup> PA

The third PA design occupies a  $0.82 \times 0.25$  mm<sup>2</sup> core area, as shown in Figure 2.14. Figure 2.15 shows the measured CW large-signal performance at 28, 37 and 39 GHz. At 28 GHz, this PA achieves P<sub>sat</sub> of 18.9 dBm, PAE<sub>max</sub> of 43.2 % and G<sub>p</sub> of 18.7 dB. At 37 GHz, the PA demonstrates P<sub>sat</sub> of 18.9dBm, PAE<sub>max</sub> of 37 % and Gp of 18 dB. At 39 GHz, the PA achieves P<sub>sat</sub> of 18.9 dBm, PAE<sub>max</sub> of 36 % and G<sub>p</sub> of 15.6 dB. The measured CW large-signal performance vs. frequency is shown in Figure 2.15(d). The P<sub>sat</sub> 1-dB bandwidth is 55.1 % from 23 GHz to 40.5 GHz. This design maintains over 30 % PAE from 24 GHz to 40 GHz (50 %). A mode transition is clearly shown around 33 GHz. The measured small-signal S-parameter shows 3-dB bandwidth is 49.4 % from 23.8 GHz to 39.4 GHz (Figure 2.15d). This PA is measured using 64-QAM signals at 0.5 GSym/s (3 Gb/s) at f<sub>carrier</sub> of 24 GHz, 28 GHz, 37 GHz and 39 GHz respectively (Figure 2.16). Without DPD, the measured EVM is below <-25 dB. At  $f_{carrier}$  of 24 GHz,  $P_{avg}$  of 9.8 dBm and PAE of 9 %. At f<sub>carrier</sub> of 28 GHz, P<sub>avg</sub> is 10.3 dBm and PAE<sub>avg</sub> is 13.1 %. At f<sub>carrier</sub> of 37 GHz, Pavg is 11.7 dBm and PAEavg is 11.9 %. At f<sub>carrier</sub> of 39 GHz, Pavg is 11 dBm and PAEavg is 10.2 %. This PA design satisfies the stringent linearity requirement for future 5G bands (28/37/39 GHz). The large-signal and modulation comparisons with other PAs are listed in Table 2-1 and Table 2-2, respectively.



Figure 2.14 – Chip microphotograph of two-stage continuous-mode hybrid Class- $F/F^{-1}$  PA.



Figure 2.15 – Two-stage continuous-mode hybrid Class-F/F<sup>-1</sup> PA CW large-signal measurement at (a) 28 GHz, (b) 37 GHz and (c) 39 GHz, (d) Psat/PAE vs. frequency, and (e) small-signal S-parameter.



Figure 2.16 – Two-stage continuous-mode hybrid Class-F/F<sup>-1</sup> PA modulation measurement results (0.5 GSym/s) at (a) 24 GHz, (b) 28 GHz, (c) 37 GHz, (d) 39 GHz respectively.

Table 2.2 – CW Performance Comparison with State-of-the art Silicon-based Mm-Wave Power Amplifier at Related Frequency

|                   | P <sub>sat</sub> 1-dB<br>Freq.<br>(GHz) | P <sub>sat</sub> 1dB<br>BW (%) | Operation<br>Freq. (GHz) | P <sub>sat</sub> (dBm) | PAE <sub>max</sub> (%) | Gain<br>(dB)         | OP1dB<br>(dBm)       | VDD<br>(V) | Process             | Topology                                             | Size<br>(mm²) |
|-------------------|-----------------------------------------|--------------------------------|--------------------------|------------------------|------------------------|----------------------|----------------------|------------|---------------------|------------------------------------------------------|---------------|
| Design 1          | 19-29.5                                 | 43.3                           | 28.5                     | 17                     | 43.5                   | 20                   | 15.2                 | 1.9        | 130nm<br>SiGe       | 2-stg. Continuous Class-F-1                          | 0.29          |
| Design 2          | 23.5-41                                 | 54.3                           | 28<br>37<br>39           | 18.6<br>18.6<br>18.5   | 45.7<br>40.2<br>41.2   | 11.4<br>10.7<br>10.5 | 16.6<br>16.3<br>16.3 | 2          | 45nm<br>SOI<br>CMOS | 1-stg. Continuous Hybrid<br>Class-F/ F <sup>-1</sup> | 0.14          |
| Design 3          | 23-40.5                                 | 55.1                           | 28<br>37<br>39           | 18.9<br>18.9<br>18.9   | 43.2<br>37<br>36       | 18.7<br>18<br>15.6   | 16.9<br>17<br>17.4   | 2          | 45nm<br>SOI<br>CMOS | 2-stg. Continuous Hybrid<br>Class-F/ F <sup>-1</sup> | 0.21          |
| [7] Hu            | 28-42                                   | 40                             | 28<br>37<br>39           | 16.8<br>17.1<br>17     | 20.3<br>22.6<br>21.4   | 18.2<br>17.1<br>16.6 | 15.2<br>15.5<br>15.4 | 1.5        | 130nm<br>SiGe       | 2-stg. Doherty                                       | 1.76          |
| [35] Sarkar       | 27-39                                   | 7.1                            | 28                       | 18.6                   | 35.3                   | 15.3                 | 15.5                 | 3.6        | 130nm<br>SiGe       | 1-stg. Continuous Class-AB                           | 0.27          |
| [20] Ali          | 26-34                                   | 26.7                           | 29                       | 14.75                  | 46.4                   | 10                   | 13.2                 | 1.1        | 65nm<br>CMOS        | 1-stg. Continuous Class-F                            | 0.12          |
| [22] Ali          | 27-30*                                  | 10.5*                          | 28                       | 15.6                   | 41                     | 15.8                 | 14                   | 1.1        | 65nm<br>CMOS        | 2-stg Continuous Class-F<br>w/ Xfmr AM/PM correction | 0.24          |
| [15] Vigilante    | 25-48***                                | 63*                            | 43                       | 16.6                   | 24.2                   | 20.8                 | 13.4                 | 0.9        | 28nm<br>CMOS        | Class-AB w/ power combiner                           | 0.16          |
| [18] Zhang        | 26-29*                                  | 10.9                           | 27                       | 18.1                   | 41.5                   | 20.5                 | 16.8                 | 1          | 40nm<br>CMOS        | 2-Way 2-stg. CS<br>w/ Inductive degeneration         | 0.36          |
| [8] Indirayanti   | 28-33*                                  | 22.2*                          | 32                       | 19.8                   | 21                     | 22                   | 16                   | 1          | 28nm<br>CMOS        | 2-Way Xfmr-based Doherty                             | 0.59          |
| [37]<br>Mortazavi | 24-31                                   | 25.5                           | 27                       | 17.1                   | 40                     | 10.3                 | 15                   | 2.2        | 130nm<br>SiGe       | 1-stg. Hybrid Class- F-1/F                           | 0.27          |
| [38]<br>Mortazavi | 36-39.5*                                | 9.3*                           | 38                       | 16.5                   | 38.5                   | 16                   | 15                   | 2.4        | 130nm<br>SiGe       | 2-stg. Class- F-1                                    | 0.5           |
| [16] Shakib       | 27-31*                                  | 13.8                           | 30                       | 15.3                   | 36.6                   | 16.3                 | 14.3                 | 1.15       | 28nm<br>CMOS        | 2-stg. CS w/ Inductive degeneration                  | 0.16          |
| [17] Shakib       | 26-33*                                  | 23.3*                          | 27                       | 15.1                   | 33.7                   | 22.4                 | 13.7                 | 1.1        | 40nm<br>CMOS        | 3-stg. w/<br>dual-resonance Xfmr                     | 0.23          |
| [19] Park         | 26-28.5*                                | 9.2*                           | 28                       | 19.8                   | 28.5                   | 13.6                 | 18.6                 | 2.2        | 28nm<br>CMOS        | 1-stg. 2-stacked Class-<br>AB                        | 0.28          |
| [21] Huang        | 22-30*                                  | 30.7                           | 28                       | 26                     | 34.1                   | 16.3                 | 23.2                 | 2.4        | 90nm<br>CMOS        | 2-Way 1-stg. Cascode w/<br>Xfmr combiner             | 0.4**         |
| [1] Rabet         | -                                       | -                              | 28                       | 23                     | 41.4                   | -                    | -                    | 4          | 130nm<br>SiGe       | Outphasing<br>w/ Triaxial Balun                      | 0.56**        |
| [32] Datta        | 39-43*                                  | 4*                             | 41                       | 23.4                   | 34.9                   | 12.5                 | -                    | 4.5        | 130nm<br>SiGe       | 2-stacked Class-E                                    | 1             |
| [10] Chappidi     | 30-55                                   | 58.8                           | 40                       | 23.7                   | 28.5                   | 23.4                 | -                    | 4          | 130nm<br>SiGe       | Dual-Freq. PBO reconfigurable                        | 0.96          |

\*Graphically estimated from reported figures, \*\*Pads included, \*\*\*Small-signal -3dB BW.

Table 2.3 – Modulation Performance Comparison With State-of-the art Siliconbased Mm-Wave Power Amplifier at Related Frequency

|                                         | Carrier<br>Frequency (GHz) | Modulation<br>Scheme             | Data Rate (Gb/s | ) EVM (dB)                       | P <sub>out</sub> @EVM (dBm) | PAE@EVM (%)            |
|-----------------------------------------|----------------------------|----------------------------------|-----------------|----------------------------------|-----------------------------|------------------------|
| Design 1                                | 28.5                       | 64-QAM 1-CC<br>256-QAM 1-CC      | 9/18<br>6.4/8   | -26.8*/-25*<br>-30.5*/-30.5*     | 10.7/9.8<br>8.8/8.7         | 21.5/18.4<br>16.7/16.3 |
| Design 3                                | 24/28/37/39                | 64-QAM 1-CC                      | 3               | 25.2*/-28.1*/-29.2*<br>/-28.1*   | 9.8/10.3/11.7/11            | 9/13.1/11.9/10.2       |
| [7] Hu                                  | 28/3739                    | 64-QAM                           | 6/3/3           | -26.6*/-30.3*/-28.7*             | 7.2/9.5/9.3                 | 14.44/19.24/17.24      |
| [20] Ali                                | 28                         | 64-QAM<br>256-QAM                | 2<br>0.4        | -25.6 <b>*</b><br>-31.7 <b>*</b> | 10.4<br>9.4                 | 19.3<br>16.3           |
| [21] Huang<br>[18] Zhang<br>[35] Sarkar | 28<br>27<br>28             | 256-QAM<br>64-QAM<br>16-QAM OFDM | 0.8<br>6<br>3.2 | -32*<br>-25#<br>-22*             | 20<br>8.4<br>12.6*          | 8.8<br>11.5            |
| [38] Mortazavi                          | 38                         | 64-QAM/128-<br>QAM               | 0.049/0.049     | -26*                             | 13.5/13.5                   | 20*/20*                |
| [15] Vigilante<br>[8] Indirayanti       | 34<br>32                   | 64-QAM<br>64-QAM                 | 6<br>15         | -25#<br>-25 <b>*</b>             | 5.9<br>11.7                 | 2.3<br>5.75            |
| [16] Shakib                             | 30                         | 64-QAM OFDM<br>1-CC              | 1.5             | -25#                             | 5.3                         | 9.6                    |
| [17] Shakib                             | 27                         | 64-QAM OFDM<br>8-CC              | 4.8             | -25#                             | 6.7                         | 11                     |
| [1] Rabet<br>[10] Chappidi              | 28<br>30/50                | 64-QAM OFDM<br>16-QAM            | 0.48+<br>4      | -30.5#<br>-18.2*/19.2*           | 14.3<br>16.4/16.9           | 25.3<br>19.9&/24.6&    |

#Normalized w.r.t average power, \*Normalized w.r.t peak power, \*Memoryless DPD, &Collector efficiency, \*Graphical estimation.

### 2.1.7. Chapter Summary

In this chapter, we present three differential fully integrated continuous-mode PAs, implemented in 130 µm SiGe (design 1) and 45 nm CMOS SOI (design 2 and 3) processes. These PAs utilize our proposed transformer-based continuous-mode harmonically-tuned PA output networks to provide the required fundamental, 2<sup>nd</sup>- and 3<sup>rd</sup>-order harmonic load impedance terminations respectively. All designs achieve high PAE and ultra-wide P<sub>sat</sub> 1dB bandwidth. Importantly, our continuous-mode PA output network only occupies one single transformer footprint and does not require any additional tunable elements or switches, providing an ultra-compact design for massive MIMO applications. Additionally, the modulation measurements meet the stringent 5G linearity requirement.

# CHAPTER 3. A POWER AMPLIFIER WITH NMOS/PMOS NONLINEAITY CANCELLATION SCHEME

### 3.1. Introduction

The next-generation millimeter-wave (mm-Wave) wireless communication systems are required to support spectrum-efficient modulation schemes (e.g., 64-QAM, 256-QAM or 5G NR) with Gb/s link throughput. Also, these modulations accompany single and/or multiple component-carriers (CCs), i.e., orthogonal frequency-division multiplexing (OFDM) with multiple CCs [53]-[56]. These high-order modulation schemes entail complex constellations and pose a highly stringent demand on the linearity of the mm-wave front-end circuits, especially power amplifiers (PAs), i.e., large-signal amplitude-to-amplitude (AM-AM) and amplitude-to-phase (AM-PM) distortions, as well as the PA energy efficiency [53]-[82]. Therefore, the future mm-Wave PA solutions need to offer high linearity, high efficiency, required output power, simple design and compact size simultaneously for phased array and/or massive MIMO applications [83]-[86].

To boost the PA efficiency, the mm-Wave PAs usually are biased in deep Class-AB toward Class-B regions, instead of Class-A region. By this means, the PA efficiency is enhanced effectively but it sacrifices the PA linearity, resulting PAs with poor error-vector-magnitude (EVM) and adjacent channel power ratio (ACPR) performances. Those metrics are the major performance indexes of the next generation wireless communication systems. In other words, those high-efficiency-oriented mm-Wave PAs cannot accommodate the stringent linearity requirements for future spectrum-efficient modulation schemes.

Practically, the major PA AM-PM distortion comes from the nonlinear capacitors, such as drain-to-source  $C_{gs}$  and gate-to-drain  $C_{gd}$  capacitors in MOSFET devices. Mm-Wave PAs often utilize neutralization capacitors  $C_n$  to improve the power gain  $G_p$  and stability. However, fixed  $C_n$  capacitors cannot completely cancel the varying  $C_{gd}$  values as well as its varying Miller input capacitances due to device gain compression. Moreover, the  $C_{gs}$  also vary substantially versus input voltage amplitudes and lead to AM-PM distortions. Therefore, several RF/mm-Wave PA linearity improvement techniques without DPD are reported, such as input PMOS capacitance cancellation [87], multi-gated transistors (MGTRs) [88][89], adaptive biasing, harmonic trapping [90], PMOS neutralization capacitors [91] and push-pull NMOS/PMOS transistor compensation [92][93]. Those PA linearity improvement techniques can minimize the nonlinearity, but they usually require additional biasing circuitries/routings, complicate devices arrangements, sophisticated PA output matching network designs/layouts and extra resonators including multiple inductors and capacitors. Even though PA linearity can be improved by those techniques, they inevitably induce extra loss to further degrade the PA overall performance, especially at mm-Wave frequency ranges. Thus, this is a challenge to improve the mm-Wave PA linearity and maintain performance simultaneously.

To avoid the aforementioned drawbacks, the NMOS/PMOS transistor compensation technique is an excellent candidate [92]-[94]. Reference [92] combines NMOS and PMOS devices since they exhibit the opposite nonlinear behaviors for  $C_{gs}$ . However, this push-pull NMOS/PMOS compensation enables cancellation of multiple device-level nonlinearities (capacitors, transconductance, and output conductance), the existing designs arrange NMOS/PMOS in an inverter-like configuration with less-than the

supply voltage ( $V_{DD}$ ) output voltage swings and undesired bias interactions, limiting their use in scaled CMOS technologies for high efficiency mm-Wave PAs. The similar topology is reported in ref. [92]. Although, in the AC perspective, NMOS and PMOS transistors are not physically connected since the primary coil of the PA output transform is inserted between PMOS and NMOS, in the DC perspective, this topology is still an inverter-like one, still limiting the PA performance at mm-Wave frequencies.

To address this challenge, we present a doubly hybrid NMOS/PMOS V-band PA topology with a transformer-based 4-way series-parallel distributed-active-transformer (DAT) output matching network. This topology preserves the cancellation of multiple device nonlinearities but allows large output voltage swings close-to two times of supply voltage (i.e.,  $2 \times V_{DD}$ ) to avoid compromising output power or efficiency due to device knee voltages  $(V_{knee})$  and limited voltage headroom. Moreover, it decouples the DC gate and drain biasing of the NMOS/PMOS devices, and different biasing points and duty-cycles can be applied on the NMOS and PMOS PA devices to collectively optimize the AM-PM and AM-AM linearity, saturated power (P<sub>sat</sub>), output 1-dB compression point (OP<sub>1dB</sub>), and maximum/power-backoff (PBO) PAE, etc. Further, NMOS and PMOS power devices have been mixed at the PA, driver (DR), and pre-driver (PDR) stages to achieve doubly-hybrid NMOS/PMOS nonlinearity cancellation. Also, the 4-way series-parallel DAT output matching network provides low loss power combining and desired PA optimum loads. Our PA design covers 50.4 GHz-58.6 GHz with 35.4 % peak PAE, 16.3dBm Psat, and only 0.4° AM-PM distortion at 55 GHz. The AM-PM distortion is below 2° over 50-57GHz. It supports 64-QAM modulation with 15.6 % PAE<sub>avg</sub> and 9.5 dBm P<sub>avg</sub> at 0.5 GSym/s as well as 14.5 % PAE<sub>avg</sub> and 10.8 dBm of P<sub>avg</sub> for 5G NR CP-OFDM 64-QAM 1-CC 400 MHz modulation. This design not only supports single CC but also multiple CCs, showing the capacity of accommodate advanced complex modulations.

### 3.2. MOSFET AM-AM and AM-PM Distortion Analysis

Figure 3.1 shows a differential PA with the input and output matching networks (e.g.,  $L_{in}$  and  $L_{out}$ ), and its simplified small-signal single-ended half-circuit. Here,  $C_{gs}$ ,  $C_{gd}$ ,  $C_{ds}$ ,  $g_m$ ,  $g_{ds}$ ,  $R_s$  and  $R_L$  represent the small-signal parasitic drain-to-source, gate-to-drain, drain-to-source capacitors, transconductance, gate-to-source conductance, source and load resistors, respectively. The output gate-to-source resistor is reverse to  $g_{ds}$ , i.e.,  $R_{ds} = 1/g_{ds}$ . For delivering maximum output power, we assume  $R_L$  has been converted to the optimum load  $R_{opt}$  which is determined by the breakdown voltage,  $V_{knee}$ , and maximum output current by the output matching network. Also, most mm-Wave PAs utilize the neutralization capacitor  $C_n$  pairs to improve the power gain  $G_p$  and stability.

The input and output referred  $C_{gd}$  capacitors due to Miller's effect with  $C_n$  can be expressed respectively as

$$C_{od}' = (1 + g_m R_{out} \| R_{ds}) (C_{od} - C_n),$$
 (3-1)

and

$$C_{gd}$$
" =  $(1+1/g_m R_{opt} || R_{ds}) (C_{gd} - C_n)$ . (3-2)

Thus, the effective input capacitor  $C_{in}$  and output capacitor  $C_{out}$  can be represented as  $C_{in}=C_{gs}+C_{gd}$ ' and  $C_{out}=C_{ds}+C_{gd}$ " respectively. Figure 3.2(a) shows the various extracted small-signal parasitics parameters (i.e., 45nm SOI process) of a NMOS transistor

versus the gate-to-source voltage  $V_{gs}$ . Again, to boost PA efficiency, the PA is biased in deep Class-AB toward Class-B regions. So, when a PA is fed by an input signal with a small voltage amplitude, the input voltage swing crosses the saturation region and cutoff region, (e.g., the red waveform in Figure 3.2a). The conduction angle  $\alpha$  is defined as the proportion of the input voltage swing in the saturation region to the cutoff region per cycle. Likely, when a PA is fed by an input signal with a large voltage amplitude (e.g., the blue waveform in Figure 3.2b), the duration angle  $\beta$  is defined as the proportion of the input voltage swing in triode region to saturation region per cycle. For example,  $\beta = 0$  means the transistor works in the saturation ( $\alpha = 2\pi$ ) or saturation-and-cutoff ( $\alpha < 2\pi$ ) regions and  $\beta > 0$  means the transistor works in triode region partially per cycle and the PA starts to be compressed, depended on the input voltage swings. Thus, considering  $\alpha$  and  $\beta$ , the effective input capacitor can be re-written as

$$C_{in}(\alpha, \beta) = \left(1 - \frac{\beta}{2\pi}\right) \left(\frac{\alpha}{2\pi}\right) C_{gs}^{sat.} + \left(1 - \frac{\alpha}{2\pi}\right) C_{gs}^{cutoff} + \left(\frac{\beta}{2\pi}\right) C_{gs}^{triode} + \left[1 + g_m(\alpha, \beta)R_L \|R_{ds}(\alpha, \beta)\right] \times \left[\left(1 - \frac{\beta}{2\pi}\right) \left(\frac{\alpha}{2\pi}\right) C_{gd}^{sat.} + \left(\frac{\beta}{2\pi}\right) \left(1 - \frac{\alpha}{2\pi}\right) C_{gd}^{triode} - C_n\right],$$
(3-3)

where the  $C_{gs}^{sat.}$  and  $C_{gs}^{cutoff}$  are the values of  $C_{gs}$  when the PA works in the saturation and cutoff regions respectively. Similarly, the  $C_{gd}^{sat.}$  and  $C_{gd}^{triode.}$  are the values of  $C_{gd}$  when the PA works in the saturation and triode region respectively. Thus, the effective input capacitor can be re-expressed as

$$C_{out}(\alpha, \beta) = \left(1 - \frac{\beta}{2\pi}\right) \left(\frac{\alpha}{2\pi}\right) C_{ds}^{sat.} + \left(1 - \frac{\alpha}{2\pi}\right) C_{ds}^{cutoff}$$

$$+ \left(\frac{\beta}{2\pi}\right) C_{ds}^{triode} + \left[1 + \frac{1}{g_m(\alpha, \beta) R_L \|R_{ds}(\alpha, \beta)\|}\right]$$

$$\times \left[\left(1 - \frac{\beta}{2\pi}\right) \left(\frac{\alpha}{2\pi}\right) C_{gd}^{sat.} + \left(\frac{\beta}{2\pi}\right) \left(1 - \frac{\alpha}{2\pi}\right) C_{gd}^{triode} - C_n\right].$$
(3-4)

It can be observed that The effective input and output capacitors are time-average and considerably modulated by  $\alpha$  and  $\beta$  with respect to the input voltage amplitude, generating undesired AM-PM distortion. Additionally, the drain current and  $g_m$  of the device shows an exponential behavior for applying a low  $V_{gs}$ , leading the device to a quadratic region with a dramatic change. Since those nonlinear transconductance  $g_m(\alpha,\beta)$ and parasitic drain-to-source resistor  $R_{ds}(\alpha,\beta)$  generate higher-order components and also are also associated to the  $\alpha$  and  $\beta$ , they achieve gain peaking or/and compression on AM-AM distortion [94]-[97]. The capacitance value of the neutralization capacitor  $C_n$  is chosen to maximize the PA stability and keep the PA unconditionally stable, so it can be determined as  $C_n = C_{gs}^{sat}$ . Although  $C_n$  seems to delete  $C_{gs}$  and cancel out the drain-to-gate feedback, practically it contributes  $2 \times C_n$  from the drain terminal to the gate terminal at the second harmonic frequency, which remixes with the fundamental components to generate unwanted third-order nonlinearity. Figure 3.2(a) also shows that  $C_{gd}$  exhibits a constant value cross the saturation and triode regions, i.e.,  $C_{ds}^{sat.} \approx C_{ds}^{triode}$ , as well as  $C_{gd}$  also shows a flat variation over the all operating regions, i.e.,  $C_{dg}^{sat.} \approx C_{dg}^{triode} \approx C_{dg}^{cutoff}$ . Therefore, the effective input and output capacitors can be simplified respectively as

$$C_{in}(\alpha,\beta) = \left(1 - \frac{\beta}{2\pi}\right) \left(\frac{\alpha}{2\pi}\right) C_{gs}^{sat.} + \left(1 - \frac{\alpha}{2\pi}\right) C_{gs}^{cutoff} + \left(\frac{\beta}{2\pi}\right) C_{gs}^{triode}, \tag{3-5}$$

$$C_{out}\left(\alpha,\beta\right) = \left(1 - \frac{\beta}{2\pi}\right)\left(\frac{\alpha}{2\pi}\right)C_{ds}^{sat.} + \left(1 - \frac{\alpha}{2\pi}\right)C_{ds}^{cutoff} + \left(\frac{\beta}{2\pi}\right)C_{ds}^{triode}.$$
 (3-6)

Moreover, the output voltage  $V_L$  cross  $R_L$  can be written as

$$V_{L}(j\omega) = g_{m}I_{in}\frac{R_{ds}R_{opt}}{R_{S}}\frac{1 + j\omega C_{in}R_{S} - \omega^{2}L_{in}C_{in}}{1 + j\omega C_{out}(R_{ds} + R_{L}) - \omega^{2}L_{out}C_{out}}.$$
(3-7)

Then, the phase of a NMOS PA at certain operating frequency  $\omega_0$  can be presented as

$$\angle V_L(j\omega_0)|_{NMOS} = \tan^{-1}\left(\frac{\omega_0 C_{in} R_S}{1 - \omega_0^2 L_{in} C_{in}}\right) - \tan^{-1}\left[\frac{\omega_0 C_{out} (R_{ds} + R_L)}{1 - \omega_0^2 L_{out} C_{out}}\right],$$
 (3-8)

To achieve the desired PA performance, the inductors  $L_{in}$  and  $L_{out}$  of the input and output networks are designed to resonate out the  $C_{in}$  and  $C_{out}$  at  $P_{1dB}$  point separately. So, their values can be determined as

$$L_{in} = \frac{1}{\omega_0^2 C_{in} \left( \alpha = \alpha_{P1dB}, \beta = 0 \right)} = \frac{1}{\omega_0^2 \left[ \left( \frac{\alpha_{P1dB}}{2\pi} \right) C_{gs}^{sat.} + \left( 1 - \frac{\alpha_{P1dB}}{2\pi} \right) C_{gs}^{cutoff} \right]},$$
 (3-9)

as well as

$$L_{out} = \frac{1}{\omega_0^2 C_{out} \left( \alpha = \alpha_{P1dB}, \beta = 0 \right)} = \frac{1}{\omega_0^2 \left[ \left( \frac{\alpha_{P1dB}}{2\pi} \right) C_{ds}^{sat.} + \left( 1 - \frac{\alpha_{P1dB}}{2\pi} \right) C_{ds}^{cutoff} \right]}, \quad (3-10)$$

 $P_{1dB}$  point. However, due to the fixed inductors  $L_{in}$  and  $L_{out}$ , the effective input and output capacitors  $C_{in}$  and  $C_{out}$  cannot be perfectly tuned out in all operating regions. Thus, the AM-PM distortion of a NMOS PA can be expressed in (3-11), where  $\alpha_{ss}$  represent the conduction angle when an NMOS PA operates in the small-signal region. It is obvious that

the AM-PM distortion with a given transistor size is associate to  $\alpha_{ss}$ ,  $\alpha_{P1dB}$ ,  $R_L$ . In other words, we can set the PA with an appropriate bias voltage and the optimum load to reach very low AM-PM distortion but this PA cannot approach the desired power and efficiency, which is a well-known and troublesome trade-off in PA design.

$$\left| \frac{ZV_L'(j\omega_0)}{ZV_L'(j\omega_0)} \right|_{NMOS}$$

$$= tan^{-1} \left\{ \frac{\omega_0 \left[ \left( \frac{\alpha_{ss}}{2\pi} \right) C_{gs}^{sat.} \right|_{NMOS} + \left( 1 - \frac{\alpha_{ss}}{2\pi} \right) C_{gs}^{cutoff} \right|_{NMOS}}{1 - \frac{\left( \frac{\alpha_{ss}}{2\pi} \right) C_{gs}^{sat.} \right|_{NMOS} + \left( 1 - \frac{\alpha_{ss}}{2\pi} \right) C_{gs}^{cutoff} \right|_{NMOS}}{\left( \frac{\alpha_{PldB}}{2\pi} \right) C_{gs}^{sat.} \right|_{NMOS} + \left( 1 - \frac{\alpha_{PldB}}{2\pi} \right) C_{gs}^{cutoff} \right|_{NMOS}}$$

$$- tan^{-1} \left\{ \frac{\omega_0 \left[ \left( \frac{\alpha_{ss}}{2\pi} \right) C_{ds}^{sat.} \right|_{NMOS} + \left( 1 - \frac{\alpha_{ss}}{2\pi} \right) C_{ds}^{cutoff} \right|_{NMOS}}{1 - \frac{\left( \frac{\alpha_{ss}}{2\pi} \right) C_{ds}^{sat.} \right|_{NMOS} + \left( 1 - \frac{\alpha_{ss}}{2\pi} \right) C_{ds}^{cutoff} \right|_{NMOS}}{\left( \frac{\alpha_{PldB}}{2\pi} \right) C_{ds}^{sat.} \left|_{NMOS} + \left( 1 - \frac{\alpha_{PldB}}{2\pi} \right) C_{ds}^{cutoff} \right|_{NMOS}}$$

$$(3-11)$$

Figure 3.2(b) exhibits the various extracted small-signal parameters of a PMOS transistor versus the gate-to-source voltage  $V_{gs}$ . So, the AM-PM distortion of a PMOS PA can be expressed in (3-12). The corresponding AM-PM behaviors of NMOS and PMOS PAs are shown in Figure 3.3, caused by the time-average effective  $C_{in}$  and  $C_{out}$  [98]. The NMOS PA presents a negative AM-PM distortion at  $P_{1dB}$ , on the contrary, the PMOS PA presents a positive AM-PM distortion at  $P_{1dB}$ . Our idea is to utilize this opposite behavior to generate a flat AM-PM distortion at  $P_{1dB}$  or even beyond.



Figure 3.1 – A neutralized PA and its simplified single-ended half circuit.



Figure 3.2 – The extracted gate-to-source  $C_{gs}$ , gate-to-drain  $C_{gd}$  and drain-to-source  $C_{ds}$ , and transconductance  $g_{m}$  and drain-to-source conductance  $g_{ds}$  in (a) NMOS and (b) PMOS transistor.



Figure 3.3 – The AM-PM behaviors of NMOS and PMOS PAs.

### 3.3. Proposed NMOS/PMOS AM-PM Nonlinearity Cancellation Scheme

### 3.3.1. Concept of doubly NMOS/PMOS nonlinearity cancellation scheme

Figure 3.4 shows the design concept and diagram of our proposed doubly NMPS/PMOS nonlinearity cancellation scheme. According to Equation (11) and (12), the NMOS PA shows a static negative phase shift (i.e.,  $\angle \theta$ ) and contrarily the PMOS PA shows a static positive phase shift (i.e.,  $\angle \phi$ ) at P<sub>1dB</sub>, as also shown in Figure 3.3. Thus, the PDR stage and DR stage provide negative and positive AM-PM distortions  $\angle -\theta_{PDR}$  and  $\angle \phi_{DR}$ , respectively. The first hybrid nonlinearity cancellation scheme cascade PDR and DR stages to achieve the overall AM-PM distortion  $\angle (-\theta_{PDR} + \varphi_{DR})$  at the output of DR stage. So, we can minimize  $\angle (-\theta_{PDR} + \varphi_{DR})$  by optimizing proper bias voltage settings for PDR and DR stages, which provides a high-quality signal to feed the PA stage. Similarly, the NMOS PA and PMOS PA provide negative and positive AM-PM distortion  $\angle -\theta_{PA}$  and  $\angle -\theta_{PA}$ , respectively. So, the second hybrid nonlinearity cancellation scheme combines NMOS PA and PMOS PA at PA stage to achieve the total AM-PM distortion of  $\angle (-\theta_{PDR} + \varphi_{DR} + \varphi_{PA} - \theta_{PA})$  at the load  $R_L$ . Also, the total AM-PM distortion can be

optimized by adopting appropriate bias voltage settings for NMOS PA and PMOS PA at PA stage. Namely, we mix the PRD-DR and hybrid NMOS/PMOS PA nonlinearity cancellation to offer a doubly nonlinearity cancellation for the superior linearity performance. Besides, AM-AM peaking, caused by the nonlinear transconductance  $g_m$  ( $\alpha$ ,  $\beta$ ), can be suppressed by our doubly NMPS/PMOS nonlinearity cancellation technique with appropriate biasing settings, allowing an overall AM-AM cancellation.



Figure 3.4 – The concept of the proposed doubly nonlinearity cancellation scheme.

### 3.3.2. PA output network NMOS/PMOS Linearity Improvement Scheme

Our proposed PA leverages these device nonlinearity properties by combining the output drain currents of NMOS and PMOS PAs through a DAT power combiner for nonlinearity cancellation (Figure 3.5). The differential input signals  $V_{inn}^+/V_{inn}^-$  and  $V_{inp}^+/V_{inp}^-$  drive the NMOS and PMOS PAs, and the resulting NMOS/PMOS PA drain currents are  $I_n$  and  $I_p$ , respectively. The drain currents In and Ip further induces  $I_n$  and  $I_p$  and combines as  $I_{n/p}$  at the secondary coil of the combiner, where they cancel the opposite AM-AM/AM-PM behaviors caused by the nonlinear capacitors and transconductances  $g_m$ . The AM-AM and AM-PM distortion curves for NMOS/PMOS-only PAs and the proposed hybrid PA are shown in Figure 3.4. The combined current  $I_{n/p}$  presents complementary amplitude/phase behaviors and minimizes the AM-AM and AM-PM distortions at P1dB for superior linearity performance. Moreover, since NMOS and PMOS PAs are both loaded by transformers, both can achieve full voltage swings and avoid headroom issues due to  $V_{knee}$  and limited  $V_{DD}$  without losing power and efficiency.

### 3.3.3. Series-Parallel DAT Output Power Combiner

This series-parallel DAT-based power combiner consists of a differential transformer network with eight single-ended (i.e., four differential) primary inputs and a differential secondary output connected to the load ( $R_L$ ). The eight single-ended inputs driven by the differential outputs of total four NMOS/PMOS PAs. Additionally, two pairs of NMOS and PMOS PAs are placed oppositely for simple DAT routing and  $V_{DD}$ /GND arrangement (Figure 3.6).

A DAT-based power combiner is realized by a slab type transformer to achieve both higher quality factor and series power combining for the two NMOS PAs or two PMOS PAs, while the parallel combining is used to combine the NMOS/PMOS PA outputs for nonlinearity cancellation [99]. The outer diameter of the series-parallel DAT is 130  $\mu$ m. The coil width is 8 $\mu$ m, and coil-coil spacing is 3  $\mu$ m. The primary coils use one 3.9 $\mu$ m copper layer (OB layer) for the signal traces and the DC paths to NMOS PA  $V_{DD}$  with another 3.9  $\mu$ m copper layer (OA layer) as the DC underpasses to PMOS PA GND, while the secondary coil uses the top 4.1 $\mu$ m aluminum (LD layer). Importantly, the two primary coils are not connected while the correct DC voltages DAT operation are provided. The simulated passive efficiency of the proposed output network is over 80% from 40 to 70 GHz (Figure 3.7).



Figure 3.5- The schematic of the hybrid NMOS/PMOS nonlinearity cancellation scheme.



Figure 3.6 – The 3D EM model of the hybrid NMOS/PMOS nonlinearity cancellation PA with 4-way series-parallel DAT.



Figure 3.7 – The simulated passive efficiency of the proposed series-parallel DAT.

### 3.4. PA Implementation

Figure 3.8 shows the schematic of the proposed doubly hybrid NMOS/PMOS nonlinearity cancellation PA. This PA is implemented in GlobalFoundries 45nm CMOS SOI process. To provide desired gain and output power, this PA employs two PA branches, each branch includes three stages, such as PA, DR, and PDR. All three stages adopt common-source topology with neutralization capacitors to improve power gain, reverse isolation and stability. Moreover, each PA consists of both NMOS and PMOS devices, while the DRs and the PDRs adopt PMOS and NMOS devices respectively. Such a doublehybrid configuration achieves further PA nonlinearity cancellations as discussed in the previous section. The  $V_{DD}$  for the PA, DR and PDR stages are 1.1V, 0.9V and 0.8V, respectively. Moreover, the neutralization capacitors for each stage (e.g.,  $C_{n\_nmos}$ ,  $C_{n\_pmos}$ ,  $C_{n\_DR}$  and  $C_{n\_PDR}$ ) are 30 fF, 37 fF, 26 fF and 12 fF, respectively. The size ratio between PA, DR and PDR is  $3(\times 2)$ :3:1, and the practical PA size is marked in Fig. 3.9. The interstage transformer matching network between the PA and DR stages also serves as a 1:2 power splitter to feed both PMOS and NMOS PAs. Proceeding each PDR, a transformer balun transfers the single-ended input signal to the differential signal. For acquiring a better matching between each branch, a single-ended signal is directly split by two equal length transmission lines, instead of using a differential signal. Thus, the input matching network ties the balun of each path by the transmission lines (180  $\mu$ m, Z0=66  $\Omega$ ) and employs two parallel 30 fF capacitors and one series capacitor 35 fF at the pad for input matching. Figure 3.9 shows the chip microphotographic of our proposed PA, it occupies a 0.66×0.27 mm<sup>2</sup> core area excluding pads. This PA is measured by direct probing.



Figure 3.8 – The schematic of the proposed PA.



Figure 3.9 – The chip microphotographic.

### 3.5. Measurement Results

A giant benefit of our proposed PA is that the bias voltages (e.g.,  $V_{Gnmos}$  and  $V_{Gpmos}$ ) for the NMOS and PMOS PAs can be set independently. Unlike ref. [109], for this inverter-like NMOS/PMOS topology, the NMOS and PMOS transistors are physically connected, which cannot arbitrarily set gate voltage bias for NMOS and PMOS PA respectively, since their bias points are associated substantially, the same as the similar topology in ref. [109]. Thus, our hybrid NMOS/PMOS PA topology can provide contour plots (Figure 3.10) with different NMOS and PMOS gate voltage bias settings of the measured (a)AM-AM distortion, (b) AM-PM distortion, (c)  $P_{sat}$ , (d)  $OP_{1dB}$  (e)  $G_p$ , (f)  $PAE_{max}$ , (g)  $PAE_{P1dB}$ , and (h) PAE at 6dB PBO ( $PAE_{6dB\_PBO}$ ) at 55 GHz, respectively. In those contours, the  $V_{Gnmos}$  is varied from 0.2 V to 0.4 V, while  $V_{Gpmos}$  is varied from 0.7 V to 0.9 V, and the voltage step is 0.02V for each one.

Those contours show that the PA can be optimized for one or multiple performance specifications. A high-linearity mode (HLM) and a high-efficiency mode (HEM) are selected with different bias settings. HLM is set as ( $V_{Gnmos}$ ,  $V_{Gpmos}$ ) = (0.26 V, 0.82 V) and HLM is set as ( $V_{Gnmos}$ ,  $V_{Gpmos}$ ) = (0.22 V, 0.86 V), respectively. The detailed PA performance for each mode are shown in the following section. Figure 3.11 shows the CW large-signal measurements for HLM and HEM respectively at 55 GHz, highlighted as yellow and magenta stars. HLM achieves 16.3 dBm  $P_{sat}$ , 13.5 dBm  $OP_{1dB}$ , 18.8dB  $G_p$ , 35.4%  $PAE_{max}$ , 28.9% PAE at  $P_{1dB}$  ( $PAE_{P1dB}$ ) and 16.9 %  $PAE_{6dB_PBO}$ . On other hand, HEM achieves 16.1 dBm  $P_{sat}$ , 15 dBm  $OP_{1dB}$ , 16.4 dB  $G_p$ , 36%  $PAE_{max}$ , 35.2 %  $PAE_{P1dB}$  and 18.6 %  $PAE_{6dB_PBO}$  at 55 GHz. Additionally, at 55 GHz and  $P_{1dB}$ , HLM achieves almost no AM-AM peaking and 0.4 ° AM-PM distortion, and HEM also achieves almost no AM-AM

AM peaking and 3.5  $^{\circ}$  AM-PM distortion. The measured large-signal performance and small-signal S-parameters are summarized over frequency. The peak  $S_{21}$  is 18.8 dB with 3-dB bandwidth of 52.7-57.2 GHz. Table 3.1 summarizes a comparison table to show the CW performance with other state-of-the-art PA designs at the related frequencies.

Figure 3.12 shows the single-carrier 64-QAM modulation measurements without digital predistortion (DPD). At 55 GHz, our PA achieves 15.6/12.1 % PAEavg, 9.5/9.4 dBm Pavg, -25.2/-25.4 dB rms EVM (EVM<sub>rms</sub>), and 30.3/31 dBc Adjacent Channel Power Ratio (ACPR) for 0.5/1 GSym/s 64-QAM, respectively. In addition, at 2/3 GSym/s, it achieves 10.5/9.1 % PAEavg, 8.7/8.1 dBm Pavg and -25.2/-24.5 dB EVMrms. The PA achieves the single-carrier 64-QAM highest modulation rate of 3 GSym/s (18 Gb/s) among the reported V-band silicon PAs. Note that due to the limited equipment bandwidth, the ACPR values cannot be accurately measured at 1 GSym/s and 3 GSym/s.

From Figure 3.13 to Figure 3.16, it shows the 5G NR CP-OFDM FR2 64-QAM modulation measurements at fcarrier of 55 GHz without DPD for various CC and modulation bandwidth. For 1-CC/2-CC FR2 400 MHz (total 400/800 MHz), our PA achieves 14.5/9.5 % PAE<sub>avg</sub>, 10.8/8.67 dBm P<sub>avg</sub>, -23.5/-23.1 dB EVM<sub>rms</sub>, and -25.8/-26.4 dBc ACPR. Additionally, For 4-CC, our PA is measured with FR2 100 MHz and 200 MHz (total 400 MHz and 800 MHz) respectively, it achieves 12.3/8.1 % PAE<sub>avg</sub>, 10/8.9 dBm P<sub>avg</sub>, -23.1/-23.3 dB EVM<sub>rms</sub>, and -24.5/-24.6 dBc ACPR. The PA performance comparisons with other state-of-the-art mm-Wave PA designs at the related frequencies are exhibited in Table 1. The PA achieves the single-carrier 64-QAM highest modulation rate of 3 GSym/s (18 Gb/s) among the reported V-band silicon PAs.

Figure 3.17 shows the 5G NR CP-OFDM 64-QAM 1-CC FR2 400 MHz modulation measurements at  $f_{carrier}$  of 55 GHz with respect to the different supply voltages  $V_{DD}$ , e.g., (a)  $P_{avg}$ , (b)  $PAE_{avg}$  and (c) ACPR. When the  $V_{DD}$  increases, the  $P_{avg}$  increases accordingly at the  $EVM \le -24$  dB. On other hand, for  $PAE_{avg}$ , at lower EVM values, for example  $EVM \ge -23$  dB, the  $PAE_{avg}$  for each  $V_{DD}$  value is close. However, the PA supplied with lower  $V_{DD}$  cannot maintain enough linearity for the low EVM requirement (i.e., EVM  $\le -24$  dB). Figure 13(c) shows ACPR values are very similar when the proposed PA is supplied with the given  $V_{DD}$  values. Table 3.2 also summarizes a comparison table to show the modulation measurements with other state-of-the-art PA designs at the related frequencies.



Figure 3.10 – Measured CW performance vs. the gate voltages, e.g., (a)AM-AM distortion, (b) AM-PM distortion, (c) P<sub>sat</sub>, (d) OP<sub>1dB</sub> (e) G<sub>p</sub>, (f) PAE<sub>max</sub>, (g) PAE<sub>P1dB</sub>, and (h) PAE<sub>6dB\_PBO</sub>.



Figure 3.11 – (a) the measured CW large-signal performance at 55GHz for High-Linearity-Mode (HLM), (b) the measured CW large-signal performance at 55GHz for High-Efficiency-Mode (HEM), (c) the AM-AM distortion, (d) the AM-PM distortion, (f) the delivery power, (g) the efficiency and (h) The S-parameters.



Figure 3.12 – The single-carrier 64-QAM modulation results at (a) 0.5 GSym/s, (b) 1 GSym/s, (c) 2 GSym/s and (d) 3 GSym/s. For 1 GSym/s and 3 GSym/s, due to the limited equipment bandwidth, their ACPR values cannot be accurately measured.



Figure 3.13 – The 5G NR 64-QAM CP-OFDM modulation results with 1-CC FR2 400MHz.



Figure 3.14 – The 5G NR 64-QAM CP-OFDM modulation results with 2-CC FR2 400MHz.



Figure 3.15 – The 5G NR 64-QAM CP-OFDM modulation results with 4-CC FR2 100MHz.



Figure 3.16 – The 5G NR 64-QAM CP-OFDM modulation results with 4-CC FR2 200MHz.



Figure 3.17 – The measured 5G NR 64-QAM CP-OFDM 1-CC FR2 400MHz modulation results, e.g., (a)  $P_{avg}$ , (b)  $PAE_{avg}$  and (c) ACPR with various  $V_{DD}$  values ( $V_{DD}$  = 0.9, 1.1, 1.1 and 1.2 V).

 $\begin{tabular}{ll} Table 3.1-CW Performance Comparison with The State-Of-The-Art Silicon-Based Mm-Wav E PA at Related Frequency \\ \end{tabular}$ 

|                       | V <sub>DD</sub> (V) | Freq.<br>(GHz) | Gain<br>(dB) | P <sub>sat</sub> (dBm) | OP <sub>1dB</sub><br>(dBm) | PAE <sub>max</sub><br>(%) | PAE <sub>P1dB</sub> (%) | PAE <sub>6dB_PBO</sub> (%) | Tech.                  | Topology                                     | Core<br>area<br>(mm²) |
|-----------------------|---------------------|----------------|--------------|------------------------|----------------------------|---------------------------|-------------------------|----------------------------|------------------------|----------------------------------------------|-----------------------|
| HLM                   | - 1.1               | 55             | 18.8         | 16.3                   | 13.5                       | 35.4                      | 28.9                    | 16.9                       | 45nm<br>CMOS           | Doubly Hybrid<br>NMOS/PMOS                   | 0.18                  |
| HEM                   | 1.1                 |                | 16.4         | 16.1                   | 15                         | 36                        | 35.2                    | 18.6                       | SOI                    | compensation w/<br>4-way series-parallel DAT | 0.18                  |
| Zhao<br>JSSC 13'      | 1                   | 60             | 17           | 17                     | 13.9                       | 30.3                      | 21.7                    | 8.4*                       | 40nm<br>CMOS           | Diff.<br>2-way Class-AB                      | 0.07                  |
| Kulkarni<br>TMTT 16'  | 1.8                 | 60             | 22.4         | 16.4                   | 13.9                       | 23                        | 18.9                    | 8                          | 40nm<br>CMOS           | N/PMOS<br>push-pull                          | 0.08                  |
| Chappidi<br>ISSCC 16' | 4                   | 55             | 18.8         | 23.6                   | 19.9                       | 27.7                      | 15.7                    | 11*                        | 130nm<br>SiGe          | Asymmetric<br>2-way combiner                 | 1.02                  |
| Chou<br>TMTT 16'      | 1.2                 | 60             | 20.1         | 20.6                   | 17.6                       | 20.3                      | 8*                      | 3*                         | 90nm<br>CMOS           | Class-AB<br>w/ radial combiner               | 0.43                  |
| Chen<br>ISSCC 11'     | 1                   | 60             | 20.3         | 18.6                   | 15                         | 15.1                      | 6.8                     | 2                          | 65nm<br>CMOS           | 3-stg. Class-AB                              | 0.28                  |
| Jen<br>TMTT 09'       | 1.8                 | - 60           | 26.1<br>26.6 | 14.5<br>18             | 10.5<br>14.5               | 10.2<br>12.2              | -                       | -                          | 90nm<br>CMOS           | 3-stg. w/ DAT combiner                       | 0.64                  |
| Varonen<br>JSSC 08'   | 1.2                 | 60             | 12.8         | 7                      | 1.5                        | 4*                        | -                       | -                          | 65nm<br>CMOS           | 3-stg. Sigle-ended Class-AB                  | 0.61                  |
| Bassi<br>JSSC 15'     | 1                   | 53             | 13           | 13.3                   | 12                         | 16                        | -                       | -                          | 28nm<br>CMOS           | 2-stg. Diff. Class-AB                        | 0.334#                |
| Nguyen<br>RFIC 19'    | 2                   | 60             | 12.9         | 20.1                   | 19.3                       | 26.5                      | 25.9                    | 16.6<br>(7dB PBO)          | 45nm<br>CMOS<br>SOI    | Coupler-Based Differential<br>Doherty        | 0.76                  |
| Sun<br>IMS 19'        | 2.4                 | 58             | 19.6         | 22.2                   | 19.7                       | 17.8                      | -                       | -                          | 90nm<br>CMOS           | 2-stacked & 2-way<br>Combiner                | 0.264                 |
| Chang<br>TMTT 19'     |                     | 60             | 29.7         | 23.7                   | 19.9                       | 22.1                      | 11.1                    | -                          | 65nm<br>CMOS           | 3-stg. w/ 4-way combiner                     | 0.653                 |
| Chu<br>RFIC 20'       | 0.95                | 65             | 21.4         | 17.9                   | 13.5                       | 26.5                      | 15                      | 18                         | 16nm<br>FinFET<br>CMOS | 2/4-way<br>Non-uniform combiner              | 0.14                  |
| Gong<br>IMS 20'       | 2                   | 60             | 17.3         | 24.4                   | 23.9                       | 14.2                      | -                       | -                          | 90nm<br>SiGe           | Balanced PA w/ asymmetric coupled lines      | 1.22                  |
| Nguyen<br>ISSCC 18'   | 1.9                 | 65             | -            | 19.4                   | 19.2                       | 28.3                      | 27.5                    | 20.1                       | 45nm<br>CMOS<br>SOI    | Multi-feed antenna-based<br>Doherty          | -                     |

<sup>\*</sup>Graphically estimated from reported figures, #The area including pads.

Table 3.2 – Modulation Comparison with The State-Of-The-Art Silicon-Based Mm-Wave PA at Related Frequency

|             | Carrier Freq. (GHz) | Modulation Scheme        | Data Rate (Gb/s)  | EVM (dB) | $P_{avg}(dBm)$ | PAE <sub>avg</sub> (%) |  |
|-------------|---------------------|--------------------------|-------------------|----------|----------------|------------------------|--|
|             |                     |                          | 3                 | -25.2    | 9.5            | 15.6                   |  |
|             |                     | Single Carrier<br>64-QAM | 6                 | -25.4    | 9.4            | 12.1                   |  |
|             |                     |                          | 12                | -25.2    | 8.7            | 10.5                   |  |
|             |                     |                          | 18                | -24.5    | 8.1            | 9.1                    |  |
| This design |                     |                          | 2.4               |          |                |                        |  |
|             | 55                  |                          | (1-CC FR2 400MHz, | -23.5    | 10.8           | 8                      |  |
|             |                     |                          | total 400MHz)     |          |                |                        |  |
|             |                     |                          | 3.2               | -23.1    | 10             | 12.3                   |  |
|             |                     |                          | (2-CC FR2 400MHz, |          |                |                        |  |
|             |                     | 5G NR CP-OFDM            | total 800MHz)     |          |                |                        |  |
|             |                     | 64-QAM                   | 2.4               |          | 8.7            | 9.5                    |  |
|             |                     |                          | (4-CC FR2 100MHz, | -23.1    |                |                        |  |
|             |                     |                          | total 400MHz)     |          |                |                        |  |
|             |                     |                          | 4.8               |          | 8              | 8.1                    |  |
|             |                     |                          | (4-CC FR2 200MHz, | -23.2    |                |                        |  |
|             |                     |                          | total 800MHz)     |          |                |                        |  |
| Chappidi    | 55                  | Single Carrier           | 3                 | 25**     | 12.8           | _                      |  |
| SSCC 16'    |                     | 64-QAM                   |                   | 23       | 12.0           |                        |  |
| Kulkarni    | 60                  | Single Carrier           | 3                 | -25.2    | 7              |                        |  |
| TMTT 16'    |                     | 64-QAM                   |                   | -23.2    | ,              |                        |  |
| Chu         | 65                  | Single Carrier           | 6                 | -21.6    | 11.2           | 9.4                    |  |
| RFIC 20'    |                     | 64-QAM                   |                   | -21.0    | 11.2           | 7.4                    |  |
| Iguyen      | 60                  | Single Carrier           | 3                 | -23.1    | 13.8           | 15.7                   |  |
| P.FIC 19'   |                     | 64-QAM                   |                   | -23.1    | 13.0           | 13.7                   |  |
| Iguyen      | 65                  | Single Carrier           | 6                 | -26.7*   | 14.2           | 20.2                   |  |
| SSCC 18'    | 03                  | 64-QAM                   | U                 | -20.7    | 14.2           | 20.2                   |  |

\*EVM is normalized to peak, \*\*No information about EVM normalization

# 3.6. Chapter Summary

We present a V-band doubly hybrid NMOS/PMOS differential four-way DAT-based power amplifier, implemented in 45-nm CMOS SOI process. The proposed doubly hybrid NMOS/PMOS nonlinearity cancellation can substantially improve the PA linearity. Additionally, the independent gate bias scheme can perform various contours of the measured PA performance metrics to optimize the PA performance in different operation modes, e.g., high-linearity mode (HLM) and high-efficiency mode (HEM) in this design. Also, the DAT-based transformer can combine both NMOS and PMOS PAs and show a low loss, compact and simple PA output network design. Thus, this PA design address the PA linearity-efficiency tradeoff. It provides a high linearity and efficiency simultaneously, achieving superior measured CW and modulation performance at V-band.

# PASSIVE REFLECTION-TYPE PHASE SHIFTER WITH TRANSFORMER-BASED MULTI-RESONANCE LOADS FOR 360° PHASE SHIFTING

### 4.1. Introduction

Passive phase shifters provide multiple benefits over active vector-modulator phase shifters [142]-[148], such as zero DC power consumption and superior linearity, which are essential in large-scaled and power-constrained phased array systems. Commonly used passive phase shifter topologies include switched-line [145], [146], travelling-wave [146], [148], loaded-line [147], switched-filter [149], [150], and reflection-type [151]-[172]. Switched-line and travelling-wave topologies both require one or multiple transmission lines, leading to excessive chip areas even at mm-Wave frequencies. Switched-filter topology replaces the transmission lines by high-/low-pass filters for size reduction, but it cannot provide continuous or dense phase shift. Loaded-line topology also experiences these similar issues.

# 4.2. Reported RTPS Topologies

Reflection-type phase shifter (RTPS) exhibits several unique advantages over the other passive phase shifter topologies, including continuous and dense phase shift, moderate size, and cascadable operation [151]-[172]. Therefore, RTPS is an excellent candidate to meet the stringent requirements in high performance phased array systems.

A single-ended RTPS typically consists of a 90° coupler and two identical tunable passive reflective loads, as shown in Figure 4.1. Popular tunable reflective load topologies include tunable capacitors, series L-C resonators, parallel L-C resonators, and C-L-C  $\pi$ networks, and the impedance of the reflective load is tuned to achieve the desired phase shifting angle and range [151]-[172]. The design and limitation of these passive reflective load networks will be comprehensively analyzed in this paper. The phase shifting range of an RTPS is solely governed by the load reactance tuning range [151]-[172]. However, the passive loss substantially limits the reactance tuning range of the reflective loads and also the phase shifting range of the RTPS. In practice, the phase shifting range of one mm-Wave RTPS is usually around or below 180°[159]-[164]. Based on the reported RTPS, a highorder reflective load can extend the reactance tuning range and increase the phase shifting range. However, a high-order reflective load typically requires complicated passive designs and exhibits higher passive loss. Although some fully integrated low-GHz RTPS with highorder reflective loads achieve a 360° phase shifting range, they suffer from severe IL even at those low GHz frequencies [155]. Other low-loss 360° RTPS designs are discrete designs with high-quality passive loads at low GHz frequencies [151]-[154], and these 360° designs are unsuitable for mm-Wave fully integrated RTPS applications. Another practical approach is to cascade multiple mm-Wave RTPS designs to realize a wide phase shifting range [159]. Although this approach may achieve 360° phase shift, it often exhibits exceedingly high IL. For example, a reported 180° mm-Wave RTPS consists of two  $\pi$ network loads with IL of 7.5dB [159], and cascading two of such RTPS designs for 360° phase shift will result in a total IL of 15dB. Thus, there exists a major challenge to realize a mm-Wave RTPS which can achieve full span 360° phase shift and low IL simultaneously.

# 4.3. RTPS Design Challenge

To address this challenge, we propose a fully integrated differential mm-Wave RTPS that employs two transformer-based 90° couplers and two transformer-based multiresonance reflective loads to simultaneously achieve a full span 360° phase shift, low IL, and an ultra-compact chip area [171]. Other non-silicon technologies, such as MEMS RTPS or hybrid integration of MEMS and CMOS RTPS, can achieve higher FoM [152], [153]. However, using phase shifters of non-silicon technologies with silicon-based transceiver circuits will require heterogeneous integration or careful packaging, which will increase the cost, integration complexity, and possible performance degradation for mm-Wave operations in practice. Our proof-of-concept design is fully integrated in commercially available standard 130nm BiCMOS process. It covers a full span 360° phase shifting range from 58GHz to 64GHz, and demonstrates the lowest IL, the smallest IL variation, and the best RTPS figure-of-merit (FoM) of 37.1°/dB compared with other reported 60GHz RTPS designs [159]-[161], [164].

# 4.4. Reflective Load Designs in RTPS

In this section, we will present several reported reflective load topologies and their load impedance tuning range. The phase shift limit of each reflective load will also be discussed. We will also present our proposed transformer-based multi- resonance load and its achievable 360° phase shifting range.

Figure 4.1 demonstrates a generic RTPS design with its signal path and the input/output phase shift. First, the input signal is split to two paths with 90° phase difference by the 90° coupler. Next, the resulting two split signals are reflected from the

two reflective loads at the through and couple ports. Assume that these two reflective loads are identical, the two reflected signals will be combined in-phase at the RTPS output (the isolation port) to generate the desired output phase shift. At the same time, the two reflected signals will cancel each other at the input port and result in the desired input matching. Assume that the two reflective loads and the 90° coupler are lossless, the total phase shift can be obtained as

$$\theta = -90^{\circ} - 2 \tan^{-1} \left( \frac{X_L}{Z_0} \right) = -90^{\circ} - \angle \Gamma,$$
 (4-1)

where  $X_L$  is the reflective load impedance;  $Z_0$  is the characteristic impedance of the 90° coupler;  $\Gamma$  is the reflection coefficient of  $X_L$  normalized by  $Z_0$  [159], [163], [168]. Therefore, the output phase shift of an RTPS is now equivalently expressed as the phase of the load reflection coefficient  $\angle\Gamma$ . This relationship will be used to facilitate the RTPS load design and analysis in this paper. Note that  $\Gamma$  is on the unit circle of the Smith chart for lossless reflective loads. To synthesize the desired phase shift,  $X_L$  is varied from  $X_L^{\min}$  to  $X_L^{\max}$ , and the phase shifting range  $\Delta\theta$  is thus found as

$$\Delta \theta = \theta_{\text{max}} - \theta_{\text{min}} = 2 \left[ \tan^{-1} \left( \frac{X_L^{\text{max}}}{Z_0} \right) - \tan^{-1} \left( \frac{X_L^{\text{min}}}{Z_0} \right) \right] = \angle \Gamma \left( X_L^{\text{min}} \right) - \angle \Gamma \left( X_L^{\text{max}} \right). \tag{4-2}$$

From equation (4-2), the phase difference of the two reflection coefficients, i.e.,  $\Gamma(X_L^{\text{max}})$  and  $\Gamma(X_L^{\text{min}})$ , should be maximized to achieve a wide phase shifting range  $\Delta\theta$ . Next, we will present several reported reflective load configurations and our proposed transformer-based multi-resonance load. Their load impedance tuning limits and the achievable phase shifting ranges are also analyzed. For simplicity, the following analyses

assume that the passive reflective loads and the  $90^{\circ}$  coupler are both lossless.



Figure 4.1 – A single-ended reflection-type phase shifter (RTPS).

# 4.4.1. Capacitive load (CL)

The simplest reflective load in an RTPS is a capacitive load (CL) [155]-[158], e.g., a varactor or a switch-controlled capacitor bank. Figure 4.2(a) shows the CL topology and its load impedance tuning trajectory along the unit circle on the Smith chart at a fixed operating frequency. The load impedance  $Z_L$  of the capacitance C is located at the half side of the Smith chart, and C is varied from  $C^{\min}$  to  $C^{\max}$ , e.g., the achievable minimum and maximum capacitance. Thus, the phase shift of CL is the phase difference of the load reflection coefficients by  $C^{\min}$  and  $C^{\max}$ . In practice, CL only provides a marginal phase shifting range due to its limited impedance tuning range. For a CL RTPS, the  $\Delta\theta$ , i.e., equation (4-2), can be expressed as

$$\Delta\theta = 2 \tan^{-1} \left( \frac{1}{Z_0 \omega_0 C^{\min}} \right) - 2 \tan^{-1} \left( \frac{1}{Z_0 \omega_0 C^{\max}} \right)$$

$$= 2 \tan^{-1} \left( \frac{1}{Z_0 \omega_0 C^{\min}} \right) - 2 \tan^{-1} \left( \frac{1}{Z_0 \omega_0 \alpha C^{\min}} \right). \tag{4-3}$$

Assume  $\alpha$  is the tuning range of the load capacitor and  $\alpha_{max}$  is its maximum value, i.e.,  $1 \le \alpha \le \alpha_{max} = C^{max}/C^{min}$ , and  $\omega_0$  is the operating frequency. Further, the  $\Delta\theta$  of CL can be simplified as

$$\Delta \theta = 2 \tan^{-1} \left[ \frac{Z_0 \omega_0 C^{\min} (\alpha - 1)}{\alpha (Z_0 \omega_0 C^{\min})^2 + 1} \right]. \tag{4-4}$$

For a given maximum capacitor tuning range  $\alpha_{\rm max}$ , the  $\Delta\theta$  of CL can be maximized by choosing  $C^{\rm min}=1/(\sqrt{\alpha_{\rm max}}Z_0\omega_0)$ , and the resulting maximum phase shifting range  $\Delta\theta_{\rm max}$  of CL is

$$\Delta \theta_{\text{max}} = 2 \tan^{-1} \left( \frac{\alpha_{\text{max}} - 1}{2 \sqrt{\alpha_{\text{max}}}} \right). \tag{4-5}$$

Figure 4.2(b) summarizes the  $\Delta\theta$  versus  $\alpha_{max}$ . For  $\alpha_{max}$ =3, i.e., a typical tuning range for a varactor in silicon process, the  $|\Delta\theta_{max}|$  is only 60° (Figure 4.2b), matching the results in [156]. For  $\alpha_{max}$ =4, the  $|\Delta\theta_{max}|$  only increases by 13.4°. In reality, the varactors or switch-controlled capacitor banks present passive loss, and the  $|\Delta\theta_{max}|$  is considerably reduced from the theoretical value. Thus, a CL RTPS only achieves limited phase shift in practice [156].



Figure 4.2 -(a) The load impedance tuning range for a lossless capacitive load (CL) by tuning C, and (b) the simulated phase shifting range as the capacitor load tuning range  $\alpha$  varies from 1 to its maximum value for different  $\alpha_{max}$  (e.g.,  $\alpha_{max}$ =2, 3, or 4). Assume that the optimum  $C^{min}$  value, i.e.,  $C^{min}$ =1/ $(\sqrt{\alpha_{max}}Z_0\omega_0)$  is used in this CL RTPS design.

# 4.4.2. Series L-C resonant load (SLC)

To further extend the phase shifting range, series L-C resonant loads (SLC) are used in RTPS [155]-[158]. Assuming all the passive components are lossless, an SLC load generates a series resonance as a short circuit (S.C.) for the load impedance  $Z_L$  (Figure 4.3a). Increasing capacitance  $C_s$  makes  $Z_L$  inductive and then moves  $Z_L$  to approach  $Z_1$ ;  $Z_1$  is defined as the SLC load impedance  $Z_L$  at the maximum  $C_s$  as  $C_s^{max}$ . On the other hand, decreasing  $C_s$  makes  $Z_L$  capacitive and pushes  $Z_L$  to reach  $Z_2$ , which is the SLC load impedance  $Z_L$  at the minimum  $C_s$  as  $C_s^{min}$ . The phase shift of SLC load is realized by the phase difference between the reflection coefficients of  $Z_1$  and  $Z_2$ . However, due to the limited tuning range of  $C_s$  in practice, an SLC RTPS cannot achieve a full span 360° phase shift [155]-[158].

The phase shifting range  $\Delta\theta$  of an SLC RTPS can be expressed below, assuming  $C_{\rm s}$  is varied from  $C_{\rm s}^{\rm min}$  to  $C_{\rm s}^{\rm max}$ ,

$$\Delta\theta = 2 \tan^{-1} \left( \frac{\omega_0^2 L_s C_s^{\text{max}} - 1}{Z_0 \omega_0 C_s^{\text{max}}} \right) - 2 \tan^{-1} \left( \frac{\omega_0^2 L_s C_s^{\text{min}} - 1}{Z_0 \omega_0 C_s^{\text{min}}} \right), \tag{4-6}$$

which can be further simplified as

$$\Delta\theta = 2 \tan^{-1} \left\{ \frac{Z_0 \omega_0 C_s^{\min} \left(\alpha - 1\right)}{1 - \omega_0^2 C_s^{\min} \left[\left(\alpha + 1\right) L_s + \alpha C_s^{\min} \left(\omega_0^2 L_s + Z_0^2\right)\right]} \right\},\tag{4-7}$$

where  $\alpha$  stands for the load capacitor tuning range and  $\alpha$ max is its maximum value, i.e.,  $1 \le \alpha \le \alpha_{\text{max}} = C_s^{\text{max}}/C_s^{\text{min}}$ .

The optimum series inductance value  $L_s$  can be found as  $L_s = (\alpha_{\text{max}} + 1)/(2\alpha_{\text{max}}\omega_0^2 C_s^{\text{min}})$  by differentiating equation (4-7) with respect to  $L_s$ . By choosing  $C_s^{\text{min}}$  as  $C_s^{\text{min}} = 1/(\sqrt{\alpha_{\text{max}}}Z_0\omega_0)$ , the maximum phase shifting range  $\Delta\theta_{\text{max}}$  of an SLC RTPS can be obtained in equation (4-8), aligning well with the results in [156],

$$\Delta\theta_{\text{max}} = 4 \tan^{-1} \left( \frac{\alpha_{\text{max}} - 1}{2 \sqrt{\alpha_{\text{max}}}} \right). \tag{4-8}$$

The  $\Delta\theta$  of an SLC RTPS for different  $\alpha_{max}$  is plotted in Figure 4.3. It shows that, for  $\alpha_{max}$ =3, the  $|\Delta\theta_{max}|$  of an SLC RTPS is only 120°, and the  $|\Delta\theta|$  is only increased by 27.5° for  $\alpha_{max}$ =4, agreeing well with the results in [156]. Thus, an SLC RTPS cannot achieve 360° phase shift with a practical capacitor tuning range  $\alpha$ . The passive losses limit the  $\Delta\theta_{max}$  [155]-[158].



Figure 4.3 - (a) The load impedance trajectory for a lossless parallel L-C load (PLC) by tuning  $C_p$ , and (b) the simulated phase shifting range for different maximum capacitor tuning range  $\alpha_{max} = C_p^{max}/C_p^{min}$ , e.g.,  $\alpha_{max} = 2$ , 3, or 4. Assume the optimum  $C_p^{min}$  and  $L_p$  values are used in this PLC RTPS design.

### 4.4.3. Parallel L-C Resonant Load (PLC)

As the dual implementation of the SLC load, parallel L-C resonant load (PLC) can also be used for RTPS designs [163]. The PLC load impedance trajectory along the unit circle of the Smith chart is shown in Figure 4.4(a), indicating that PLC generates a parallel resonance as an open circuit (O.C.) for the load. In practice, a PLC load also cannot offer a full span 360° phase shifting range, due to the limited tuning range of  $C_p$  [163]. Different from an SLC load, decreasing capacitance  $C_p$  makes the PLC load impedance  $Z_L$  inductive and approaching  $Z_1$ , which is the load impedance  $Z_L$  at the minimum  $C_p$  as  $C_p^{min}$ . On the other hand, increasing  $C_p$  makes the PLC load impedance capacitive and pushes  $Z_L$  to reach  $Z_2$ , which is the load impedance  $Z_L$  at the maximum  $C_p$  of  $C_p^{max}$ . Similar to an SLC RTPS, the  $\Delta\theta$  of PLC RTPS is determined by the phase difference of the reflection coefficients of  $Z_1$  and  $Z_2$ , and it can be expressed in equation (9).

$$\Delta\theta = 2 \tan^{-1} \left[ \frac{\omega_0 L_p}{Z_0 \left( 1 - \omega_0^2 L_p C_p^{\text{max}} \right)} \right] - 2 \tan^{-1} \left[ \frac{\omega_0 L_p}{Z_0 \left( 1 - \omega_0^2 L_p C_p^{\text{min}} \right)} \right]$$

$$= 2 \tan^{-1} \left[ \frac{Z_0 \omega_0^3 L_p^2 C_p^{\text{min}} \left( \alpha - 1 \right)}{Z_0^2 - (\alpha + 1) Z_0^2 \omega_0^2 L_p C_p^{\text{min}} + \alpha Z_0^2 \left( \omega_0^2 L_p C_p^{\text{min}} \right)^2 + \left( \omega_0 L_p \right)^2} \right]. \tag{4-9}$$

The optimum parallel inductance  $L_{\rm p}$  value can be found as  $L_{\rm p}=2/\left[(\alpha_{\rm max}+1)\omega_0^2C_{\rm p}^{\rm min}\right]$  by differentiating equation (4-9) with respect to  $L_{\rm p}$  for achieving the maximum phase shift. Moreover, choosing  $C_{\rm p}^{\rm min}=1/\left(\sqrt{\alpha_{\rm max}}Z_0\omega_0\right)$ , the  $\Delta\theta_{\rm max}$  is obtained as

$$\Delta\theta_{\text{max}} = 2 \tan^{-1} \left[ \frac{4\sqrt{\alpha_{\text{max}}} \left(\alpha_{\text{max}} - 1\right)}{\left(\alpha_{\text{max}} - 1\right)^2 - 4\alpha_{\text{max}}} \right]. \tag{4-10}$$

The  $\Delta\theta_{max}$  of a PLC RTPS for different  $\alpha_{max}$  is plotted in Figure 4.4(b). From Figure 4.3(b) and Figure 4.4(b), it can be summarized that a 2<sup>nd</sup>-order L-C resonator (e.g. SLC and PLC) only covers one resonance point on the Smith chart, i.e., S.C. or O.C.; it performs a larger phase shifting range than a CL RTPS, but it still cannot achieve 360° full-range phase shift [163]. Increasing the  $\alpha_{max}$  can only marginally improve the phase shifting range (Figure 4.3b and Figure 4.4b), and cascading multiple stages directly suffers from passive loss penalty. However, the analysis above indicates that employing high-order reflective loads with multi-resonance will expand the load trajectory on the Smith chart, cover multiple S.C. or O. C. resonance points, and thus increase the RTPS phase shifting range [158]-[160].



Figure 4.4 - (a) The load impedance trajectory for a lossless parallel L-C load (PLC) by tuning  $C_p$ , and (b) the simulated phase shifting range for different maximum capacitor tuning range  $\alpha_{max}=C_p^{max}/C_p^{min}$ , e.g.,  $\alpha_{max}=2$ , 3, or 4. Assume the optimum  $C_p^{min}$  and  $L_p$  values are used in this PLC RTPS design.

### 4.4.4. CLC $\pi$ -resonator load with one tunable capacitor

The CLC  $\pi$ -resonator load with one tunable capacitor (CLC-1) is widely adopted in RTPS to realize a high-order reflective load [155], [158]-[159]. The load reflection coefficient behavior is demonstrated in Figure 4.5. CLC-1 can cover two resonance points, i.e., an O.C. and an S.C., which extend the phase shifting range significantly. When L<sub>1</sub> and C<sub>2</sub> form a series resonance as an S.C. load, the load impedance Z<sub>L</sub> of CLC-1 is zero. When C<sub>2</sub> decreases, the L<sub>1</sub>-C<sub>2</sub> series resonator becomes capacitive, making Z<sub>L</sub> reach Z<sub>2</sub>, which is defined as the load impedance Z<sub>L</sub> at the minimum C<sub>2</sub> value as C<sub>2</sub><sup>min</sup>. On the other hand, when C<sub>2</sub> increases, the L<sub>1</sub>-C<sub>2</sub> resonator becomes inductive and can form a parallel resonance with the fixed capacitor C<sub>3</sub> to realize an O.C. load, denoted as Z<sub>1</sub>. Moreover, C<sub>2</sub> can be further increased to move Z<sub>L</sub> beyond O.C. and reach Z<sub>3</sub>, which is defined as Z<sub>L</sub> at the maximum C<sub>2</sub> value as C<sub>2</sub><sup>max</sup>.

The phase shifting range is thus more than  $180^{\circ}$ . The CLC-1 load impedance  $Z_L$  is expressed as

$$Z_{L} = \frac{1 - \omega_{0}^{2} L_{1} C_{2}}{j \omega_{0} \left( C_{3} + C_{2} - \omega_{0}^{2} L_{1} C_{2} C_{3} \right)}.$$
(4-11)

The phase difference can be written as

$$\Delta\theta =$$

$$2 \tan^{-1} \left[ \frac{1 - \omega_0^2 L_1 C_2^{\text{max}}}{Z_0 \omega_0 \left( C_3 + C_2^{\text{max}} - \omega_0^2 L_1 C_2^{\text{max}} C_3 \right)} \right] - 2 \tan^{-1} \left[ \frac{1 - \omega_0^2 L_1 C_2^{\text{min}}}{Z_0 \omega_0 \left( C_3 + C_2^{\text{min}} - \omega_0^2 L_1 C_2^{\text{min}} C_3 \right)} \right]. \tag{4-12}$$

Further, since  $C_2$  varies from  $C_2^{\min}$  to  $C_2^{\max}$ , the total phase shifting range can be derived using equation (4-12). Assume the same capacitor equation  $C_2^{\min} = 1/(\sqrt{\alpha_{\max}} Z_0 \omega_0)$  is used as the CL, SLC, and PLC loads. Let  $C_3 = \beta C_2^{\min}$  with  $\beta$  as the ratio of  $C_3$  over  $C_2^{\min}$ . The optimum inductance value  $L_1$  can be found as  $L_1 = \left[ (\alpha_{\max} + \beta)^2 + \alpha_{\max} (1 + \beta^2) \right] / \left[ 2\alpha_{\max} \omega_0^2 C_2^{\min} (\alpha_{\max} + \beta^2) \right]$  that achieves the maximum phase shift for given  $\alpha_{\max}$ ,  $\beta$ , and  $C_2^{\min}$ . Finally, the  $\Delta\theta_{\max}$  of a CLC-1 RTPS is derived in equation (4-13).

$$\Delta\theta_{\text{max}} = 2 \tan^{-1} \left[ \frac{4\alpha_{\text{max}} \left( -\alpha_{\text{max}}^{3} - 2\alpha_{\text{max}}^{2} \beta^{2} + \alpha_{\text{max}}^{2} - \alpha_{\text{max}} \beta^{4} + 2\alpha_{\text{max}} \beta^{2} + \beta^{4} \right) \sqrt{\alpha_{\text{max}}}}{-\alpha_{\text{max}}^{5} + \alpha_{\text{max}}^{4} \left( 6 - 3\beta^{2} \right) + \alpha_{\text{max}}^{3} \left( 10\beta^{2} - 3\beta^{4} - 1 \right)} + \alpha_{\text{max}}^{2} \beta^{2} \left( 6\beta^{2} - 3 - \beta^{4} \right) + \alpha_{\text{max}} \beta^{4} \left( 2\beta^{2} - 3 \right) - \beta^{6} \right].$$

$$(4-13)$$

Figure 6a depicts the  $\Delta\theta$  of a CLC-1 RTPS versus the capacitor tuning range  $\alpha$  of  $C_2$  ( $1 \le \alpha \le 3 = \alpha_{max}$ ) and capacitor ratio  $\beta$  ( $0 \le \beta \le 3$ ). It shows that if  $\alpha$  and  $\beta$  equal 3, the  $|\Delta\theta_{max}|$  is 266.3°. Note that when  $\beta$  equals 0, CLC-1 is reduced to SLC, and therefore the  $|\Delta\theta_{max}|$  is

120°, matching well with the results for SLC RTPS (Figure 4.3). Compared with CL, SLC and PLC designs, CLC-1 demonstrates that a high-order resonator load indeed improves the phase shifting range for the same  $\alpha_{max}$ . Figure 4.6 shows that the total phase shift of a CLC-1 RTPS versus  $\alpha$  and large values of  $\beta$  (up to 20). It can be found that the  $|\Delta\theta_{max}|$  can reach 360° (e.g.,  $|\Delta\theta_{max}|=357^{\circ}$  for  $\alpha=\alpha_{max}=3$  and  $\beta=20$ ) however with drastic phase shift changes at large  $\beta$  values. This drastic phase shift change is not desired in practice. In reality, the  $|\Delta\theta_{max}|$  of a CLC-1 RTPS is generally smaller than 360° with practical capacitive tuning and inevitable passive loss. For example, the phase shifting range of a CLC-1 RTPS in [155] and [158] is only 85° at 10.5GHz and only 147° in [159] at 60GHz.



Figure 4.5 – The load impedance tuning range for a lossless CLC  $\pi$ -resonator load with one tunable capacitor (CLC-1).



Figure 4.6 – The simulated CLC-1 phase shifting range versus the capacitor tuning  $\alpha$  and capacitor ratio  $\beta$  (i.e.,  $\beta$ =C<sub>3</sub>/C<sub>2</sub><sup>min</sup>) for a lossless CLC-1 RTPS load. The phase shifting range is referenced to zero for  $\alpha$  =1, i.e., no capacitive tuning. It covers (a)  $0 \le \beta \le 3$  and (b)  $0 \le \beta \le 20$ .

### 4.4.5. CLC $\pi$ -resonator load with two tunable capacitors

A CLC  $\pi$ -resonator load with two tunable capacitors (CLC-2) is also employed in RTPS designs. Different from CLC-1, CLC-2 allows both  $C_2$  and  $C_3$  to be tunable (Figure 4.7) [158], [160]. This tunable  $C_3$  provides an additional degree of freedom to extend the RTPS phase shift. Like CLC-1, CLC-2 also covers two resonance points with one S.C. and one O.C. The load impedance trajectory of the CLC-2 load is shown in Figure 4.7. When  $L_1$  and  $C_2$  resonate and form an S.C. load, the load impedance  $Z_L$  is zero, similar to the CLC-1 load. When the  $L_1$ - $C_2$  resonator becomes capacitive, the tunable capacitance  $C_3$  can further extend the load trajectory toward O.C. by reducing the  $C_3$  value, achieving a wider phase shift than CLC-1. The location of  $Z_3$  on the Smith chart depends on the tuning ratio of  $C_3$  and the relative values of  $C_2$  and  $C_3$ . Similarly, when the CLC-2 load forms an O.C.

load as the CLC-1 load, increasing the  $C_3$  value will move the load impedance further into the capacitive region of the Smith chart, also extending the phase shift. Equation (4-14) expresses the  $\Delta\theta$  of a CLC-2 RTPS.

$$\Delta\theta = 2 \tan^{-1} \left[ \frac{1 - \omega_0^2 L_1 C_2^{\text{max}}}{Z_0 \omega_0 \left( C_3^{\text{max}} + C_2^{\text{max}} - \omega_0^2 L_1 C_2^{\text{max}} C_3^{\text{max}} \right)} \right]$$

$$-2 \tan^{-1} \left[ \frac{1 - \omega_0^2 L_1 C_2^{\text{min}}}{Z_0 \omega_0 \left( C_3^{\text{min}} + C_2^{\text{min}} - \omega_0^2 L_1 C_2^{\text{min}} C_3^{\text{min}} \right)} \right].$$
(4-14)

To investigate the  $\Delta\theta_{\rm max}$  of the CLC-2 RTPS, for simplicity, we assume  $C_2$  and  $C_3$  have the same  $\alpha_{\rm max}$ , i.e.,  $\alpha_2^{\rm max} = C_2^{\rm max}/C_2^{\rm min} = \alpha_{\rm max}$  and  $\alpha_3^{\rm max} = C_3^{\rm max}/C_3^{\rm min} = \alpha_{\rm max}$ , although  $C_2$  and  $C_3$  can vary independently to achieve the target phase shift. Also, we assume  $C_3^{\rm min} = \beta C_2^{\rm min}$ , where  $\beta$  is the capacitor ratio between  $C_3^{\rm min}$  and  $C_2^{\rm min}$ . Using the same design capacitor design equation  $C_2^{\rm min} = 1/(\sqrt{\alpha_{\rm max}}Z_0\omega_0)$  as for the CL, SLC, PLC, and CLC-1 cases, the optimum  $L_1$  can be found in equation **Error! Reference source not found.** Thus, the  $\Delta\theta_{\rm max}$  of a CLC-2 RTPS is derived in equation **Error! Reference source not found.** for given  $\alpha_{\rm max}$  and  $\beta$  values.

$$L_{1} = \frac{\left(\sqrt{\beta \left(\omega_{0} Z_{0} C_{2}^{\min}\right)^{2} \left\{\beta \left(\alpha_{\max} C_{2}^{\min}\right)^{2} \left[Z_{0}^{2} \omega_{0}^{2} \left(\beta+1\right)^{2}+1\right] + \beta-2 \alpha_{\max}\right\} + 1}{+\alpha_{\max} \beta \left(1+\beta\right) \omega_{0}^{2} Z_{0}^{2} \left(C_{2}^{\min}\right)^{2}-1}}{\alpha_{\max} \left(\alpha_{\max}+1\right) \left(\beta \omega_{0}^{2} Z_{0} C_{2}^{\min}\right)^{2} C_{2}^{\min}}.$$
 (4-15)

$$2 \tan^{-1} \left\{ \frac{\sqrt{\alpha_{\max}} \left[ \beta (1 - \alpha_{\max} \beta) - 1 + \sqrt{\beta^{2} \left( \alpha_{\max} + \frac{1}{\alpha_{\max}} \right) + (\beta + 1)^{2} + \beta^{3} (\beta + 2)} \right]}{\beta \left[ (2\alpha_{\max} - \beta) \beta + (1 + \beta) - \sqrt{\beta^{2} \left( \alpha_{\max} + \frac{1}{\alpha_{\max}} \right) + (\beta + 1)^{2} + \beta^{3} (\beta + 2)} \right]} \right\} - 2 \tan^{-1} \left\{ \frac{\left[ (\alpha_{\max} \beta - \alpha_{\max} - \beta^{2}) + (\beta + 1)^{2} + \beta^{3} (\beta + 2) \right]}{\beta \left[ (\alpha_{\max} \beta - \alpha_{\max} - \beta^{2}) + (\beta + 1)^{2} + \beta^{3} (\beta + 2) \right]} \right\}. \quad (4-16)$$

$$-2 \tan^{-1} \left\{ \frac{\left[ (2\alpha_{\max} \beta - \alpha_{\max} - \beta^{2}) + (\beta + 1)^{2} + \beta^{3} (\beta + 2) \right]}{\beta \left[ (2\alpha_{\max} - \beta^{2}) \beta - \alpha_{\max} \beta \sqrt{\alpha_{\max} + \frac{1}{\alpha_{\max}} \beta^{2} + (\beta + 1)^{2} + \beta^{3} (\beta + 2)} \right]} \right\}. \quad (4-16)$$

Figure 4.8 shows the total phase shifting range of a CLC-2 RTPS with independent  $C_2$  and  $C_3$  tuning and different capacitor ratios  $\beta$  (i.e.,  $\beta = C_3^{\text{min}}/C_2^{\text{min}}$ ). Assume that the maximum tuning ranges of  $C_2$  and  $C_3$  are the same as  $\alpha_{\text{max}}$ . When  $\beta$  equals 0, CLC-2 is reduced to SLC, and the  $|\Delta\theta_{\text{max}}|$  is 120° (Figure 4.8a), agreeing well with the results in Fig. 3a. Furthermore, when  $\beta$  increases, the total phase shifting range also increase accordingly, as shown in Figure 4.8(b)-(d). If  $\alpha_{\text{max}}$  and  $\beta$  both equal 3, the  $|\Delta\theta_{\text{max}}|$  is 360° (Figure 4.8c).

Compared with CLC-1, the CLC-2 demonstrates that an additional tunable capacitor  $C_3$  indeed extends the total phase shift. Moreover, if  $\beta$  is sufficiently large (e.g.,  $\beta$ =10), the  $|\Delta\theta_{max}|$  of a CLC-2 RTPS is over 360° but also with undesired drastic phase changes (Figure 4.8Figure 4.8d). CLC-2 is a popular reflective load due to its large phase shift and simple design [158], [160]. Again, once the passive loss is considered, the  $|\Delta\theta_{max}|$ 

is reduced substantially in practice. For example, the  $|\Delta\theta|$  of a CLC-2 RTPS in [158] is 340° with 12.6dB loss at 2.4GHz and is only 180° with 7.5dB loss at 60GHz in [159].

Theoretically, a fourth-order (or even higher-order) reflective load can achieve a larger phase shift to the full range 360°. However, increasing the load order by simply adding more passive components will substantially increase the passive loss and area [155], [158]. Our proposed transformer-based multi-resonance load overcomes these issues and achieves a 360° full-span phase shift with low loss and a compact area; its operation is introduced in the next section.



Figure 4.7 – The load impedance tuning range for a lossless CLC  $\pi$ -resonator load with two tunable capacitors (CLC-2).



Figure 4.8 – The simulated CLC-2 phase shifting range versus the two capacitor tuning  $\alpha_2$  and  $\alpha_3$  at different capacitor ratios  $\beta$  (i.e.,  $\beta$ =C3<sup>min</sup>/C2<sup>min</sup>) for a lossless CLC-2 RTPS load with (a)  $\beta$ =0, (b)  $\beta$ =1, (c)  $\beta$ =3, and (d)  $\beta$ =10. The  $\alpha_{max}$  is 3 and the  $\Delta\theta$  is referenced to zero for  $\alpha_2$ = $\alpha_3$ =1, i.e., no capacitive tuning.

### 4.4.6. Transformer-based multi-resonance reflective load

Our proposed transformer-based multi-resonance load consists of a physical transformer and two parallel tunable capacitors as  $C_v$  and  $C_t$  (Figure 4.9a). The transformer can be modeled as an ideal 1:n/k transformer connected with a magnetizing inductor  $L_m = k^2 L_{p1}$  and a leakage inductor  $L_k = (1-k^2)L_{p1}$ ;  $L_{p1}$  is the primary self-inductance; n and k are the secondary/primary turn ratio and magnetic coupling coefficient, respectively [174]-[178]. The capacitor  $C_v$  at the secondary-side can be converted by the ideal 1:n/k transformer to the primary-side as  $C_v$ , i.e.,  $C_v$ , i.e.,  $C_v$ , i.e.,  $C_v$  is Figure 9b shows the simplified load model from the primary side to help our following analysis the behavior of the transformer-based multi-resonance load.

The load reflection coefficient trajectory is shown on the Smith chart in Figure 4.10, with  $C_v$ ' and  $C_t$  being varied independently. First, when  $L_m$  and  $C_v$ ' form a parallel resonance, the load impedance  $Z_L$  is fully determined by  $C_t$  as  $Z_1$ . By varying  $C_t$ , the lower left part of the unit circle on the Smith chart can be covered. As  $C_v$ ' increases, the  $L_m$ - $C_v$ ' parallel resonator becomes capacitive, which forms a series resonance with  $L_k$  and result in an S.C. load as  $Z_2$ . Moreover, if  $C_v$ ' further increases, the  $L_k$ - $L_m$ - $C_v$ ' branch becomes inductive, which can form a parallel resonance with  $C_t$  and produces an O.C. load as  $Z_3$ , assuming  $C_t$  is tuned to its maximum value  $C_t^{max}$  for this O.C. load for simplicity. On the other hand, decreasing  $C_v$ ' makes the  $L_m$ - $C_v$ ' resonator behave inductive; this inductive  $L_m$ - $C_v$ ' resonator can combine with  $L_k$  in series and form a parallel resonance with  $C_t$  to realize another O.C. load as  $Z_4$ , assuming that  $C_t$  is tuned to its minimum value  $C_t^{min}$  for this O.C. load also for simplicity. The total RTPS phase shift is determined by the phase difference between  $Z_3$  and  $Z_4$ . Thus, by covering three resonances, i.e., two O.C. points and one S.C.,

our proposed transformer-based multi-resonance load can achieve full span  $360^{\circ}$  phase shift, if it is employed in an RTPS design. Next, the design equations of our transformer-based multi-resonance RTPS load will be presented. The load impedance  $Z_L$  of the proposed transformer-based reflective load is expressed as

$$Z_{L} = \frac{j\omega_{0}L_{p1}\left[1 - \omega_{0}^{2}n^{2}\left(1 - k^{2}\right)L_{p1}C_{v}\right]}{1 - \omega_{0}^{2}n^{2}L_{p1}C_{v} - \omega_{0}^{2}L_{p1}C_{t}\left[1 - \omega_{0}^{2}n^{2}\left(1 - k^{2}\right)L_{p1}C_{v}\right]},$$
(4-17)

and the RTPS output phase can be expressed in equation (4-18).

$$\theta = 2 \tan^{-1} \left( \frac{\omega_0 L_{p1} \left[ 1 - \omega_0^2 n^2 \left( 1 - k^2 \right) L_{p1} C_v \right]}{Z_0 \left\{ 1 - \omega_0^2 n^2 L_{p1} C_v - \omega_0^2 L_{p1} C_t \left[ 1 - \omega_0^2 n^2 \left( 1 - k^2 \right) L_{p1} C_v \right] \right\}} \right). \tag{4-18}$$

$$\Delta\theta = 2 \tan^{-1} \left\{ \frac{\omega_{0} L_{p1} \left[ 1 - \omega_{0}^{2} n^{2} \left( 1 - k^{2} \right) L_{p1} C_{v}^{\text{max}} \right] / \left( Z_{0} C_{t}^{\text{max}} \right)}{1 / C_{t}^{\text{max}} - \omega_{0}^{2} n^{2} L_{p1} - \omega_{0}^{2} L_{p1} \left[ 1 - \omega_{0}^{2} n^{2} \left( 1 - k^{2} \right) L_{p1} C_{v}^{\text{max}} \right]} \right\}$$

$$-2 \tan^{-1} \left\{ \frac{\omega_{0} L_{p1} \left[ 1 - \omega_{0}^{2} n^{2} \left( 1 - k^{2} \right) L_{p1} C_{v}^{\text{min}} \right] / \left( Z_{0} C_{t}^{\text{min}} \right)}{1 / C_{t}^{\text{min}} - \omega_{0}^{2} n^{2} L_{p1} - \omega_{0}^{2} L_{p1} \left[ 1 - \omega_{0}^{2} n^{2} \left( 1 - k^{2} \right) L_{p1} C_{v}^{\text{min}} \right]} \right\}.$$

$$(4-19)$$

Thus, the phase shifting range of such an RTPS design can be expressed in equation (4-19). To facilitate our derivation, we assume that  $C_t$  is tuned to its maximum  $C_t^{\text{max}}$  when  $Z_L$  reaches the O.C. load  $Z_3$ , while  $C_t$  is tuned to its minimum  $C_t^{\text{min}}$  when  $Z_L$  achieves the O.C. load  $Z_4$ , guaranteeing a 360° phase shifting range. Furthermore, the load impedance  $Z_L$ ' excluding  $C_t$  can be expressed as

$$Z_{L}' = j\omega_{0}L_{p_{1}}\left[\left(1 - k^{2}\right) + k^{2}/\left(1 - \omega_{0}^{2}n^{2}L_{p_{1}}C_{v}\right)\right]. \tag{4-20}$$

At  $C_{\rm v}^{\rm min}$  and  $C_{\rm v}^{\rm max}$ ,  $Z_{\rm in}'$  can be found as

$$\begin{split} Z_{\rm L}'\left(C_{\nu}^{\rm min}\right) &= j\omega_{0}L_{p1}\Big[\Big(1-k^{2}\Big) + k^{2}\Big/\Big(1-\omega_{0}^{2}n^{2}L_{p1}C_{\nu}^{\rm min}\Big)\Big],\\ Z_{\rm L}'\left(C_{\nu}^{\rm max}\right) &= j\omega_{0}L_{p1}\Big[\Big(1-k^{2}\Big) + k^{2}\Big/\Big(1-\omega_{0}^{2}n^{2}L_{p1}\alpha C_{\nu}^{\rm min}\Big)\Big]. \end{split} \tag{4-21}$$

In Figure 4.10, for simplicity, we assume that  $C_t^{\min}$  resonates with  $Z_L'(C_v^{\min})$ , and  $C_t^{\max}$  resonates with  $Z_L'(C_v^{\max})$  to achieve the two O.C. resonance points ( $Z_3$  and  $Z_4$ ). Thus, one can obtain  $C_t^{\min} = 1/[\omega_0 Z_L'(C_v^{\min})]$  and  $C_t^{\max} = 1/[\omega_0 Z_L'(C_v^{\max})]$ . The tuning ranges of the two capacitors  $C_v$  and  $C_t$  should satisfy

$$\frac{1}{\alpha_{t}} = \frac{C_{t}^{\min}}{C_{t}^{\max}} = \frac{\left(1 - k^{2}\right) + k^{2} / \left(1 - \omega_{0}^{2} n^{2} L_{pl} \alpha_{v} C_{v}^{\min}\right)}{\left(1 - k^{2}\right) + k^{2} / \left(1 - \omega_{0}^{2} n^{2} L_{pl} C_{v}^{\min}\right)},$$
(4-22)

where  $\alpha_{\rm v} \leq \alpha_{\rm v}^{\rm max} = C_{\rm v}^{\rm max}/C_{\rm v}^{\rm min}$  is the tuning range for  $C_{\rm v}$  and  $\alpha_{\rm t} \leq \alpha_{\rm t}^{\rm max} = C_{\rm t}^{\rm max}/C_{\rm t}^{\rm min}$  is the tuning range for  $C_{\rm t}$ . To simplify the evaluation, we also assume that  $C_{\rm v}$  and  $C_{\rm t}$  have the same maximum tuning range  $\alpha_{\rm max}$ , i.e.,  $\alpha_{\rm v}^{\rm max} = \alpha_{\rm max}^{\rm max} = \alpha_{\rm max}$ , although they are tuned independently to achieve the target phase shift. Assuming  $C_{\rm v}^{\rm min} = 1/(\sqrt{\alpha_{\rm max}}Z_0\omega_0)$ , based on equation (4-22),  $L_{\rm p1}$  can be further obtained as

$$L_{\rm pl} = Z_0 \frac{\left(\alpha_{\rm max} + 1\right) + \sqrt{\left(\alpha_{\rm max} + 1\right)^2 - 4\alpha_{\rm max}/\left(1 - k^2\right)}}{2\sqrt{\alpha_{\rm max}}\omega_0 n^2},$$
(4-23)

and, by using equation (4-21),  $C_t^{\min}$  can be found as

$$C_{t}^{\min} = \frac{1 - \omega_{0} n^{2} L_{p1}}{\omega_{0}^{2} L_{p1} \left[ Z_{0} \sqrt{\alpha_{\max}} - \left(1 - k^{2}\right) \omega_{0} n^{2} L_{p1} \right]}.$$
(4-24)

Figure 4.11 demonstrates the simulated phase shifting range  $\Delta\theta$  of our proposed load by independently tuning the capacitors  $C_{\rm v}$  and  $C_{\rm t}$  with an exemplar design setting as  $\alpha_{max}$ =3, k=0.5, and n=1. Note that the maximum capacitor tuning range is  $\alpha_{max}$ =3. It can be observed that the phase angle of  $Z_3$  (Figure 4.11a) as  $Z_{\rm L}$  ( $C_{\rm v}^{\rm max}$ ,  $C_{\rm t}^{\rm max}$ ) is 0°. Similarly, the phase angle of  $Z_4$  as  $Z_{\rm L}$  ( $C_{\rm v}^{\rm min}$ ,  $C_{\rm t}^{\rm min}$ ) is -360°. Therefore, the  $|\Delta\theta|$  of our proposed load achieves full span 360°, matching well with our qualitative analysis on the Smith chart (Figure 4.10). Unlike CLC-1 (Fig. 6) and CLC-2 (Fig. 8) RTPS, our proposed design avoids any sharp phase change during the capacitive tuning.

Figure 4.11(b) shows the total phase shifting range, when  $C_t^{\min}$  is selected to be 1.5 times larger than the value obtained in equation (24); the resulting  $|\Delta\theta|$  is 405.2° also with a smooth phase change during the tuning. Moreover, the above design equations (23) and (24) can serve as the design equations for our proposed transformer-based multi-resonance load, and these equations yield the load trajectory on Figure 4.11 and ensure a full span 360° phase shift. The other design parameters (i.e., k and n) can be optimized to further extend the total phase shifting range. The simulation results of a proof-of-concept RTPS design with our proposed transformer-based multi-resonance load will be shown in the following section. Table 4.1 summarizes the maximum phase shifting range  $|\Delta\theta_{\max}|$  of each reflective load when employed in an RTPS. Our transformer-based multi-resonance RTPS load achieves a full span 360° phase shift without any drastic phase change.





Figure 4.9 – The proposed transformer multi-resonance load in an RTPS with (a) single-ended schematic and (b) simplified schematic.



Figure 4.10 – The load impedance tuning range for the proposed transformer-based multi-resonance load by varying both  $C_t$  and  $C_v$ '. All the passive components are assumed to be lossless for simplicity.



Figure 4.11 – The simulated phase shift of our proposed transformer-based multiresonance load by independently varying  $C_v$  and  $C_t$  with a maximum capacitive tuning range  $\alpha_t^{max} = \alpha_v^{max} = \alpha_{max} = 3$ . (a)  $C_t^{min}$  is obtained from equation (24), and (b)  $C_t^{min}$  is 1.5 times of the value obtained from equation (24).

Table 4.1 – Comparison table of maximum phase shift for different reflective loads in an RTPS

| Reflective load type                       | Max. phase shift $ \Delta \theta_{\max} ^*$ | Phase change |
|--------------------------------------------|---------------------------------------------|--------------|
| CL                                         | 60°                                         | Moderate     |
| SLC                                        | 120°                                        | Moderate     |
| PLC                                        | 120°                                        | Moderate     |
| CLC-1                                      | <360°                                       | Drastic      |
| CLC-2                                      | ≥360°                                       | Drastic      |
| Transformer-based multi-<br>resonance load | ≥360°                                       | Moderate     |

<sup>\*</sup>Assume that the maximum capacitor tuning range  $\alpha_{max}$  equals 3.

# 4.5. Transformer-based 90° Coupler Design

We employ a transformer-based 90° coupler in the proposed RTPS design to achieve an ultra-compact size [176]. Figure 4.12(a) shows the coupler with the input, couple, through and isolation ports, denoted as In, Cpl, Thru and Iso respectively. The capacitors  $C_g$  and  $C_M$  are added to balance the output amplitude of the Cpl and Thru ports, achieve a high isolation at the Iso port, and obtain a good input matching at the In port. In practice, these two capacitors  $C_g$  and  $C_M$  absorb the parasitic capacitances of the transformer-based 90° coupler and the transformer-based reflective load. The 90° coupler utilizes the top three metal layers in a standard 130nm BiCMOS process, i.e., a 4 $\mu$ m-thick aluminum, a 1.42 $\mu$ m-thick aluminum, and a 0.55 $\mu$ m-thick copper layer. The 90° coupler only occupies a 120 $\mu$ m×120 $\mu$ m footprint for our proof-of-concept design at 62GHz. The outer diameter  $D_{out}$ , inner diameter  $D_{in}$ , width W and spacing S are 120 $\mu$ m, 80 $\mu$ m, 6 $\mu$ m and 4 $\mu$ m (Figure 4.12b). The simulated phase and amplitude of the 90° coupler are shown in Figure 4.13, when the In port is driven by an input signal.

Our transformer-based 90° coupler maintains a nearly constant 90° phase difference between the Thru and Cpl ports within  $\pm 1^\circ$  phase error from 54GHz to 66GHz (Figure 4.13b). The Thru port and the Cpl port magnitude responses are -4.3dB and -4dB at the center frequency of 62GHz (Figure 4.13a). Considering the fundamental 3dB IL loss by the 1:2 power splitting, the additional loss due to the 90° coupler is only 1.3dB at 62GHz. The amplitude variation is within  $\pm 1.3$ dB from 51GHz to 70GHz. The return loss (RL) at the In port is 15.8dB, and the isolation is 10.2dB at 62GHz. The simulation verifies that the transformer-based 90° coupler exhibits excellent phase and amplitude balance and low passive loss from 50GHz to 70GHz.



Figure 4.12-A transformer-based  $90^\circ$  coupler with (a) the schematic and (b) 3D EM model in the proof-of-concept 60GHz RTPS design.





Figure 4.13 - (a) Simulated amplitudes for the Thru, Cpl, and Iso ports and the return loss at the In port. (b) Simulated phases for the Thru and Cpl ports.

## 4.6. Circuit Implementation and Simulation of the Proposed RTPS

Our proof-of-concept fully differential ultra-compact transformer-based RTPS is shown in Figure 4.14 [143]. It employs two identical transformer-based 90° couplers and two identical transformer-based multi-resonance reflective loads (Figure 4.14a). It is implemented as a fully differential and symmetric design in a standard 130nm SiGe BiCMOS process with a core area of  $340\mu\text{m}\times480\mu\text{m}$ . The symmetric layout also makes the input and output inter-changeable (Figure 4.14b and Figure 4.14c).

The 3D EM model of our proposed differential transformer-based multi-resonance load is shown in Figure 4.14(d). It employs a 2:1 transformer that is optimized to provide a maximum phase shifting range based on the design equations. It also offers a desired impedance conversion with the coupling coefficient k=0.63. The transformer center taps are used to provide the DC bias voltage for the differential varactors that are employed as the tuning capacitors  $C_v$  and  $C_t$ . The transformers for the multi-resonance loads utilize the same metal layers as the transformer-based 90° coupler, and they occupy  $100\mu m \times 100\mu m$  chip size. The outer/inner diameter  $D_{out}/D_{in}$ , width W and space S are  $100\mu m$ ,  $49\mu m$ ,  $5\mu m$  and  $5\mu m$ . Figure 4.15 summarizes the simulated differential complex transmission coefficient, i.e., complex insertion loss IL, of our RTPS at 62GHz for different phase shifting configurations. Note that the amplitude of IL represents the RTPS passive loss, while its phase stands for the output phase shift of the RTPS.

The simulation includes 3D EM models for the transformers and signal routings as well as the extracted device parasitics. Each dot represents a complex IL value achieved by a given varactor setting of  $(C_v, C_t)$ . The two control voltages  $V_{cv}$  and  $V_{ct}$  are

independently swept from -1V to 0.45V with a step of 10mV, making the effective  $C_v$  vary from 46fF to 129fF and the effective  $C_t$  vary from 21fF to 59fF, as shown in Figure 4.15.

Figure 4.16 shows the simulated capacitances (i.e.,  $C_v$  and  $C_t$ ) versus control voltage (i.e.,  $V_{cv}$  and  $V_{ct}$ ). The slopes (i.e., capacitance/V) around -0.8V and +0.8V are much smaller than set around -0.5 and 0V, the corresponding dots of complex IL are denser (Figure 4.15). On the contrary, when  $V_{cv}$  and  $V_{ct}$  are set around -0.8V and +0.8V, the corresponding dots of complex IL are sparser (Figure 4.15). The plots of the simulated  $P_{1dB}$  and IIP3 for different varactor control settings are shown in Figure 4.17. The P1dB is 13.8dBm and the IIP3 is 23.5dBm for the setting of  $V_{cv}$ =0.45V and  $V_{ct}$ =0.45V, respectively. Moreover, the  $P_{1dB}$  is 13.6dBm and the IIP3 is 21.9dBm for the setting of  $V_{cv}$ =-1V and  $V_{ct}$ =-1V, respectively. Note that the dots in Figure 4.15 are not uniformly distributed due to the nonlinear relationship between the varactor control voltages and the complex IL variations. A major reason is due to the nonlinear relationship between the varactor capacitance versus its control voltage value.



Figure 4.14 – Our proposed RTPS design with (a) the schematic, (b) the layout and EM model, (c) the chip microphotograph, and (d) the EM model of the proposed load.



Figure 4.15 – The simulated complex IL at 62 GHz for the proposed transformer-based RTPS. The simulation includes 3D EM modeling of the passive structures and device parasitic extraction.



Figure 4.16 – The simulated capacitance vs. the control voltage of (a)  $V_{cv}$  and (b)  $V_{ct}$ .



Figure 4.17 – The simulated  $P_{1dB}$  and IIP3 with (a)  $V_{cv}$ =0.45V and  $V_{ct}$ =0.45V  $V_{cv}$ =0.45V, and (b)  $V_{cv}$ =-1V and  $V_{ct}$ =-1V.

#### 4.7. Measurement Results

The RTPS chip is measured by direct probing with a 4-port network analyzer (Keysight PNA-X N5247A and R&S ZVA67). The network analyzer enables full 4-port S-parameter measurement up to 67GHz. To evaluate the phase shift performance, we tune the varactor control voltages  $V_{cv}$  and  $V_{ct}$  both from -1V to +0.45V at discrete voltage steps. The differential complex IL parameters are extracted from the S-parameter measurements to capture the RTPS phase shift and passive loss at different varactor settings. Figure 4.18 shows the measured complex IL results at 62GHz, and each point on the Smith chart represents one complex IL value for a given varactor setting ( $V_{cv}$ ,  $V_{ct}$ ). The varactor control voltages of  $V_{cv}$  and  $V_{ct}$  are swept from -1V to +0.45V.

Our RTPS design covers a total phase shift of 367° achieving full span phase shifting. The measured IL magnitudes also closely match the simulated values (Figure 4.18). Compared with the reported 60GHz RTPS designs [159]-[164], our design is a fully integrated mm-Wave RTPS that first-ever successfully covers the 360° full range phase shifting. In addition, Figure 4.18 also shows four different IL contours as the minimum IL (grey), 10dB constant IL (black), 11dB constant IL (red) and 12dB constant IL (blue) contours. The minimum IL circle settings for the full span 360° interpolation are the outmost IL points in Figure 4.20. In parallel, the dense phase interpolation allows one to select phase shift settings to achieve a full span 360° phase shift with a constant IL magnitude (10dB, 11dB, or 12dB). Note that the phase resolution is ~5° between two adjacent dots for the measurement, which presents fewer test dots. Those test dots are roughly uniformly distributed on the Smith chart.

Figure 4.19 shows the measured complex IL results at different frequencies from 58GHz to 64GHz for three independent RTPS test chips. All the three test chips exhibit closely matched measurement results, and a full span 360° phase shift is consistently achieved from 59GHz to 64GHz for all the chips. Figure 4.20 shows the measured IL under the minimum IL settings versus phase shift at different frequencies for the three independent test chips at 62GHz. Test chip 1 achieves a measured minimum IL from 3.7dB to 10.2dB over the 367° full-span phase shift. The measured minimum IL range is 4.1 dB to 11.05 dB for test chip 2 and 3.8dB to 10.55dB for test chip 3, respectively, showing a highly consistent performance.

To quantitatively evaluate an RTPS, its figure-of-merit (FoM) is defined as the maximum phase shifting range  $|\Delta\theta_{max}|$  divided by the worst-case minimum insertion loss (IL<sub>min</sub>) over the entire phase shifting range, i.e., FoM(°/dB)= $|\Delta\theta_{max}|$ /IL<sub>min</sub> [179]. Figure 4.21shows the measured FoM over different frequencies (59GHz to 64GHz) for the three independent RTPS test chips. The achieved FoM values are between 34.5°/dB and 37.1°/dB, well outperforming the reported fully integrated 60GHz RTPS designs on bulk silicon processes (Table 4.2). The measured peak FoM is 37.1°/dB for our proposed RTPS, which is the state-of-the-art FoM among silicon-based fully integrated 60GHz RTPS.

In addition, the phase shift performance along constant IL contours is particularly useful, if an RTPS is used in a phased-array system. A dense phase shifting and a constant amplitude versus phase shift are both important, so that a matched and constant signal amplitude across the array elements can be achieved for high-quality beam-forming and beam-steering. Therefore, using our transformer-based RTPS design, one can choose the phase shift settings to achieve a full span 360° phase shift along a constant IL, i.e., 10dB,

11dB or 12dB. Note that this is possible due to the dense and repeatable phase interpolations of our RTPS design. The IL variations are largely reduced under the constant IL settings. The resulting maximum/average IL variation for test chip 1 is 0.7dB/0.1dB for the 10dB IL contour, 0.59dB/0.1dB for the 11dB IL contour, and 0.74dB/0.13dB for the 12dB IL contour, while the RTPS still covers a 360° full span phase shift. Consistent performance is achieved by test chip 2 and 3 (Figure 4.22).

The measured input matching for the three RTPS test chips at all the varactor settings over the frequencies are shown in Figure 4.23. The control voltages of the two varactors, i.e., V<sub>Cv</sub> and V<sub>Ct</sub>, are independently tuned from -1V to 0.45V. For all the varactor settings, the measured RL values for all the three chips are better than 9.1dB at 58GHz, 9.6dB at 60GHz, 10.4dB at 62GHz and 12.4dB at 64GHz (Figure 4.23), showing a good input matching compared with reported 60GHz RTPS designs [163].



Figure 4.18 – Measured RTPS complex IL at 62GHz for the test chip 1. Minimum IL and constant IL contours (10dB, 11dB, and 12dB) are highlighted.



Figure 4.19 – Measured complex IL of three independent RTPS test chips at different frequencies. The constant IL contours (IL=10/11/12dB) are highlighted.



Figure 4.20 – Measured RTPS minimum IL vs. phase shift settings at various frequencies for 3 test chips, e.g., (a) Test chip 1, (b) Test chip 2 and (c) Test chip 3.



Figure 4.21 – FoM vs. frequency for our RTPS at different frequencies.



Figure 4.22 – Measured IL variation vs.  $360^\circ$  phase shifts for different constant IL contours (10, 11 and 12dB) at 62GHz for 3 test chips, e.g., (a) Test chip 1, (b)Test chip 2 and (c)Test chip 3.



Figure 4.23 – Measured return loss vs. 2 independent control voltages (i.e.,  $V_{cv}$  and  $V_{ct}$ ) of three independent RTPS chip samples at 58, 60, 62 and 64GHz for Test chip 1.



Figure 4.24 – Measured return loss vs. 2 independent control voltages (i.e.,  $V_{cv}$  and  $V_{ct}$ ) of three independent RTPS chip samples at 58, 60, 62 and 64GHz for Test chip 2.



Figure 4.25 – Measured return loss vs. 2 independent control voltages (i.e.,  $V_{cv}$  and  $V_{ct}$ ) of three independent RTPS chip samples at 58, 60, 62 and 64GHz for Test chip 3.

Table 4.2 – Performance comparison of reported fully integrated RTPS designs in silicon

| Reference                | Freq.<br>[GHz] | Max. phase shift $[^{\circ}]^{(5)}$ | IL<br>[dB] <sup>(5)</sup>                          | Max. IL<br>variation<br>[dB] <sup>(5)</sup>           | RL [dB] <sup>(5)</sup> | Best FoM [°/dB] | Core chip<br>area<br>[mm <sup>2</sup> ] | Process         |
|--------------------------|----------------|-------------------------------------|----------------------------------------------------|-------------------------------------------------------|------------------------|-----------------|-----------------------------------------|-----------------|
| This work                | 62             | 367                                 | 3.7-10.2 <sup>(1)</sup><br>9.6-10.3 <sup>(2)</sup> | $\begin{array}{c} 6.5^{(1)} \\ 0.7^{(2)} \end{array}$ | 10.4(5)                | 37.1            | 0.16                                    | 130nm<br>BiCMOS |
| [160]<br>RFIC 09'        | 60             | 180                                 | 4.2-7.5<br>(8.4-15) <sup>(3)</sup>                 | 3.3                                                   | -                      | 24              | 0.18                                    | 130nm<br>BiCMOS |
| [159]<br>CICC 11'        | 60             | 180                                 | 5-8.3<br>(10-16.6) <sup>(3)</sup>                  | 3.3                                                   | 9                      | 21.7            | 0.031                                   | 65nm CMOS       |
|                          |                | 147                                 | 3.3-5.7                                            | 2.4                                                   | 13                     | 25.8            | 0.048                                   |                 |
| [163]<br>ARRAY<br>10'    | 60             | 156                                 | 4-6.2                                              | 2.2                                                   | 5                      | 25              | 0.33                                    | 130nm<br>BiCMOS |
| [164]<br>MWCL 14'        | 60             | 90                                  | 4.5-6.9<br>(18-27.6) <sup>(4)</sup>                | 1.4                                                   | 12                     | 13              | 0.25                                    | 130nm<br>BiCMOS |
| [170]<br><i>I</i> MS 16' | 60             | 200                                 | 8.2                                                | 1                                                     | 15                     | 24.4            | 0.28                                    | 130nm<br>BiCMOS |
| [172]<br>EuMC 14'        | 60             | 190                                 | 10.8                                               | 1.8                                                   | 15                     | 17.6            | 0.027                                   | 90nm CMOS       |
| [169]<br>EuMC 16'        | 40             | 60                                  | 5                                                  | ı                                                     | 17                     | 12              | 0.18                                    | 55nm<br>BiCMOS  |
| [158]<br>T-MTT 08'       | 2.5            | 340                                 | 8.6-12.6                                           | 4                                                     | -                      | 27              | 0.66                                    | 180nm<br>CMOS   |
| [155]<br>TCAS-I<br>07'   | 1.9            | 360                                 | 20.5                                               | 14                                                    | -                      | 17.6            | 2.5                                     | 180nm<br>CMOS   |

(1)Minimum IL settings; (2)Constant 10dB IL circle settings; (3)Assuming two RTPS designs are connected in cascade for 360° phase shift; (4)Assuming four RTPS designs are connected in cascade for 360° phase shift; (5)Evaluated at the center frequency of 62GHz.

# 4.8. Chapter Summary

We present a comprehensive discussion on different passive reflective loads reported for RTPS designs, their load impedance tuning ranges, and the resulting phase shifting range limits. To address the limitations of reported RTPS reflective loads, we propose a fully differential transformer-based mm-Wave RTPS topology that achieves a full span 360° phase shift, low loss, and a compact area. The proposed transformer-based mm-Wave RTPS is composed of two transformer-based 90° couplers and two transformerbased multi-resonance reflective loads. A proof-of-concept design at 62GHz is implemented in a standard 130nm BiCMOS process with a core area of 480μm-by-340μm, and it achieves a 367° phase shifting range and low IL (3.7dB<|IL|<10.2dB). It also performs a full span 360° phase shifting from 58GHz to 64GHz with a worst-case minimum IL of 10.7dB. Moreover, our RTPS design allows for 360° phase shifting over constant IL contours, e.g., 10dB, 11dB, and 12dB, with a small IL variation (<0.74dB). Compared with the reported fully integrated mm-Wave RTPS in silicon, our design achieves the first-ever full span 367° phase shift, the lowest insertion loss, and the best figure-of-merit (FoM) of 37.1°/dB. Three independent RTPS chips are measured, and highly repeatable results are achieved.

## 4.9. Appendix

The purpose of this Appendix is to investigate the RTPS performance if the component parasitic losses are included. The overall RTPS insertion loss IL includes the losses caused by the 90° coupler and the reflective loads and is shown as

$$IL = 2IL_{90^{\circ}} + Loss_{Load} = 2IL_{90^{\circ}} + |\Gamma|,$$
 (4-25)

where all the parameters are in dB scale, and  $\Gamma$  is the reflection coefficient of the reflective load. Since the 90° coupler loss is independent of the reflective load and its setting, it can be considered as a constant additive loss. Therefore, we focus on the loss effect due to the reflective loads here. In Chapter 4.4, we analyze the maximum phase shift range  $\Delta\theta_{max}$  of an RTPS with the lossless reflective loads. Here, we evaluate the impact of lossy reflective loads on the RTPS performance. We follow the same reflective load assumptions and the definitions, and the derived capacitances and inductances in Chapter 4.4.

#### 4.9.1. Capacitive load (CL)

As shown in Figure 4.26(a), the series loss resistance of a varactor is defined as  $R_C(\alpha_Q,\alpha) = \alpha_Q/(\omega_0 Q_C^{\text{max}}\alpha C^{\text{min}})$ , where  $\alpha_Q$  stands for the ratio between the maximum and minimum quality factor  $Q_C$ , i.e.,  $1 \le \alpha_Q \le \alpha_Q^{\text{max}} = Q_C^{\text{max}}/Q_C^{\text{min}}$ .  $Q_C^{\text{max}}$  occurs at  $C = C^{\text{min}}$  (i.e.,  $\alpha = \alpha_Q = 1$ ), and  $Q_C^{\text{min}}$  occurs at  $C = C^{\text{max}}$  (i.e.,  $\alpha_Q = \alpha_Q^{\text{max}}$  and  $\alpha = \alpha_{\text{max}}$ ). In addition, if  $\alpha_Q$  equals  $\alpha$ ,  $\alpha$ , will be a fixed series resistor, i.e.,  $\alpha$  including the loss of  $\alpha$ , the  $\alpha$  occurs are rewritten as

$$\Delta\theta_{\text{max}} = \tan^{-1} \left( \frac{\sqrt{\alpha_{\text{max}}} Q_{C}^{\text{max}}}{Q_{C}^{\text{max}} + \sqrt{\alpha_{\text{max}}}} \right) + \tan^{-1} \left( \frac{\sqrt{\alpha_{\text{max}}} Q_{C}^{\text{max}}}{Q_{C}^{\text{max}} - \sqrt{\alpha_{\text{max}}}} \right)$$

$$- \tan^{-1} \left( \frac{Q_{C}^{\text{max}}}{\sqrt{\alpha_{\text{max}}} Q_{C}^{\text{max}} + \alpha_{Q}} \right) - \tan^{-1} \left( \frac{Q_{C}^{\text{max}}}{\sqrt{\alpha_{\text{max}}} Q_{C}^{\text{max}} - \alpha_{Q}} \right).$$

$$(4-26)$$

For a given  $\alpha_{\rm max}$ , and  $Q_C^{\rm max}$ , the  $|\Gamma|$  of CL can be derived as

$$\left|\Gamma\right| = \sqrt{\frac{\alpha_{\text{max}}\left(Q_{C,\text{max}}^{2}\alpha_{Q}^{2}\right) + Q_{C,\text{max}}\left(\alpha^{2}Q_{C,\text{max}} - 2\alpha\alpha_{Q}\sqrt{\alpha_{\text{max}}}\right)}{\alpha_{\text{max}}\left(Q_{C,\text{max}}^{2}\alpha_{Q}^{2}\right) + Q_{C,\text{max}}\left(\alpha^{2}Q_{C,\text{max}} - 2\alpha\alpha_{Q}\sqrt{\alpha_{\text{max}}}\right)}}.$$
(4-27)

First, based on equations (26) and (27), if  $Q_C^{\text{max}}$  is large enough, the  $\Delta\theta_{\text{max}}$  and  $|\Gamma|$  of a CL RTPS is close to the ideal lossless case. Next, assuming  $\alpha_{\text{max}}$ =3, Figure 4.26(b)-(c) show the  $\Delta\theta_{\text{max}}$  of a lossy CL RTPS and the  $|\Gamma|$  for different  $Q_C^{\text{max}}$  versus the  $\alpha$ . The  $\Delta\theta_{\text{max}}$  of a CL RTPS is almost constant versus different load quality factors, and  $|\Gamma|$  varies from - 2.3dB ( $Q_C^{\text{max}}$ =10) to -0.75dB ( $Q_C^{\text{max}}$ =30). Figure 4.26(d) depicts the load impedance trajectories on Smith chart. Thus, the lossy CL directly degrades the IL but does not reduce its phase shifting range.



Figure 4.26 – (a) A lossy CL model, (b) the maximum phase shift  $\Delta\theta_{max}$  vs.  $\alpha$ , (c) the load reflection coefficient  $|\Gamma|$  vs.  $\alpha$ , and (d) the load impedance trajectory with different  $Oc^{max}$ .

#### 4.9.2. Series L-C resonant load (SLC)

A lossy SLC load is modeled in Figure 4.27(a). Considering the varactor series resistances  $R_{Cs}$  and the inductor series resistances  $R_{Ls}$ , we define  $R_{Ls}=\omega_0 L_{s'}/Q_L$ , where  $Q_L$  is the quality factor of the inductor  $L_s$  at the operating frequency  $\omega_0$ . The  $\Delta\theta_{max}$  of SLC can be obtained in equation (4-28). For a given  $\alpha_{max}$  and quality factors  $Q_L$  and  $Q_C^{max}$ , the  $|\Gamma|$  of SLC can be derived in equation (4-29). Choosing  $\alpha_{max}=3$  as an example, the  $|\Gamma|$  of SLC versus the tuning range  $\alpha$  for different quality factors (i.e.,  $Q_C^{max}=Q_L$ ) is plotted in Figure 4.27(b). The  $\Delta\theta_{max}$  versus different  $Q_C^{max}$  (e.g.,  $Q_L$  and  $Q_C^{max}$ ) is plotted in Figure 4.27(c). For  $\alpha_{max}=3$ , the  $|\Delta\theta_{max}|$  of an SLC RTPS only exhibits a minor variation (e.g.,  $\leq 6.3^{\circ}$ ) for different  $Q_L$  and  $Q_C^{max}$ , compared to the ideal case with lossless load (i.e.,  $|\Delta\theta_{max}|=120^{\circ}$ ). In addition, assuming  $Q_L=Q_C^{max}$ , the worst-case  $|\Gamma|$  is equal to -5.1dB.

$$\Delta\theta_{\text{max}} = \tan^{-1} \left[ \frac{(\alpha_{\text{max}} - 1)Q_{L}Q_{C}^{\text{max}}}{(1 + \alpha_{\text{max}})Q_{C}^{\text{max}} + 2\alpha_{\text{max}}Q_{L} + 2\sqrt{\alpha_{\text{max}}}Q_{L}Q_{C}^{\text{max}}} \right]$$

$$- \tan^{-1} \left[ \frac{(\alpha_{\text{max}} - 1)Q_{L}Q_{C}^{\text{max}}}{(1 + \alpha_{\text{max}})Q_{C}^{\text{max}} + 2\alpha_{\text{max}}Q_{L} - 2\sqrt{\alpha_{\text{max}}}Q_{L}Q_{C}^{\text{max}}} \right]$$

$$- \tan^{-1} \left[ \frac{(\alpha_{\text{max}} - 1)Q_{L}Q_{C}^{\text{max}}}{(1 + \alpha_{\text{max}})Q_{C}^{\text{max}} + 2\alpha_{Q}Q_{L} - 2\sqrt{\alpha_{\text{max}}}Q_{L}Q_{C}^{\text{max}}} \right]$$

$$+ \tan^{-1} \left[ \frac{(\alpha_{\text{max}} - 1)Q_{L}Q_{C}^{\text{max}}}{(1 + \alpha_{\text{max}})Q_{C}^{\text{max}} + 2\alpha_{Q}Q_{L} + 2\sqrt{\alpha_{\text{max}}}Q_{L}Q_{C}^{\text{max}}} \right].$$
(4-28)

$$|\Gamma| = \sqrt{\frac{Z_{0}^{2} - \left[2\omega_{0}L_{s}/Q_{L} + 2\sqrt{\alpha_{\max}}\alpha_{Q}Z_{0}/\alpha Q_{C}^{\max}\right]Z_{0}}{+\left[\omega_{0}L_{s}/Q_{L} + \sqrt{\alpha_{\max}}\alpha_{Q}Z_{0}/\alpha Q_{C}^{\max}\right]^{2} + \left(\omega_{0}L_{s} - \sqrt{\alpha_{\max}}Z_{0}/\alpha\right)^{2}}} \cdot \frac{+\left[\omega_{0}L_{s}/Q_{L} + \sqrt{\alpha_{\max}}\alpha_{Q}Z_{0}/\alpha Q_{C}^{\max}\right]^{2} + \left(\omega_{0}L_{s} - \sqrt{\alpha_{\max}}Z_{0}/\alpha\right)^{2}}{+\left[\omega_{0}L_{s}/Q_{L} + \sqrt{\alpha_{\max}}\alpha_{Q}Z_{0}/\alpha Q_{C}^{\max}\right]^{2} + \left(\omega_{0}L_{s} - \sqrt{\alpha_{\max}}Z_{0}/\alpha\right)^{2}}.$$
(4-29)



Figure 4.27 – (a) A lossy SLC model, (b) the load reflection coefficient  $|\Gamma|$  as  $\alpha$  varies from 1 to  $\alpha_{max}$ , (c) the  $\Delta\theta_{max}$  versus different quality factors, and (d) the load impedance trajectory on Smith chart with the different  $Qc^{max}$ .

# 4.9.3. Parallel L-C resonant load (PLC)

The lossy PLC load is modeled in Figure 4.28(a). Including the series resistances  $R_{Lp}$  of an inductor and the series resistance  $R_{Cp}$  of a varactor, the derived  $\Delta\theta_{\rm max}$  of a PCL RTPS in equation (4-10) can be re-written as equation (4-30), and  $|\Gamma|$  is expressed in equation (4-31). Based on the derived equations, for a given  $\alpha_{\rm max}$ , the  $|\Gamma|$  of a PLC load

versus the  $\alpha$  and the quality factors (e.g.,  $Q^{\max}=Q_{\rm C}^{\max}=Q_{\rm L}$ ) are plotted in Figure 4.28(b). The  $\Delta\theta_{\rm max}$  of an RTPS with PLC load versus different quality factors (e.g.,  $Q_{\rm L}$  and  $Q_{\rm C}^{\max}$ ) is plotted in Figure 4.28(c). Also, the load impedance trajectory with different quality factors (e.g.,  $Q_{\rm L}$  and  $Q_{\rm C}^{\max}$ ) are shown in Figure 4.28(d). Assuming  $\alpha_{\rm max}=3$ , the  $\Delta\theta_{\rm max}$  of PLC also only occurs minor variation, i.e.,  $118.7^{\circ} \le |\Delta\theta_{\rm max}| \le 123.7^{\circ}$ , for different values of the quality factor (e.g.,  $10 \le Q_{\rm L} = Q_{\rm C}^{\max} \le 30$ ). The worst-case  $|\Gamma|$  equals -8.9dB. Compared with SLC, if the resistances caused by finite quality factor are included, PCL exhibits a similar  $\Delta\theta_{\rm max}$  variation but higher loss.

$$\Delta\theta_{\text{max}} \cong \tan^{-1} \left\{ \frac{1 + \left(\frac{\alpha_{Q}}{Q_{C}^{\text{max}}}\right)^{2} - \sqrt{\alpha_{\text{max}}} \omega_{0} L_{p}}{Z_{0}} \right\}$$

$$- \tan^{-1} \left\{ \frac{1 - \frac{\omega_{0} L_{p}}{\sqrt{\alpha_{\text{max}}} Z_{0}}}{\frac{\omega_{0} L_{p}}{\alpha_{\text{max}}} Z_{0}} + \left(\frac{1}{Q_{L}} + \frac{Z_{0}}{\omega_{0} L_{p}}\right) \right\}$$

$$+ \tan^{-1} \left\{ \frac{1 - \frac{\omega_{0} L_{p}}{\sqrt{\alpha_{\text{max}}} Z_{0}}}{\frac{\omega_{0} L_{p}}{\alpha_{\text{max}}} Z_{0}} + \left(\frac{1}{Q_{L}} + \frac{Z_{0}}{\omega_{0} L_{p}}\right) \right\}$$

$$+ \tan^{-1} \left\{ \frac{1 + \left(\frac{\alpha_{Q}}{Q_{C}^{\text{max}}}\right)^{2} - \sqrt{\alpha_{\text{max}}} \omega_{0} L_{p}}{Z_{0}} \right\}$$

$$- \tan^{-1} \left\{ \frac{1 - \frac{\omega_{0} L_{p}}{\sqrt{\alpha_{\text{max}}} Z_{0}}}{\frac{\omega_{0} L_{p}}{\sqrt{\alpha_{\text{max}}} Z_{0}}} - \left(\frac{1}{Q_{L}} - \frac{Z_{0}}{\omega_{0} L_{p}}\right) \right\}$$

$$- \tan^{-1} \left\{ \frac{1 - \frac{\omega_{0} L_{p}}{\sqrt{\alpha_{\text{max}}} Z_{0}}}{\frac{\omega_{0} L_{p}}{\sqrt{\alpha_{\text{max}}} Z_{0}}} - \left(\frac{1}{Q_{L}} - \frac{Z_{0}}{\omega_{0} L_{p}}\right) \right\}$$

$$- (4-30)$$

$$\left|\Gamma\right| \cong \begin{pmatrix} \frac{\alpha\omega_{0}^{2}L_{p}}{\sqrt{\alpha_{\max}}} \left(\frac{\alpha\omega_{0}Z_{0}C_{p}^{\min}}{Q_{L}^{2}} - \frac{2\alpha_{Q}}{Q_{C}^{\max}}\right) - \frac{2\alpha\omega_{0}Z_{0}L_{p}}{\sqrt{\alpha_{\max}}} \left(1 - \frac{\alpha_{Q}}{Q_{C}^{\max}}Q_{L}\right) \\ + \left(\omega_{0}^{2}L_{p}^{2} + Z_{0}^{2} - \frac{2\omega_{0}Z_{0}L_{p}}{Q_{L}}\right) \left[1 + \left(\frac{\alpha_{Q}}{Q_{C}^{\max}}\right)^{2}\right] \\ \frac{\alpha\omega_{0}^{2}L_{p}}{\sqrt{\alpha_{\max}}} \left(\frac{\alpha\omega_{0}Z_{0}C_{p}^{\min}}{Q_{L}^{2}} + \frac{2\alpha_{Q}}{Q_{C}^{\max}}\right) - \frac{2\alpha\omega_{0}Z_{0}L_{p}}{\sqrt{\alpha_{\max}}} \left(1 - \frac{\alpha_{Q}}{Q_{C}^{\max}}Q_{L}\right) \\ + \left(\omega_{0}^{2}L_{p}^{2} + Z_{0}^{2} + \frac{2\omega_{0}Z_{0}L_{p}}{Q_{L}}\right) \left[1 + \left(\frac{\alpha_{Q}}{Q_{C}^{\max}}\right)^{2}\right] \\ \end{pmatrix}$$

$$= 0.77$$



Figure 4.28 – (a) A lossy PLC model, (b) the load reflection coefficient  $|\Gamma|$  as  $\alpha$  varies from 1 to  $\alpha_{max}$ , (c) the  $\Delta\theta_{max}$  versus different quality factors, and (d) the load impedance trajectory on Smith chart with the different  $Q_{C}^{max}$ .

## 4.9.4. *CLC* $\pi$ -resonant load with one-side capacitive tuning)

The lossy CLC-1 model with the resistances  $R_{C2}$ ,  $R_{C3}$  and  $R_{LI}$  is shown in Figure 4.29(a). For a given  $\alpha_{\text{max}}=3$ , Figure 4.29(b)-(d) show the load impedance trajectories on Smith chart versus the  $\alpha$  of  $C_2$  and the quality factors (e.g.,  $Q_{C3}$ ,  $Q_L$  and  $Q_{C2}^{\text{max}}$ ) for different  $\beta$  (i.e.,  $\beta=C_3/C_2^{\text{min}}$ ). A larger  $\beta$  for CLC-1 achieves a wider phase shift, but the losses increase accordingly. Also, with smaller quality factors (e.g.,  $Q \le 10$ ), the load reflection coefficient trajectory cannot circle the origin of the Smith chart, which substantially shrinks the phase shift. For example, the load impedance phase  $\angle \Gamma(C_2^{\text{min}})$  and  $\angle \Gamma(C_2^{\text{max}})$  for Q=10 are 190° and 279.4°, but,  $\angle \Gamma(C_2^{\text{min}})$  and  $\angle \Gamma(C_2^{\text{max}})$  for Q=20 are 195.7° and 283.1°, respectively. Thus, the  $|\Delta\theta_{\text{max}}|$  for Q=20 is 272.6° while the  $|\Delta\theta_{\text{max}}|$  for Q=10 is only 89.4° (Figure 4.29d).

#### 4.9.5. CLC $\pi$ -resonant load with two-side capacitive tuning

The lossy CLC-2 model is shown in Figure 4.30(a), which contains the series resistances  $R_{C2}$ ,  $R_{C3}$  and  $R_L$ . For  $\alpha_{max}$ =3 and  $\beta$ =1, Figure 4.30(b) shows the load impedance trajectory on the Smith chart versus the  $\alpha$  of  $C_2$  and  $C_3$ , i.e.,  $1 \le \alpha_3 = \alpha_2 = \alpha \le \alpha_{max}$ , and the quality factors, i.e.,  $Q_{C3}^{max} = Q_L = Q_{C2}^{max}$ . Like CLC-1, for smaller quality factors (e.g.,  $Q \le 10$ ), the CLC-2 cannot circle the origin of the Smith chart, reducing its phase shift significantly Figure 4.30(b). Thus, the higher-order reflective loads can achieve a larger phase shift but introduce larger losses. The large passive loss may also reduce the phase shifting range.



Figure 4.29 – (a) A lossy CLC-1 model, (b) the load impedance trajectory on Smith chart with different quality factors for  $\alpha_{max}=3$  and  $\beta=1$ , (c)  $\beta=2$  and (d)  $\beta=3$ .



Figure 4.30 – (a) A lossy CLC-2 model and (b) the lossy CLC-2 load impedance trajectory on Smith chart with the different quality factors for  $\alpha_{max}$ =3 and  $\beta$ =1.

# 4.9.6. Transformer-based multi-resonance reflective load

Our proposed transformer-based multi-resonance reflective load, including the resistances,  $R_{Cv}$ ,  $R_{Lm}$ ,  $R_{Lk}$ , and  $R_{Ct}$ , is shown in Figure 4.31(a) Assuming  $\alpha_{max}$ =3 and  $\alpha_{v}$ = $\alpha_{t}$ , the load impedance trajectory with different quality factors (e.g.,  $Q_{L}$ ,  $Q_{cv}$ ,  $Q_{cv}$ , and  $Q_{ct}$ 

are shown on the Smith chart Figure 4.31(b). Compared with CLC-2, with the same quality factors (e.g.,  $Q_L$ ,  $Q_{cv}$ , max,  $Q_{ct}$  and RTPS with our proposed reflective load can consistently achieve a full-span 360° phase shift as well as lower or similar passive loss. This shows the advantages of our proposed load for high-performance mm-Wave RTPS.



Figure 4.31 - (a) A lossy proposed transformer-based multi-resonance reflective model and (b) the load impedance trajectory on Smith chart with the different quality factors for  $\alpha_{max}$ =3.

## CHAPTER 5. CONCLUSIONS

In this dissertation, we discussed our PA output networks for continuous-mode harmonically-tuned operations, PA linearity improvement technique and reflective loads innovations towards the development of silicon-based PAs and RTPS for 5G emerging applications at mm-Wave frequency ranges.

First, we present three mm-wave continuous-mode PAs for 5G MIMO applications, including a two-stage differential continuous-mode Class-F<sup>-1</sup> PA in 130-nm SiGe process, a single-stage differential continuous-mode hybrid Class-F/F<sup>-1</sup> PA in 45-nm SOI CMOS process, and a two-stage differential continuous-mode hybrid Class-F/F<sup>-1</sup> PA in 45-nmSOI CMOS process. The first PA design covers 19-29.5 GHz, while the other two designs cover 23-41 GHz, all covering multiple 5G bands (24/28/37/39GHz). All the presented PA designs are based on a proposed transformer-based continuous-mode harmonically tuned PA output network. This network provides proper harmonic impedance terminations for the fundamental, second-order, third-order and harmonic impedances, whichsupportscontinuous-modeClass-F<sup>-1</sup> or hybrid Class-F/F<sup>-1</sup> PA operations to achieve ultra-wide bandwidth yet high efficiency. The first PA design achieves awidePsat1-dB bandwidth of 19–29.5 GHz (43.3%) and high peak PAE (43.5%). The second design achieves an ultrawide P<sub>sat</sub> 1-dB bandwidth of 23.5-41 GHz (53.3%) and high peak PAE (46%). Moreover, it achieves 43.4% PAE and 18.6dBm P<sub>sat</sub> at 27 GHz, 40.2% PAE and 18.6dBm P<sub>sat</sub> at 37GHz, and 41.2% PAE and 18.5dBm P<sub>sat</sub> at 39 GHz, respectively. The third PA design also achieves an ultra-wide P<sub>sat</sub> 1-dB bandwidth of 23.5-41 GHz (53.3%) and high peak PAE (43.2%). It achieves 43% PAE and 18.9dBm Psat at 27 GHz, 37% PAE and 19dBm P<sub>sat</sub> at 37 GHz, and 36% PAE and 18.9dBm P<sub>sat</sub> at 39 GHz, respectively. Extensive 64- and 256-QAM modulation tests demonstrate the high PA linearity. Our proposed PA designs outperform the reported mm-wave silicon-based 5G PAs in terms of high efficiency over an ultrawide bandwidth.

Finally, we present a millimeter-wave fully differential transformer-based passive RTPS capable of performing full span 360° continuous phase shift from 58GHz to 64GHz. It consists of two transformer-based 90° couplers and two transformer-based multi-resonance reflective loads to provide 360° phase shift with low loss and ultra-compact chip size. Our proof-of-concept design is implemented in a standard 130nm BiCMOS process with a core area of 480µm-by-340µm. It achieves a wide phase shifting range of 367° and a low insertion loss IL (3.7dB<|IL|<10.2dB) at 62GHz and maintains a full span 360° phase shifting range from 58GHz to 64GHz. Moreover, it supports 360° phase shifting with a constant IL, i.e., |IL|=10, 11, 12dB, at an IL variation of less than 0.74dB at 62GHz. To the best of our knowledge, this design achieves a first-ever full span 360° phase shifting (up to 367°), the lowest IL, the smallest IL variation, and the best figure-of-merit (FoM) of 37.1°/dB among reported 60GHz fully integrated RTPS in silicon.

## **REFERENCES**

- [1] B. Rabet and J. Buckwalter, "A high-efficiency 28GHz Outphasing PA with 23dBm output power using a triaxial balun combiner," in *Proc. IEEE Int. Solid-State Circuits Conf. (ISSCC)*, Feb. 2018, pp. 174-176.
- [2] S. Li, T. Chi, H. T. Nguyen, T. Huang and H. Wang, "A 28GHz Packaged Chireix Transmitter with Direct on-Antenna Outphasing Load Modulation Achieving 56%/38% PA Efficiency at Peak/6dB Back-Off Output Power," in *Proc. IEEE Radio Freq. Integr. Circuits Symp. (RFIC)*, Jun. 2018, pp. 68-71.
- [3] T. Chi, F. Wang, S. Li, M. Huang, J. S. Park and H. Wang, "A 60 GHz On-Chip Linear Radiator with Single-Element 27.9dbm P<sub>sat</sub> and 33.1dbm Peak EIRP Using Multi-Feed Antenna for Direct On-Antenna Power Combining," in *Proc. IEEE Int. Solid-State Circuits Conf. (ISSCC)*, Feb. 2017, pp. 296–297.
- [4] T. Li and H. Wang, "A Millimeter-Wave Fully Integrated Passive Reflection-Type Phase Shifter With Transformer-Based Multi-Resonance Loads for 360° Phase Shifting," *IEEE Trans. on Circuits and Systems I (TCAS-I)*, vol. 65, no. 4, pp. 1406-1419, April 2018.
- [5] D. Zhao and P. Reynaert, "A 60-GHz dual-mode class AB power amplifier in 40-nm CMOS," *IEEE J. Solid-State Circuits (JSSC)*, vol. 48, no. 10, pp. 2323–2337, Oct. 2013.
- [6] T. Li, M. Huang and H. Wang, "A continuous-mode harmonically tuned 19-to-29.5GHz ultra-linear PA supporting 18Gb/s at 18.4% modulation PAE and 43.5% peak PAE," in *Proc. IEEE Int. Solid-State Circuits Conf. (ISSCC)*, Feb. 2018, pp. 410-412.
- [7] S. Hu, F. Wang and H. Wang, "A 28GHz/37GHz/39GHz multiband linear Doherty power amplifier for 5G massive MIMO applications," in *Proc. IEEE Int. Solid-State Circuits Conf. (ISSCC)*, Feb. 2017, pp. 32-33.
- [8] P. Indirayanti and P. Reynaert, "A 32 GHz 20 dBm-PSAT transformer-based Doherty power amplifier for multi-Gb/s 5G applications in 28 nm bulk CMOS," in *Proc. IEEE Radio Freq. Integr. Circuits Symp. (RFIC)*, Jun. 2017, pp. 45-48.

- [9] N. Rostomyan, M. Özen and P. Asbeck, "28 GHz Doherty Power Amplifier in CMOS SOI with 28% Back-Off PAE," *IEEE Microw. and Wireless Compon. Lett.* (MWCL), vol. 28, no. 5, pp. 446-448, May 2018.
- [10] T. Li and H. Wang, "A continuous-mode 23.5-41GHz hybrid Class-F/F-1 power amplifier with 46% peak PAE for 5G massive MIMO applications," in *Proc. IEEE Radio Freq. Integr. Circuits Symp. (RFIC)*, Jun. 2018, pp. 220-230.
- [11] C. Chappidi, X. Wu and K. Sengupta, "Simultaneously Broadband and Back-Off Efficient mm-Wave PAs: A Multi-Port Network Synthesis Approach," *IEEE J. Solid-State Circuits (JSSC)*, vol. 53, no. 9, pp. 2543-2559, Sept. 2018.
- [12] A. Sarkar, F. Aryanfar and B. A. Floyd, "A 28-GHz SiGe BiCMOS PA With 32% Efficiency and 23-dBm Output Power," *IEEE J. Solid-State Circuits (JSSC)*, vol. 52, no. 6, pp. 1680-1686, June 2017.
- [13] A. Sarkar and B. Floyd, "A 28-GHz class-J Power Amplifier with 18-dBm output power and 35% peak PAE in 120-nm SiGe BiCMOS," *in Proc. IEEE Topical Meeting on Silicon Monolithic Integr. Circuits in RF Systems (SiRF)*, Jan. 2014, pp. 71-73.
- [14] T. Hanna, N. Deltimple and S. Frégonèse, "A wideband highly efficient class-J integrated power amplifier for 5G applications," in *Proc. IEEE Int. New Circuits and Systems Conf. (NEWCAS)*, Jun. 2017, pp. 325-328.
- [15] M. Vigilante and P. Reynaert, "A wideband Class-AB Power amplifier with 29–57-GHz AM–PM compensation in 0.9-V 28-nm bulk CMOS," *IEEE J. Solid-State Circuits (JSSC)*, vol. 53, no. 5, pp. 1288-1301, May 2018.
- [16] S. Shakib, H. Park, J. Dunworth, V. Aparin and K. Entesari, "A 28GHz efficient linear power amplifier for 5G phased arrays in 28nm bulk CMOS," in *Proc. IEEE Int. Solid-State Circuits Conf. (ISSCC)*, Feb. 2016, pp. 352-353.
- [17] S. Shakib, M. Elkholy, J. Dunworth, V. Aparin and K. Entesari, "A wideband 28GHz power amplifier supporting 8×100MHz carrier aggregation for 5G in 40nm CMOS," in *Proc. IEEE Int. Solid-State Circuits Conf. (ISSCC)*, Feb. 2017, pp. 44-45.
- [18] Y. Zhang and P. Reynaert, "A high-efficiency linear power amplifier for 28GHz mobile communications in 40nm CMOS," in *Proc. IEEE Radio Freq. Integr.*

- [19] B. Park, et al., "Highly linear mm-wave CMOS power amplifier," *IEEE Trans. on Microw. Theory and Techn. (T-MTT)*, vol. 64, no. 12, pp. 4535-4544, Dec. 2016.
- [20] S. Ali, P. Agarwal, J. Baylon, S. Gopal, L. Renaud and D. Heo, "A 28GHz 41%-PAE linear CMOS power amplifier using a transformer-based AM-PM distortion-correction technique for 5G phased arrays," in *Proc. IEEE Int. Solid-State Circuits Conf. (ISSCC)*, Feb. 2018, pp. 406-408.
- [21] W. Huang, J. Lin, Y. Lin and H. Wang, "A K-band power amplifier with 26-dBm output power and 34% PAE with novel inductance-based neutralization in 90-nm CMOS," in *Proc. IEEE Radio Freq. Integr. Circuits Symp. (RFIC)*, Jun. 2018, pp. 228-231.
- [22] S. Ali, *et al.*, "A 40% PAE frequency-reconfigurable CMOS power amplifier with tunable gate—drain neutralization for 28-GHz 5G radios," *IEEE Trans. on Microw. Theory and Techn. (T-MTT)*, vol. 66, no. 5, pp. 2231-2245, May 2018.
- [23] S. Helmi, J. Chen and S. Mohammadi, "High-Efficiency Microwave and mm-Wave Stacked Cell CMOS SOI Power Amplifiers," *IEEE Trans. Microw. Theory and Techn (T-MTT)*., vol. 64, no. 7, pp. 2025-2038, Jul. 2016.
- [24] H. Dabag, J. Kim, L. Larson, J. Buckwalter, and P. Asbeck, "A 45-GHz SiGe HBT amplifier at greater than 25 % efficiency and 30 mW output power," *in Proc. IEEE Bipolar/BiCMOS Circuits Technol. Meeting (BCTM)*, Oct. 2011, pp. 25-28.
- [25] A. Agah, H. Dabag, B. Hanafi, P. Asbeck, L. Larson, and J. Buckwalter "A 34% PAE, 18.6 dBm 42-45 GHz stacked power amplifier in 45 nm SOI CMOS," *in Proc. IEEE Radio Fqreq. Integr. Circuits Symp. (RFIC)*, Jun. 2012, pp. 57-60.
- [26] H. Wang, *et al.*, "Power Amplifiers Performance Survey 2000-Present," [Online]. Available: <a href="https://gems.ece.gatech.edu/PA\_survey.html">https://gems.ece.gatech.edu/PA\_survey.html</a>.
- [27] S. Cripps, "RF Power Amplifiers for Wireless Communications," Artech House, 2006.
- [28] H. Wang and K. Sengupta, "RF and mm-Wave Power Generation in Silicon," Elsevier, 2015.

- [29] H. Wang, C. Sideris and A. Hajimiri, "A CMOS Broadband Power Amplifier With a Transformer-Based High-Order Output Matching Network," *IEEE J. Solid-State Circuits (JSSC)*, vol. 45, no. 12, pp. 2709-2722, Dec. 2010.
- [30] H. Wang, C. Sideris and A. Hajimiri, "A 5.2-to-13GHz class-AB CMOS power amplifier with a 25.2dBm peak output power at 21.6% PAE," in *Proc. IEEE Int. Solid-State Circuits Conf. (ISSCC)*, Feb. 2010, pp. 44-45.
- [31] D. Jung, H. Zhao and H. Wang, "A Highly Linear Doherty Power Amplifier with Multigated Transistors Supporting 80MSymbol/s 256-QAM," in *Proc. IEEE Int. Microw. Symp (IMS), Jun.* 2018, pp. 1222-1225.
- [32] K. Datta and H. Hashemi, "Performance limits, design and implementation of mm-wave SiGe HBT Class-E and stacked Class-E power amplifiers," *IEEE J. Solid-State Circuits (JSSC)*, vol. 49, no. 10, pp. 2150-2171, Oct. 2014.
- [33] K. Datta and H. Hashemi, "Waveform engineering in a mm-wave stacked-HBT switching power amplifier," in *Proc. IEEE Radio Freq. Integr. Circuits Symp. (RFIC)*, Jan. 2017, pp. 216-219.
- [34] A. Chakrabarti and H. Krishnaswamy, "High-Power High-Efficiency Class-E-Like Stacked mmWave PAs in SOI and Bulk CMOS: Theory and Implementation," *IEEE Trans. on Microw. Theory and Techn. (T-MTT)*, vol. 62, no. 8, pp. 1686-1704, Aug. 2014.
- [35] A. Sarkar and B. A. Floyd, "A 28-GHz harmonic-tuned power amplifier in 130-nm SiGe BiCMOS," *IEEE Trans. Microw. Theory and Techn. (T-MTT)*, vol. 65, no. 2, pp. 522-535, Feb. 2017.
- [36] S. Ali, P. Agarwal, S. Mirabbasi and D. Heo, "A 42–46.4% PAE continuous class-F power amplifier with C<sub>gd</sub> neutralization at 26–34 GHz in 65 nm CMOS for 5G applications," in *Proc. IEEE Radio Freq. Integr. Circuits Symp. (RFIC)*, Jun. 2017, pp. 212-215.
- [37] S. Mortazavi and K. Koh, "A Class F-1/F 24-to-31GHz power amplifier with 40.7% peak PAE, 15dBm OP<sub>1dB</sub>, and 50mW P<sub>sat</sub> in 0.13μm SiGe BiCMOS," in *Proc. IEEE Int. Solid-State Circuits Conf. (ISSCC)*, Feb 2014, pp. 254-255.
- [38] S. Mortazavi and K. Koh, "A 38 GHz inverse Class-F power amplifier with 38.5% peak PAE, 16.5 dB gain, and 50mW P<sub>sat</sub> in 0.13-µm SiGe BiCMOS," *in Proc. IEEE*

- Radio Freq. Integr. Circuits Symp. (RFIC), Jun. 2015, pp. 211-214.
- [39] S. Mortazavi and K. Koh, "Integrated inverse Class-F silicon power amplifiers for high power efficiency at microwave and mm-wave," *IEEE J. Solid-State Circuits (JSSC)*, vol. 51, no. 10, pp. 2420-2434, Oct. 2016.
- [40] F. Raab, "Class-F power amplifiers with maximally flat waveforms," *IEEE Trans. Microw. Theory Techn. (T-MTT)*, vol. 45, no. 11, pp. 2007-2012, Nov. 1997.
- [41] V. Carrubba, *et al.*, "On the extension of the continuous Class-F mode power amplifier," *IEEE Trans Microw. Theory and Techn. (T-MTT)*, vol. 59, no. 5, pp. 1294-1303, May 2011.
- [42] N. Tuffy, L. Guan, A. Zhu, and T. J. Brazil, "A simplified broadband design methodology for linearized high-efficiency continuous class-F power amplifiers," *IEEE Trans. Microw. Theory Techn.* (T-MTT), vol. 60, no. 6, pp. 1952–1963, Jun. 2012.
- [43] V. Carrubba, *et al.*, "The continuous inverse Class-F mode with resistive second-harmonic impedance," *IEEE Trans. Microw. Theory and Techn. (T-MTT)*, vol. 60, no. 6, pp. 1928-1936, June 2012.
- [44] K. Chen and D. Peroulis, "Design of broadband highly efficient harmonic-tuned power amplifier using in-band continuous Class-F<sup>-1</sup>/F mode Transferring," *IEEE Trans. on Microw. Theory and Techn. (T-MTT)*, vol. 60, no. 12, pp. 4107-4116, Dec. 2012.
- [45] I. Ju and J. D. Cressler, "A Highly Efficient X-Band Inverse Class-F SiGe HBT Cascode Power Amplifier With Harmonic-Tuned Wilkinson Power Combiner," *IEEE Trans. on Circuits and Systems II (TCAS-II)*, vol. 65, no. 11, pp. 1609-1613, Nov. 2018.
- [46] I. Ju and J. D. Cressler, "An X-band inverse class-F SiGe HBT cascode power amplifier with harmonic-tuned output transformer," in *Proc. IEEE Radio Freq. Integr. Circuits Symp. (RFIC)*, Jun. 2017, pp. 390-393.
- [47] I. Aoki, S. D. Kee, D. B. Rutledge and A. Hajimiri, "Distributed active transformer-a new power-combining and impedance-transformation technique," *IEEE Trans. on Microw. Theory and Techn. (T-MTT)*, vol. 50, no. 1, pp. 316-331, Jan. 2002.

- [48] I. Aoki et al., "A Fully Integrated Quad-Band GSM/GPRS CMOS Power Amplifier," in *Proc. IEEE Int. Solid-State Circuits Conf. (ISSCC)*, Feb. 2010, pp. 570-636.
- [49] I. Aoki et al., "A Fully-Integrated Quad-Band GSM/GPRS CMOS Power Amplifier," *IEEE J. Solid-State Circuits (JSSC)*), vol. 43, no. 12, pp. 2747-2758, Dec. 2008.
- [50] R. Bhat, A. Chakrabarti and H. Krishnaswamy, "Large-Scale Power Combining and Mixed-Signal Linearizing Architectures for Watt-Class mmWave CMOS Power Amplifiers," *IEEE Trans. on Microw. Theory and Techn. (T-MTT)*, vol. 63, no. 2, pp. 703-718, Feb. 2015.
- [51] W. Tai, L. R. Carley and D. S. Ricketts, "A 0.7W fully integrated 42GHz power amplifier with 10% PAE in 0.13μm SiGe BiCMOS," in *Proc. IEEE Int. Solid-State Circuits Conf. (ISSCC)*, Feb. 2010, pp. 142-143.
- [52] S. Helmi, J. Chen and S. Mohammadi, "High-Efficiency Microwave and mm-Wave Stacked Cell CMOS SOI Power Amplifiers," *IEEE Trans. on Microw. Theory and Techn. (T-MTT)*, vol. 64, no. 7, pp. 2025-2038, July 2016.
- [53] S. Shakib, J. Dunworth, V. Aparin and K. Entesari, "mmWave CMOS Power Amplifiers for 5G Cellular Communication," *IEEE Communications Magazine*, vol. 57, no. 1, pp. 98–105, Jan. 2019.
- [54] H. Park et al., "A High Efficiency 39GHz CMOS Cascode Power Amplifier for 5G Applications," *in Proc. IEEE Radio Freq. Integr. Circuits Symp. (RFIC) Dig.*, Jun. 2019, pp. 179-182.
- [55] W. Huang and H. Wang, "An Inductive-Neutralized 26-dBm K-/Ka- Band Power Amplifier With 34% PAE in 90-nm CMOS," *IEEE Trans. Microw. Theory Techn.*, vol. 67, no. 11, pp. 4427–4440, Nov. 2019.
- [56] F. Wang and H. Wang, "24.1 A 24-to-30GHz Watt-Level Broadband Linear Doherty Power Amplifier with Multi-Primary Distributed-Active-Transformer Power-Combining Supporting 5G NR FR2 64-QAM with >19dBm Average Pout and >19% Average PAE," in IEEE Int. Solid-State Circuits Conf. (ISSCC) Dig. Tech. Papers, Feb. 2020, pp. 362–364.
- [57] S. Li, M. Huang, D. Jung, T. Huang, and H. Wang, "A 28GHz current-mode inverse-outphasing transmitter achieving 40%/31% PA efficiency at Psat/6dB PBO and supporting 15Gbit/s 64QAM for 5G communication," in IEEE Int. Solid-State

- [58] S. Li, T. Chi, D. Jung, T. Huang, M. Huang, and H. Wang, "An E-band high-linearity antenna-LNA frontend with 4.8dB NF and 2.2dBm IIP3 exploiting multi-feed on-antenna noise-canceling and gm-boosting," in *IEEE Int. Solid-State Circuits Conf.* (ISSCC) Dig. Tech. Papers, Feb. 2020.
- [59] S. Li, T. Chi, T. Huang, H. T. Nguyen, and H. Wang, "A 28GHz Packaged Chireix Transmitter with Direct On-Antenna Outphasing Load Modulation Achieving 56%/38% PA Efficiency at Peak/6dB Back-Off Output Power," in *Proc. IEEE Radio Frequency Integrated Circuits (RFIC)*, May 2018.
- [60] T. Li, M. Huang and H. Wang, "A continuous-mode harmonically tuned 19-to-29.5GHz ultra-linear PA supporting 18Gb/s at 18.4% modulation PAE and 43.5% peak PAE," *in IEEE Int. Solid-State Circuits Conf. (ISSCC) Dig. Tech. Papers*, Feb. 2018, pp. 410–412.
- [61] S. Li, T. Chi, J. Park, H. T. Nguyen and H. Wang, "A 28-GHz flip-chip packaged Chireix transmitter with on-antenna outphasing active load modulation," *IEEE J. Solid-State Circuits*, vol. 54, no. 5, pp. 1243-1253, May 2019.
- [62] T. Li, M. Huang and H. Wang, "Millimeter-Wave Continuous-Mode Power Amplifier for 5G MIMO Applications," *IEEE Trans. Microw. Theory Techn.*, vol. 67, no. 7, pp. 3088–3098, Jul. 2019.
- [63] D. Zhao and P. Reynaert, "A 60-GHz dual-mode class AB power amplifier in 40-nm CMOS," *IEEE J. Solid-State Circuits*, vol. 48, no. 10, pp. 2323–2337, Oct. 2013.
- [64] H. T. Nguyen, D. Jung and H. Wang, "A 60GHz CMOS Power Amplifier with Cascaded Asymmetric Distributed-Active-Transformer Achieving Watt-Level Peak Output Power with 20.8% PAE and Supporting 2Gsym/s 64-QAM Modulation," in IEEE Int. Solid-State Circuits Conf. (ISSCC) Dig. Tech. Papers, Feb. 2019, pp. 90-92.
- [65] H. T. Nguyen, T. Chi, S. Li and H. Wang, "A 62-to-68GHz linear 6Gb/s 64QAM CMOS Doherty radiator with 27.5%/20.1% PAE at peak/6dB-back-off output power leveraging high-efficiency multi-feed antenna-based active load modulation," *in IEEE Int. Solid-State Circuits Conf. (ISSCC) Dig. Tech. Papers*, Feb. 2018, pp. 402-404.

- [66] C. Chou, Y. Hsiao, Y. Wu, Y. Lin, C. Wu and H. Wang, "Design of a V-Band 20-dBm Wideband Power Amplifier Using Transformer-Based Radial Power Combining in 90-nm CMOS," *IEEE Trans. Microw. Theory Techn.*, vol. 64, no. 12, pp. 4545-4560, Dec. 2016.
- [67] Y. Jen, J. Tsai, T. Huang and H. Wang, "Design and Analysis of a 55–71-GHz Compact and Broadband Distributed Active Transformer Power Amplifier in 90-nm CMOS Process," *IEEE Trans. Microw. Theory Techn.*, vol. 57, no. 7, pp. 1637-1646, July 2009.
- [68] T. Chi, F. Wang, S. Li, M. Huang, J. S. Park, and H. Wang, "A 60 GHz on-chip linear radiator with single-element 27.9 dBm Psat and 33.1 dBm peak EIRP using multifeed antenna for direct on-antenna power combining," *in IEEE Int. Solid-State Circuits Conf. (ISSCC) Dig. Tech. Papers*, Feb. 2017, pp. 296–297.
- [69] S. V. Thyagarajan, A. M. Niknejad and C. D. Hull, "A 60 GHz Drain-Source Neutralized Wideband Linear Power Amplifier in 28 nm CMOS," *IEEE Trans. Circuits Syst.*, *I: Regular Papers*, vol. 61, no. 8, pp. 2253–2262, Aug. 2014.
- [70] Y. Gong *et al.*, "A 28-GHz switchless, SiGe bidirectional amplifier using neutralized common-emitter differential pair," in *IEEE Microwave and Wireless Components Letters*, vol. 28, no. 8, pp. 717-719, Aug. 2018.
- [71] Y. Gong *et al.*, "A broadband logarithmic power detector using 130 nm SiGe BiCMOS technology," in *Proc. 2019 IEEE BiCMOS Compound Semiconductor. Integrated Circuits and Technology Symposium*, Nashville, TN, USA, 2019, pp. 1-4.
- [72] Y. Gong et al., "A compact, high-power, 60 GHz SPDT switch using shunt-series SiGe PIN diodes," in *Proc. 2019 IEEE Radio Frequency Integrated Circuits Symposium*, Boston, MA, USA, 2019, pp. 15-18.
- [73] C. R. Chappidi and K. Sengupta, "20.2 A frequency-reconfigurable mm-Wave power amplifier with active-impedance synthesis in an asymmetrical non-isolated combiner," *in IEEE Int. Solid-State Circuits Conf. (ISSCC) Dig. Tech. Papers*, Feb. 2016, pp. 344–345.
- [74] J. Chen and A. M. Niknejad, "A compact 1 V 18.6 dBm 60 GHz power amplifier in 65 nm CMOS," in *IEEE Int. Solid-State Circuits Conf. (ISSCC) Dig. Tech. Papers*, Feb. 2011, pp. 432–433.

- [75] M. Varonen, M. Karkkainen, M. Kantanen, and K. A. I. Halonen, "Millimeter-wave integrated circuits in 65-nm CMOS," *IEEE J. Solid-State Circuits*, vol. 43, no. 9, pp. 1991–2002, Sep. 2008.
- [76] V. Vidojkovic et al., "A low-power radio chipset in 40 nm LP CMOS with beamforming for 60 GHz high-data-rate wireless communication," in *IEEE Int. Solid-State Circuits Conf. (ISSCC) Dig. Tech. Papers*, Feb. 2013, pp. 236–237.
- [77] M. Bassi, J. Zhao, A. Bevilacqua, A. Ghilioni, A. Mazzanti, F. Svelto, "A 40–67 GHz power amplifier with 13 dBm PSAT and 16% PAE in 28 nm CMOS LP," *IEEE J. Solid-State Circuits*, vol. 50, no. 7, pp. 1618–1628, Jul. 2015.
- [78] H. T. Nguyen and H. Wang, "A Coupler-Based Differential Doherty Power Amplifier with Built-In Baluns for High Mm-Wave Linear-Yet-Efficient Gbit/s Amplifications," in Proc. IEEE Radio Freq. Integr. Circuits Symp. (RFIC) Dig., Jun. 2019, pp. 195– 198.
- [79] W. Sun and C. Kuo, "A 19.1% PAE, 22.4-dBm 53-GHz Parallel Power Combining Power Amplifier with Stacked-FET Techniques in 90-nm CMOS," *in Proc IEEE MTT-S Int. Microw. Symp. (IMS) Dig.*, Jun. 2019, pp. 327–330.
- [80] Y. Chang, Y. Wang, C. Chen, Y. Wu and H. Wang, "A V-Band Power Amplifier With 23.7-dBm Output Power, 22.1% PAE, and 29.7-dB Gain in 65-nm CMOS Technology," *IEEE Trans. Microw. Theory Techn.*, vol. 67, no. 11, pp. 4418–4426, Nov. 2019.
- [81] Chu et al., "A Dual-Mode V-Band 2/4-Way Non-Uniform Power-Combining PA with +17.9-dBm Psat and 26.5-% PAE in 16-nm FinFET CMOS," in Proc. IEEE Radio Freq. Integr. Circuits Symp. (RFIC) Dig., Jun. 2020, pp. 1–4.
- [82] Y. Gong and J. Cressler, "A Balanced Power Amplifier with Asymmetric Coupled-Line Couplers and Wilkinson Baluns in a 90nm aSiGe BiCMOS Technology," in Proc IEEE MTT-S Int. Microw. Symp. (IMS) Dig., Jun. 2010, pp. 1–4.
- [83] T. Li and H. Wang, "A Millimeter-Wave Fully Integrated Passive Reflection-Type Phase Shifter With Transformer-Based Multi-Resonance Loads for 360° Phase Shifting," *IEEE Trans. Circuits Syst., I: Regular Papers*,, vol. 65, no. 4, pp. 1406–1419, Apr. 2018.
- [84] K.-J. Koh and G. M. Rebeiz, "An X- and Ku-band 8-element phased-array receiver

- in 0.18-µm SiGe BiCMOS technology ", *IEEE J. Solid-State Circuits*, vol. 43, no. 6, pp. 1360–1371, Jun. 2008.
- [85] S. Li, T. Chi, J. Park, H. T. Nguyen and H. Wang, "A 28-GHz flip-chip packaged Chireix transmitter with on-antenna outphasing active load modulation," *IEEE J. Solid-State Circuits*, vol. 54, no. 5, pp. 1243–1253, May 2019.
- [86] T. Li, J. S. Park and H. Wang, "A 2-24-GHz 360° Full-Span Differential Vector Modulator Phase Rotator With Transformer-Based Poly-Phase Quadrature Network," *IEEE Trans. on Very Large Scale Integration (VLSI) Systems*.
- [87] C. Wang and L. E. Larson, "A capacitance-compensation technique for improved linearity in CMOS class-AB power amplifiers", *IEEE J. Solid-State Circuits*, vol. 39, no. 11, pp. 1927–1937, Nov. 2004.
- [88] T. Joo, B. Koo and S. Hong, "A WLAN RF CMOS PA with large-signal MGTR method", *IEEE Trans. Microw. Theory Techn.*, vol. 61, no. 3, pp. 1272-1279, Mar. 2013.
- [89] D. Jung, H. Zhao and H. Wang, "A CMOS Highly Linear Doherty Power Amplifier With Multigated Transistors," *IEEE Trans. Microw. Theory Techn.*, vol. 67, no. 5, pp. 1883–1891, May 2019.
- [90] B. Park et al., "Highly Linear mm-Wave CMOS Power Amplifier," *IEEE Trans. Microw. Theory Techn.*, vol. 64, no. 12, pp. 4535–4544, Dec. 2016.
- [91] M. Abdulaziz, H. V. Hünerli, K. Buisman and C. Fager, "Improvement of AM-PM in a 33-GHz CMOS SOI Power Amplifier Using pMOS Neutralization," *IEEE Microw. Wireless Compon. Lett.*, vol. 29, no. 12, pp. 798-801, Dec. 2019.
- [92] S. Kulkarni and P. Reynaert, "A 60-GHz Power Amplifier With AM-PM Distortion Cancellation in 40-nm CMOS," *IEEE Trans. Microw. Theory Techn.*, vol. 64, no. 7, pp. 2284-2291, Jul. 2016.
- [93] K. Y. Son, C. Park and S. Hong, "A 1.8-GHz CMOS Power Amplifier Using Stacked nMOS and pMOS Structures for High-Voltage Operation," *IEEE Trans. Microw. Theory Techn.*, vol. 57, no. 11, pp. 2652-2660, Nov. 2009.
- [94] J. Xia, X. Fang, A. B. Ayed and S. Boumaiza, "Millimeter Wave SOI-CMOS Power

- Amplifier With Enhanced AM-PM Characteristic," *IEEE Access*, vol. 8, pp. 8861-8875, 2020.
- [95] S. Golara, S. Moloudi and A. A. Abidi, "Processes of AM-PM Distortion in Large-Signal Single-FET Amplifiers," *IEEE Trans. Circuits Syst., I: Regular Papers*, vol. 64, no. 2, pp. 245-260, Feb. 2017.
- [96] S. Cripps, "RF Power Amplifiers for Wireless Communications," 2<sup>nd</sup> edition, Artech House, 2006.
- [97] H. Hashemi and S. Raman, Eds., "Mm-Wave Silicon Power Amplifiers and Transmitters. Cambridge," Cambridge University Press, 2016.
- [98] B. Razavi, "Design of Analog CMOS Integrated Circuits," 2<sup>nd</sup> edition, McGraw-Hill, 2016.
- [99] I. Aoki, S. D. Kee, D. B. Rutledge, and A. Hajimiri, "Distributed active transformer— A new power-combining and impedance-transformation technique," *IEEE Trans. Microw. Theory Techn.*, vol. 50, no. 1, pp. 316–331, Jan. 2002.
- [100] T. Chi, F. Wang, S. Li, M. Huang, J. S. Park and H. Wang, "17.3 A 60GHz on-chip linear radiator with single-element 27.9dBm Psat and 33.1dBm peak EIRP using multifeed antenna for direct on-antenna power combining," *in Proc. IEEE Int. Solid-State Circ. Conf. (ISSCC) Dig. Tech. Papers*, Feb. 2017, pp. 296-297.
- [101] D. Chowdhury, P. Reynaert and A. M. Niknejad, "A 60GHz 1V + 12.3dBm Transformer-Coupled Wideband PA in 90nm CMOS," *in Proc. IEEE Int. Solid-State Circ. Conf. (ISSCC) Dig. Tech. Papers*, Feb. 2008, pp. 560-635.
- [102] K. Raczkowski, S. Thijs, W. De Raedt, B. Nauwelaers and P. Wambacq, "50-to-67GHz ESD-protected power amplifiers in digital 45nm LP CMOS," *in Proc. IEEE Int. Solid-State Circ. Conf. (ISSCC) Dig. Tech. Papers*, Feb. 2009, pp. 382-383.
- [103] T. LaRocca, J. Y. Liu and M. F. Chang, "60 GHz CMOS Amplifiers Using Transformer-Coupling and Artificial Dielectric Differential Transmission Lines for Compact Design," *IEEE J. Solid-State Circ. (JSSC)*, vol. 44, no. 5, pp. 1425-1435, May 2009.
- [104]O. El-Aassar and G. M. Rebeiz, "A DC-to-108-GHz CMOS SOI Distributed Power

- Amplifier and Modulator Driver Leveraging Multi-Drive Complementary Stacked Cells," *IEEE J. Solid-State Circ. (JSSC)*, vol. 54, no. 12, pp. 3437-3451, Dec. 2019.
- [105] K. Fang and J. F. Buckwalter, "Efficient Linear Millimeter-Wave Distributed Transceivers in CMOS SOI," *IEEE Trans. Microw. Theory Techn. (T-MTT)*, vol. 67, no. 1, pp. 295-307, Jan. 2019.
- [106] H. T. Nguyen, S. Li and H. Wang, "4.6 A mm-Wave 3-Way Linear Doherty Radiator with Multi Antenna Coupling and On-Antenna Current-Scaling Series Combiner for Deep Power Back-Off Efficiency Enhancement," *in Proc. IEEE Int. Solid-State Circ. Conf. (ISSCC) Dig. Tech. Papers*, Feb. 2019, pp. 84-86.
- [107] J. Y. Liu, R. Berenguer and M. F. Chang, "Millimeter-Wave Self-Healing Power Amplifier With Adaptive Amplitude and Phase Linearization in 65-nm CMOS," *IEEE Trans. Microw. Theory Techn. (T-MTT)*, vol. 60, no. 5, pp. 1342-1352, May 2012.
- [108] D. Zhao and P. Reynaert, "A 60-GHz Dual-Mode Class AB Power Amplifier in 40-nm CMOS," *IEEE J. Solid-State Circ. (JSSC)*, vol. 48, no. 10, pp. 2323-2337, Oct. 2013.
- [109] S. Kulkarni and P. Reynaert, "A 60-GHz Power Amplifier With AM-PM Distortion Cancellation in 40-nm CMOS," *IEEE Trans. Microw. Theory Techn. (T-MTT)*, vol. 64, no. 7, pp. 2284-2291, Jul. 2016.
- [110] C. Chou, Y. Hsiao, Y. Wu, Y. Lin, C. Wu and H. Wang, "Design of a V-Band 20-dBm Wideband Power Amplifier Using Transformer-Based Radial Power Combining in 90-nm CMOS," *IEEE Trans. Microw. Theory Techn. (T-MTT)*, vol. 64, no. 12, pp. 4545-4560, Dec. 2016.
- [111] S. V. Thyagarajan, A. M. Niknejad and C. D. Hull, "A 60 GHz Drain-Source Neutralized Wideband Linear Power Amplifier in 28 nm CMOS," *IEEE Trans. Circ. and System I (TCAS-1)*, vol. 61, no. 8, pp. 2253-2262, Aug. 2014.
- [112] Y. Sun, G. G. Fischer and J. C. Scheytt, "A Compact Linear 60-GHz PA With 29.2% PAE Operating at Weak Avalanche Area in SiGe," *IEEE Trans. Microw. Theory Techn.* (*T-MTT*), vol. 60, no. 8, pp. 2581-2589, Aug. 2012.
- [113] C. R. Chappidi and K. Sengupta, "Frequency Reconfigurable Mm-Wave Power Amplifier With Active Impedance Synthesis in an Asymmetrical Non-Isolated

- Combiner: Analysis and Design," *IEEE J. Solid-State Circ. (JSSC)*, vol. 52, no. 8, pp. 1990-2008, Aug. 2017.
- [114] Y. Zhao and J. R. Long, "A Wideband, Dual-Path, Millimeter-Wave Power Amplifier With 20 dBm Output Power and PAE Above 15% in 130 nm SiGe-BiCMOS," *IEEE J. Solid-State Circ. (JSSC)*, vol. 47, no. 9, pp. 1981-1997, Sept. 2012.
- [115] Y. Chang, Y. Wang, C. Chen, Y. Wu and H. Wang, "A V-Band Power Amplifier With 23.7-dBm Output Power, 22.1% PAE, and 29.7-dB Gain in 65-nm CMOS Technology," *IEEE Trans. Microw. Theory Techn. (T-MTT)*, vol. 67, no. 11, pp. 4418-4426, Nov. 2019.
- [116] D. Zhao, S. Kulkarni and P. Reynaert, "A 60-GHz Outphasing Transmitter in 40-nm CMOS," *IEEE J. Solid-State Circ. (JSSC)*, vol. 47, no. 12, pp. 3172-3183, Dec. 2012.
- [117] K. Khalaf et al., "Digitally Modulated CMOS Polar Transmitters for Highly-Efficient Mm-Wave Wireless Communication," *IEEE J. Solid-State Circ. (JSSC)*, vol. 51, no. 7, pp. 1579-1592, Jul. 2016.
- [118] J. Chen et al., "A digitally modulated mm-Wave cartesian beamforming transmitter with quadrature spatial combining," *in Proc. IEEE Int. Solid-State Circ. Conf. (ISSCC) Dig. Tech. Papers*, Feb. 2013, pp. 232-233.
- [119] D. Zhao and P. Reynaert, "A 40-nm CMOS E-Band 4-Way Power Amplifier With Neutralized Bootstrapped Cascode Amplifier and Optimum Passive Circuits," *IEEE Trans. Microw. Theory Techn. (T-MTT)*, vol. 63, no. 12, pp. 4083-4089, Dec. 2015.
- [120] J. Chen and A. M. Niknejad, "A compact 1V 18.6dBm 60GHz power amplifier in 65nm CMOS," in Proc. IEEE Int. Solid-State Circ. Conf. (ISSCC) Dig. Tech. Papers, Feb. 2011, pp. 432-433.
- [121] V. Vidojkovic et al., "A low-power radio chipset in 40nm LP CMOS with beamforming for 60GHz high-data-rate wireless communication," *in Proc. IEEE Int. Solid-State Circ. Conf. (ISSCC) Dig. Tech. Papers*, Feb. 2013, pp. 236-237.
- [122] O. T. Ogunnika and A. Valdes-Garcia, "A 60GHz Class-E tuned power amplifier with PAE >25% in 32nm SOI CMOS," *in Proc. IEEE Radio Freq. Integr. Circuits Symp. (RFIC)*, Jun. 2012, pp. 65-68.

- [123] S. Helmi, J. Chen and S. Mohammadi, "High-Efficiency Microwave and mm-Wave Stacked Cell CMOS SOI Power Amplifiers," *IEEE Trans. Microw. Theory Techn. (T-MTT)*, vol. 64, no. 7, pp. 2025-2038, Jul. 2016.
- [124] M. Bassi, J. Zhao, A. Bevilacqua, A. Ghilioni, A. Mazzanti and F. Svelto, "A 40–67 GHz Power Amplifier With 13 dBm Psat and 16% PAE in 28 nm CMOS LP," *IEEE J. Solid-State Circ. (JSSC)*, vol. 50, no. 7, pp. 1618-1628, Jul. 2015.
- [125] W. Sun and C. Kuo, "A 19.1% PAE, 22.4-dBm 53-GHz Parallel Power Combining Power Amplifier with Stacked-FET Techniques in 90-nm CMOS," *in Proc. IEEE Intl. Microw. Symp. (IMS)*, Jun. 2019, pp. 327-330.
- [126] A. Siligaris et al., "A 60 GHz Power Amplifier With 14.5 dBm Saturation Power and 25% Peak PAE in CMOS 65 nm SOI," *IEEE J. Solid-State Circ. (JSSC)*, vol. 45, no. 7, pp. 1286-1294, Jul. 2010.
- [127] S. N. Ali, P. Agarwal, S. Gopal and D. Heo, "Transformer-Based Predistortion Linearizer for High Linearity and High Modulation Efficiency in mm-Wave 5G CMOS Power Amplifiers," *IEEE Trans. Microw. Theory Techn. (T-MTT)*, vol. 67, no. 7, pp. 3074-3087, Jul. 2019.
- [128] S. Jeon, et al., "A Scalable 6-to-18GHz Concurrent Dual-Band Quad-Beam Phased-Array Receiver in CMOS", in *IEEE ISSCC Dig. Tech. Papers*, Feb. 2008, pp. 186–605.
- [129] J. Park, *et al.*, "A K-band 5-bit digital linear phase rotator with folded transformer based ultra-compact quadrature generation", in *Proc. IEEE RF Integr. Circ. Symp.*, Jun. 2014, pp. 75–78.
- [130] K. J. Koh, *et al.*, "An X- and Ku-Band 8-Element Phased-Array Receiver in 0.18-µm SiGe BiCMOS Technology," *IEEE J. Solid-State Circ.*, vol. 43, no. 6, pp. 1360–1371, Jun. 2008.
- [131] T. Li, J. S. Park and H. Wang, "A 2-24-GHz 360° Full-Span Differential Vector Modulator Phase Rotator With Transformer-Based Poly-Phase Quadrature Network," *IEEE Trans. on Very Large Scale Integration (VLSI) Systems, Sep. 2020.*
- [132] K. Koh, *et al.*, "A Millimeter-Wave (40–45 GHz) 16-Element Phased-Array Transmitter in 0.18-μm SiGe BiCMOS Technology," *IEEE J. Solid-State Circ.*, vol. 44, no. 5, pp. 1498–1509, May 2009.

- [133] M. Huang, *et al.*, "An All-Passive Negative Feedback Network for Broadband and Wide Field-of-View Self-Steering Beam-Forming with Zero DC Power Consumption," *IEEE J. Solid-State Circ.*, vol. 52, no. 5, pp. 1260–1273, May 2017.
- [134] A. Natarajan, *et al.*, "A Fully-Integrated 16-Element Phased-Array Receiver in SiGe BiCMOS for 60-GHz Communications," *IEEE J. Solid-State Circ.*, vol. 46, no. 5, pp. 1059–1075, May 2011.
- [135] M. Tabesh, et al., "A 65 nm CMOS 4-Element Sub-34 mW/Element 60 GHz Phased-Array Transceiver," *IEEE J. Solid-State Circ.*, vol. 46, no. 12, pp. 3018–3032, Dec. 2011.
- [136] Y. Gong *et al.*, "A bi-directional, X-band 6-Bit phase shifter for phased array antennas using an active DPDT switch," in *Proc. 2017 IEEE Radio Frequency. Integrated. Circuits Symposium*, Honolulu, HI, 2017, pp. 288-291.
- [137] M. Huang, *et al.*, "A 5GHz All-Passive Negative Feedback Network for RF Front-End Self-Steering Beam-Forming with Zero DC Power Consumption," in *Proc. IEEE RF Integr. Circ. Symp.*, May 2016, pp. 91–94.
- [138] T. Chi, et al., "A 60GHz On-Chip Linear Radiator with Single-Element 27.9dBm Psat and 33.1dBm Peak EIRP Using Multi-Feed Antenna for Direct On-Antenna Power Combining," in *Proc. IEEE Int. Solid-State Circ. Conf.*, Feb. 2017, pp. 296–297.
- [139] T. Chi, et al., "A Packaged 90-to-300GHz Transmitter and 115-to-325GHz Coherent Receiver in CMOS for Full-Band Continuous-Wave Mm-Wave Hyperspectral Imaging," in *Proc. IEEE Int. Solid-State Circ. Conf.*, Feb. 2017, pp. 304–305.
- [140] T. Chi, et al., "A Multi-Phase Sub-Harmonic Injection Locking Technique for Bandwidth Extension in Silicon-Based THz Signal Generation," *IEEE J. Solid-State Circ.*, vol. 50, no. 5, pp. 1861–1873, Aug. 2015.
- [141] H. Wang, et al., "A Wideband CMOS Linear Digital Phase Rotator", in *Proc. IEEE Int. Custom Integr. Circ. Conf.*, Sep. 2007, pp. 671–674.
- [142] K. Koh, et al., "A 0.13-μm CMOS Phase Shifters for X-, Ku-, and K-band Phased Arrays", IEEE J. Solid- State Circ., vol. 42, No. 11, pp. 2535–2546, Nov. 2007.
- [143] H. Wang, et al., "A Broadband Self-Healing Phase Synthesis Scheme," in Proc. IEEE

- RF Integr. Circ. Symp., Jun. 2011, pp. 1–4.
- [144] T. Li., et al., "A 2-24GHz 360-Degree Full-Span Differential Vector Modulator Phase Rotator with Transformer-Based Poly-Phase Quadrature Network," in *Proc. IEEE Int. Custom Integr. Circ. Conf.*, Sep. 2015, pp. 1–4.
- [145]H. Hayashi, et al., "An MMIC Active Phase Shifter Using a Variable Resonant Circuit," *IEEE Trans. Microw. Theory Techn.*, No.10, pp. 2021–2026, Oct. 1999.
- [146] D. Pozar, *Microwave engineering*, 4<sup>th</sup> edition, Wiley, 2011.
- [147] J. Lee, *et al.*, "60 GHz switched-line-type phase shifter using body-floating switches in 0.13 μm CMOS technology", *Electron. Lett.*, vol. 48, no. 7, pp. 376–378, Apr. 2012.
- [148] T. Hancock, *et al.*, "A 12-GHz SiGe phase shifter with integrated LNA", *IEEE Trans. Microw. Theory Techn.*, vol. 53, no. 3, pp. 977–983, Mar. 2005.
- [149] T. Chu, *et al.*, "A 4-Channel UWB Beam-Former in 0.13μm CMOS using a Path-Sharing True-Time-Delay Architecture," in *Proc. IEEE Int. Solid-State Circ. Conf.*, Feb. 2007, pp. 426–428.
- [150] A. Lane, et al., "A Miniature 4-bit Octave Bandwidth Switched Filter Phase Shifter GaAs MMIC", in *Proc. Eur. Microw. Conf.*, Sep. 1978, pp. 437–442.
- [151] L. Wang, et al. "Highly Linear Ku-Band SiGe PIN Diode Phase Shifter in Standard SiGe BiCMOS Process," *IEEE Microw. Wireless Compon. Lett.*, vol. 20, no. 1, pp. 37–39, Jan. 2010.
- [152]B. Henoch, et al., "A 360° reflection-type diode phase modulator," *IEEE Trans. Microw. Theory Techn.*, vol. 19, pp. 103–105, Jan. 1971.
- [153]O. Gurbuz, et al., "A 1.6–2.3-GHz RF MEMS Reconfigurable Quadrature Coupler and Its Application to a 360° Reflective-Type Phase Shifter", *IEEE Trans. Microw. Theory Techn.*, vol. 63, no.2, pp.414–421, Dec. 2014.
- [154]B. Biglarbegian, *et al.*, "MEMS-based reflective-type phase-shifter for emerging millimeter-wave communication systems," in *Proc. <u>Eur. Microw. Conf.</u>*, Oct. 2010, pp. 1556–1559.

- [155] M. El-Tanani, et al., "C-band low-loss phase shifter >>360° for WLAN applications," in *Proc. Eur. Microw. Conf.*, Oct. 2007, pp. 1503–1506.
- [156] H. Zarei, et al., "Reflective-Type Phase Shifters for Multiple-Antenna Transceivers", *IEEE Trans. Circ. Syst. I, Reg. Papers.*, vol. 54, no. 8, pp. 1647–1656, Aug. 2007.
- [157]F. Ellinger, et al., "Compact reflective-type phase-shifter MMIC for C-band using a lumped-element coupler", *IEEE Trans. Microw. Theory Techn*, vol. 49, no. 5, pp. 913–917, May 2001.
- [158]F. Ellinger, *et al.*, "Ultracompact reflective-type phase shifter MMIC at C-band with 360° phase-control range for smart antenna combining", *IEEE J. Solid- State Circ.*, vol. 37, no. 4, pp. 481–486, Aug. 2002.
- [159] J. Wu, et al., "2.45-GHz CMOS reflection-type phase-shifter MMICs with minimal loss variation over quadrants of phase-shift range", *IEEE Trans. Microw. Theory Techn.*, vol. 56, no. 10, pp. 2180–2189, Oct. 2008.
- [160] M. Tabesh, *et al.*, "60GHz low-loss compact phase shifters using a transformer-based hybrid in 65nm CMOS", in *Proc. IEEE Int. Custom Integr. Circ. Conf.*, Sep. 2011, pp. 1–4.
- [161] M. Tsai *et al.*, "60GHz Passive and Active RF-path Phase Shifters in Silicon", in *Proc. IEEE RF Integr. Circ. Symp.*, Jun 2009, pp. 223–226.
- [162] B. Biglarbegian, et al., "Millimeter-Wave Reflective-Type Phase Shifter in CMOS Technology", *IEEE Microw. Wireless Compon. Lett.*, vol. 19, no. 9, pp. 560–562, Aug. 2009.
- [163] C. Lin, et al., "A Full-360 Reflection-Type Phase Shifter with Constant Insertion Loss", *IEEE Microw. Wireless Compon. Lett.*, vol. 18, no. 2, pp. 106–108, Feb. 2008.
- [164] H. <u>Krishnaswamy</u>, *et al.*, "A Silicon-based, All-Passive, 60GHz, 4-Element, Phased-Array Beam former Featuring a Differential, Reflection-Type Phase Shifter", in *Proc. IEEE Int. Phased Array Syst. Techn.*. *Symp.*, Oct. 2010, pp. 225–232.
- [165] F. Meng, *et al.*, "Miniaturized 3-bit Phase Shifter for 60 GHz Phased-Array in 65 nm CMOS Technology," *IEEE Microw. Wireless Compon. Lett.*, vol. 24, no. 1, pp. 50–52, Jan. 2014.

- [166] J. Lyu, et al., "K-Band CMOS Phase Shifter with Low Insertion-loss Variation", in *Proc. IEEE Asia-Pacific Microw. Conf.*, Dec. 2012, pp. 88–90.
- [167] K. Han, et al., "The design of a 60 GHz low loss hybrid phase shifter with 360 degree phase shift", in *Proc. Int. Symp. On Communications and Information Technologies*, Sep. 2014, pp. 551–554.
- [168] C. Wu, et al., "Design of a K-band low insertion loss variation phase shifter using 0.18-µm CMOS process", in Proc. IEEE Asia-Pacific Microw, Conf., Dec. 2010, pp. 1735–1738.
- [169] W. Li, et al., "A 3.5-4.5 GHz ultra-compact 0.25mm² reflection-type 360° phase shifter", in *Proc. IEEE RF Integr. Circ. Symp.*, Jun. 2011, pp. 1–4.
- [170] Z. Iskandar, *et al.*, "A 30–50 GHz reflection-type phase shifter based on slow-wave coupled lines in BiCMOS 55 nm technology", in *Proc. Eur. Microw. Conf. (EuMC)*, Oct. 2016, pp. 1413–1416.
- [171] R. Yishay, et al., "A 57–66 GHz reflection-type phase shifter with near-constant insertion loss", in *Proc. IEEE Int. Microw. Symp.*, Jun. 2016, pp. 1–4.
- [172] T. Li, *et al.*, "A millimeter-wave fully differential transformer-based passive reflective-type phase shifter", in *Proc. IEEE Int. Custom Integr. Circ. Conf.*, Sep. 2015, pp. 1–4.
- [173] T. Li and H. Wang, "A Millimeter-Wave Fully Integrated Passive Reflection-Type Phase Shifter With Transformer-Based Multi-Resonance Loads for 360° Phase Shifting," *IEEE Trans. on Circuits and Systems I (TCAS-I)*, vol. 65, no. 4, pp. 1406-1419, Apr. 2018.
- [174] W. Li, et al., "A 57-to-64 GHz ultra-compact 0.027 mm<sup>2</sup> reflection type phase shifter with low insertion loss", in *Proc. Eur. Microw. Conf.*, Oct. 2014, pp. 1734–1737.
- [175] J. Long, "Monolithic transformers for silicon RF IC design", *IEEE J. Solid-State Circ.*, vol. 35, no. 9, pp. 1368–1382, Aug. 2000.
- [176] H. Wang, *et al.*, "A CMOS Broadband Power Amplifier with a Transformer-Based High-Order Output Matching Network," *IEEE J. Solid-State Circ.*, vol. 45, no. 12, pp. 2709–2722, Dec. 2010.

- [177] R. Vogel "Analysis and design of lumped- and lumped-distributed-element directional couplers for MIC and MMIC applications", *IEEE Trans. Microw. Theory Techn.*, vol. 40, no. 2, pp. 253–262, Feb. 1992.
- [178] R. Frye, et al., "A 2-GHz Quadrature Hybrid Implemented in CMOS Technology", *IEEE J. Solid- State Circ.*, vol. 38, no. 3, pp. 550–555, Mar. 2003.
- [179] J. Park, et al., "A Transformer-Based Poly-Phase Network for Ultra-Broadband Quadrature Signal Generation," *IEEE Trans. Microw. Theory. Tech.*, vol. 63, no. 12, pp. 4444–4457, Dec. 2015.
- [180] M. Sazegar, *et al.*, "Low-Cost Phased-Array Antenna Using Compact Tunable Phase Shifters Based on Ferroelectric Ceramics", *IEEE Trans. Microw. Theory Techn.*, vol. 59, no. 5, pp. 1265–1273, May 2011.