Abstract-This paper presents a 64-84-GHz phase-locked loop (PLL) realized in a low-cost 80-GHz HBT technology. The circuit consists of a wide tuning-range voltage-controlled oscillator, a push-push frequency doubler, a divide-by-32 frequency divider, a phase detector and an active loop filter. The measured phase noise at 1-MHz offset is 106 dBc/Hz. The output power is 2.5 dBm at 64 GHz, and it slowly decreases to 8.1 dBm at 84 GHz, with a maximum dc power consumption of 517 mW. To the authors' knowledge, the circuit achieves the widest frequency tuning range and its in-band phase noise is the lowest among the fully integrated -band PLLs reported to date.
I. INTRODUCTION
H IGHLY stable frequency sources are a key requirement of many millimeter-wave (mm-wave) communication and sensing systems. Stability is typically achieved using a phase-locked loop (PLL) approach, tying the mm-wave output to a lower reference frequency. With emerging applications in communications at 60 GHz and radar at 77 GHz, -band signal sources are in high demand. Owing to their potential for highly complex multifunctional integrated circuit (IC) implementations, Si/SiGe heterojunction bipolar transistor (HBT), Si/SiGe BiCMOS and ultra-scaled CMOS have made silicon the predominant platform for integrated synthesizer developments. Aside from phase noise and output power, tuning range is another important parameter, either because the application requires it (e.g., high-resolution frequency-modulated continuous wave (FMCW) radar) or because it provides the flexibility to use the synthesizer for a number of different applications. In sensor applications, the tuning speed is often another critical challenge.
A review of published wideband integrated synthesizers with output frequencies in the 60-100-GHz frequency range shows a Manuscript received March 10, 2012 ; revised August 01, 2012; accepted August 06, 2012. Date of publication September 04, 2012; date of current version December 13, 2012 . This work was supported by the German Ministry of Education and Research (BMBF) under the framework of Project EASY-A (Contract 01BU0815). This paper is an expanded version of the paper presented at the 12th IEEE Topical Meeting on Silicon Monolithic Integrated Circuits in RF Systems, January 15, 2012, Santa Clara, CA.
The authors are with the Institute of Electron Devices and Circuits and the Competence Center on Integrated Circuits in Communications, Ulm University, 89081 Ulm, Germany (e-mail: gang.liu@uni-ulm.de; andreas.trasser@uni-ulm.de; hermann.schumacher@uni-ulm.de).
Color versions of one or more of the figures in this paper are available online at http://ieeexplore.ieee.org.
Digital Object Identifier 10.1109/TMTT.2012.2213833 significant variety of approaches. The voltage-controlled oscillator (VCO) may oscillate directly at the frequency of operation, or a lower frequency VCO may be used with subsequent frequency multiplication [1] (and also this paper). Alternatively, a low-frequency PLL-stabilized VCO may be up-converted using an injection-locked oscillator at the higher frequency of operation [8] . To achieve a very wide tuning range, CMOS and BiCMOS synthesizers use switched MOS varactors, usually via a 3-bit digital control word [2] , [3] , or even multiple VCOs on one chip [5] - [8] .
The choice of loop bandwidth and divider ratio between the frequency of operation and the reference frequency affects phase noise, but also locking range (for analog PLLs) and locking speed. Most of the surveyed PLLs use small loop bandwidths, 4 MHz or below, down to 100 kHz [1] - [3] , [5] - [8] , and only [4] uses a large loop bandwidth (45 MHz) comparable to the design presented here. Most of the designs [1] - [3] , [5] , [7] , [8] also use rather high divider ratios, between 768-8184. Only [4] and [6] use low divider ratios, 64 and 16, respectively, comparable to the work presented here. Low divider ratios lead to low phase noise, provided that a stable reference source is available. The dividers use injection-locked [5] dynamic and static division principles. In case of the injection-locked divider, it imposes a limitation on the tuning range of the synthesizer.
All reviewed designs, finally, use a charge pump in the loop. This is a common approach, but requires fast pnp or pMOS devices.
A tabular overview of the reviewed designs, along with results of the synthesizer presented here, is shown in Table I .
The design reported here had the following objectives.
• Frequency of operation very close to the technology . This is, on the one hand, motivated by the need of very lowcost front-end ICs for consumer applications at 60 GHz, and on the other hand, provides valuable insight into design techniques capable of addressing sources for systems operating much above 100 GHz.
• Very wide tuning range. This is motivated by the need to address several applications with one synthesizer IC.
• The ability to sweep over a wide frequency range without band switching, as needed by high-resolution FMCW sensors.
• Fast settling time for applications requiring high-frequency agility.
• State-of-the-art phase-noise suppression despite the partially conflicting requirements above. Specifically, an approach was chosen that uses a moderate divider ratio (64 with respect to the frequency of operation) to lower phase noise, a loop bandwidth of 50 MHz for large 0018-9480/$31.00 © 2012 IEEE The circuit was first discussed briefly in [9] . This paper is an extended version, describing the design, simulation, and characterization in more details.
II. TECHNOLOGY OVERVIEW
For realizing the PLL circuit, we used an SiGe HBT technology [10] from Telefunken Semiconductors GmbH. The process requires only 22 masks with 0.8-m minimum feature size, and therefore, is very cost effective. Two types of npn transistors are available with minimum geometric emitter size of 0.8 1.4 m (electrically active emitter size of 0.5 1.1 m ). The selectively implanted collector (SIC) npn transistor has an of 80/90 GHz and a of 2.4 V. The non-SIC transistor improves the to 4.5 V, with a reduced of 50 GHz. One type of lateral pnp (LPNP) transistor is also available, but not suitable for RF purposes. Four types of resistors, metal-insulator-metal (MIM) and nitride capacitors, as well as varactor diodes are provided. The process offers three metal layers for passive elements and interconnections. Two types of silicon substrate (1000 and 20 cm) are available. The low-resistivity substrate is chosen for this study.
III. PLL CIRCUIT DESIGN
The block diagram of the PLL circuit is shown in Fig. 1 . It is an analog PLL incorporating a phase detector (PD) and active loop filter (LF). Compared to the commonly used digital PLL employing a phase frequency detector (PFD) and charge pump, the analog PLL can operate at much higher frequencies, enabling a smaller frequency division ratio in the PLL, and therefore, lower phase noise.
A. VCO and Doubler
The signal generation is realized by a VCO and a frequency doubler, which is a modified version of the work reported ear- lier [14] . Fig. 2 shows the schematic of the VCO and frequency doubler. The VCO is a differential Colpitts VCO with a cascaded common base stage as the output buffer. It provides two differential outputs to drive the frequency doubler and divider in parallel. Large transistors (0.5 30 m ) are used in the VCO core and output buffer because they can generate more output power and have smaller base resistance (compared to small transistors), facilitating oscillation at higher frequency and resulting in lower phase noise [11] . The inductors and all matching networks are realized with thin-film microstrip lines (TFMLs). The oscillation frequency is tuned through varactor diodes.
The oscillation frequency of the VCO can be calculated as (1) where is the base inductance and is the equivalent capacitance of 1 in series with the transistor base-emitter junction capacitance . To have a wide frequency tuning range, should be much smaller than . However, a very small cannot produce sufficient negative impedance to start the oscillation. in parallel to improve the quality factor. The capacitance of a single varactor diode changes from 43 to 107 fF, when the dc voltage across the varactor varies from 4 to 0 V. The quality factor at 30 GHz changes from 20 to 10, as shown in Fig. 3 . The capacitance can be further increased to 158 fF with slight forward biasing 0.4 V . This is helpful to achieve a wider tuning range, although the phase noise gets slightly worse. In this design, the slightly higher phase noise of the free-running VCO is not critical because it is well suppressed within the loop bandwidth and already decreases to a sufficiently low level at high offset frequencies (outside the loop bandwidth, 50-MHz offset).
The oscillation frequency can be continuously tuned from 31 to 42 GHz. Since the frequency divider requires only a very small input power, the VCO output is modified such that most of the power (5 dBm) is delivered to the frequency doubler and about 0 dBm to the frequency divider.
The frequency doubler is a push-push type, based on an emitter coupled pair (ECP), as shown in Fig. 2 . This type of frequency doubler can provide higher conversion gain, compared to the passive frequency doublers, and it requires less chip area than the nonlinearity type of frequency doublers because no specific filter and matching networks is needed at the input and output. Frequency doublers based on the push-push topology have been reported [12] , [13] , operating at relatively low frequencies (compared to ). In this study, the frequency doubler is operated at frequencies close to . Therefore, the topology is modified to have better performance in the designed frequency range. The output is taken from the common emitter, with a 300-pH inductor as the load, which provides higher conversion gain than a resistive load (as in [12] ). The ECP is biased for class-A operation using a current source. Compared with class-B operation (as in [13] ), the conversion gain and output power is higher at high input levels ( 0 dBm). With 7-dBm input power, the conversion gain peaks at 30-GHz output ( 5 dB) and slowly decreases to 10 dB at 80 GHz. Further details of the VCO and frequency doubler are provided in [14] .
B. Frequency Divider
Frequency dividers are also important building blocks in modern PLLs. They decrease the input frequency from the VCO so that the PLL can operate at much lower frequencies. However, the in-band phase noise of the locked signal will be increased by a factor of , where is the frequency division ratio. Therefore, a small frequency division ratio is preferred to obtain a lower phase noise. In this study, a division ratio of 32 (for fundamental VCO) is chosen, which lowers the reference signal to a relatively low frequency (around 1.1 GHz) and still provides good phase-noise performance.
The divide-by-32 frequency divider consists of five divide-by-2 stages. To get a wide tuning range PLL, the frequency divider should operate properly over the complete frequency tuning range of the VCO. The dc power consumption should also be minimized. Therefore, each divide-by-2 stage is designed and optimized separately balancing maximum operating frequency and power consumption.
The first divider stage is the most critical one because it operates at the highest frequency. The maximum VCO output frequency is 42 GHz so the first divider stage should operate at least up to 42 GHz (better up to higher frequency to keep some safety margins). Static frequency dividers based on master-slave D-flip-flops (DFFs) cannot operate at this frequency range due to the high ratio of operating frequency and . Injection-locked frequency dividers (as used in [2] and [3] ) can operate at higher frequency, but they all rely on introducing additional phase shift to an oscillator to achieve locking [15] , therefore usually have limited operating frequency range. In this design, a dynamic frequency divider is used as the first stage. It is based on the regenerative concept [16] , as shown in Fig. 4(a) . The output of the mixer, after passing the low-pass filter and amplifier, is fed back to one input of the mixer. When an input signal at frequency with a certain power is applied to the other input of the mixer, the divider will sustain an oscillation within the loop at frequency , provided that the higher mixing products ( , , etc.) are filtered out by the low-pass filter. The operation of the dynamic divider relies on the mixer and low-pass filter, which can have very wide bandwidth so the dynamic frequency divider can operate over a much wider frequency range than the injection-locked dividers. Fig. 4(b) shows the schematic of the dynamic divider, consisting of a Gilbert-cell mixer core , a transimpedance amplifier (TIA) stage and emitter-followers (EFs) as low-pass feedback and output buffer . The transimpedance stage increases the mismatch in the loop, leading to wider bandwidth and much better performance (with respect to maximum operating frequency, safe broadband operation, and high sensitivity), compared with a single resistor as the load [17] . Small transistors (0.5 5 m ) are used in the mixer core and TIA for low current consumption. The maximum operating frequency of the dynamic divider is determined by the open-loop gain. To calculate the open-loop gain, the divider feedback loop is cut at the output of the second EF , as indicated in Fig. 4 . One input signal at frequency is injected into the divider from the input port (base of ) and another signal at frequency is injected into the base of . 2 The open-loop gain can be calculated as the power gain from the input at to the output at . Fig. 5 shows the simulated open-loop gain under small-signal driving condition. 3 The maximum operating frequency of the dynamic divider is twice the frequency at which the open-loop gain drops to 1. By adjusting the resistors in the TIA and EFs, the open-loop gain can be maximized. In this design, the maximum operating frequency is intentionally reduced from 70 to 50 GHz to save dc power consumption (by increasing the load resistor in the TIA and reducing the current in the EFs). The dynamic divider stops dividing at low frequencies ( 10 GHz) because the third harmonic in the mixer output signal is not sufficiently suppressed in 2 Dummy EFs (same as -) are inserted before and the output of is terminated with a dummy mixer core to represent the same matching conditions in the closed loop. 3 As the signal level at increases, the loop gain slowly decreases due to nonlinearity of the transistors. the feedback loop [17] . Therefore, it is not suited for frequency division at very low frequencies. The second stage is a static frequency divider based on master-slave DFFs, as shown in Fig. 6 . The DFFs are built using emitter-coupled-logic (ECL) ( being the clock, , being the latches). The EF provides the dc level shift (for proper biasing of , ) and increases the speed (maximum operating frequency) of the divider. , , , and are the most critical elements that determine the speed of the static divider, as analyzed in [18] and [19] . By choosing the third smallest transistor (0.5 2 m ) for , , and 100 for , the speed of the divider can be maximized (about 34 GHz). However, such high speed for the second divider stage is not necessary. Therefore, the speed is reduced to 28 GHz by increasing and decreasing the current in and . The dc power consumption is reduced by 50%.
The static divider can operate down to very low frequencies (megahertz range, limited by the on-chip capacitor). Therefore, it is also used for the following three stages. Since they operate at much lower frequencies, the speed of the following stages is further reduced to save dc power consumption. The EFs in the DFFs (indicated via the dashed box) are omitted, the current in is reduced, and is increased to compensate the decrease of voltage swing due to the lower current. The output buffer (EF) is only added at the last output stage.
The operating frequency range and input and output power of each stage are designed carefully so that the divider chain can operate properly over the complete frequency range with low power consumption and sufficient output power. Fig. 7 shows the block diagram of the divider chain with the maximum operating frequency and dc power consumption (under 3.5-V supply) of each stage. A standalone frequency divider has been realized for characterization. Fig. 8 shows the measured input sensitivity and output power. The divider chain can divide from 11 GHz up to 50 GHz, covering the complete tuning range of the VCO. The required input power ( 8 dBm) is well below the available power from the VCO and the output power is about 2 dBm, which is sufficient to drive the PD.
C. PD and LF
The PD is based on a Gilbert-cell mixer with resistive load, as shown in Fig. 9 . The EF provides the dc level shift so that the PD can drive the following LF directly. The divider output is connected to the switching quad of the mixer core and the reference signal is connected to the transconductance stage.
When the two input signals are different in frequency (at the initial phase of the locking process), the PD operates as a mixer and generates a low-frequency signal, which tunes the VCO frequency toward the reference frequency. When the two input signals have the same frequency, but differ in phase (by ), as in the locked state, a dc voltage shift is produced at the output, fixing the VCO frequency at the reference frequency (multiplied by the division ratio ). Fig. 10 shows the simulated output voltage of the PD versus the input phase error. The gain of the PD , defined as , is mainly determined by the load resistor and the reference signal power. As will be shown in Section IV, influences the PLL loop gain and noise contribution from the PD, and thus needs to be chosen carefully. In the current design, the load resistor is 550 and the PD gain is about 0.1 (with 15-dBm reference signal power).
In addition to the dc voltage shift, the output of the PD also contains ac signals, which are mixing products of the two input signals. These ac signals will modulate the VCO and generate spurs (reference spurs) around the locked signal, and must be sufficiently suppressed. The suppression of the ac signals on the tuning node (and hence, the spur signals at the PLL output) is determined by the LF attenuation at the spur frequencies (harmonics of the reference frequency), which can be controlled by the LF bandwidth. To have high spur suppression, a small bandwidth (low cutoff frequency) is preferred. However, a small LF bandwidth directly leads to small loop bandwidth, and hence, small frequency locking range and longer settling time of the PLL, so a compromise between spur suppression and locking range must be considered when choosing the filter bandwidth. In this design, a relatively wide bandwidth is chosen because wide locking range is the main design goal and the reference spurs are less critical since they are far from the locked signal.
The LF is also shown in Fig. 9 . It is based on a feedback amplifier and passive RC filter with an EF in between for dc level shift. The amplifier amplifies the small control signal from the PD and performs low-pass filtering (though feedback) together with the RC filter. The bandwidth and attenuation (at high frequency, e.g., at 1 GHz) of the filter, which influence the PLL loop bandwidth and reference spur suppression, are mainly determined by the feedback and the RC filter . Fig. 11 shows the simulated ac power gain of the filter. The 3-dB bandwidth is 47 MHz, which is a compromise between wide locking range (wide loop bandwidth) and spur signal suppression (smaller bandwidth).
The free-running VCO can be tuned from 31 to 42 GHz, which corresponds to a voltage tuning from 0.6 to 6 V. In order to lock the VCO to a wide frequency range, the LF output must be able to cover a wide voltage range. Since the output voltage swing of the PD is very small (0.13 V peak-to-peak at 30 MHz), the LF must provide sufficient gain to have a large output voltage swing, which also increases the gain of the PD and helps to suppress the noise contribution from the PD and LF, as will be shown in Section IV. The gain of the LF can be easily adjusted by changing the load resistor and biasing current. With a load of 2.5 k , the LF has 33-dB gain and 2.3-V (peak-to-peak) output voltage swing. This allows a maximum frequency tuning range of 4 GHz that is wider than the locking range determined by the loop bandwidth. Due to the high voltage swing, the non-SIC transistors (with 4.5-V ) are used in the LF design. The locking range of the PLL is limited by the loop bandwidth and covers only part of the VCO tuning range (about 3 GHz around the free-running frequency). Therefore, the usable frequency range of the VCO is limited. To cover the full tuning range of the VCO, the locking range can be shifted by adjusting the supply voltage of the LF (
). The biasing current of the filter is fixed so that a change of the supply voltage only shifts the dc voltage level at the output node. As a result, the free-running frequency of the VCO, and hence, the locking range, can be shifted, while the loop response remains the same. By changing from 2 to 8 V in steps of 1 V, the PLL can lock to different frequency bands, covering almost the whole tuning range of the VCO, as will be shown later in Section V.
IV. PLL SIMULATION
To assist and optimize the design of the PLL, different simulations have been performed. The PLL behavior in the locked state (loop gain, loop bandwidth, phase noise) is simulated in the frequency domain, based on a linear model, as shown in Fig. 12 . The transient response of the PLL (locking time) is simulated in time domain using transient simulation (ADS), based on real circuit models.
A. Loop Gain Simulation
In the locked state, the PLL can be modeled by a linear model in the frequency domain, as shown in Fig. 12 , where , , , and are PD gain, VCO gain, LF transfer function, and frequency division ratio, respectively.
The open-and closed-loop gain are two important PLL design parameters, defining the PLL loop stability and bandwidth. The open-loop gain is the product of all the transfer functions in To simulate the open-loop gain, the loop is cut at the output of the frequency divider, as shown in Fig. 12 . The PD, VCO, and frequency divider are replaced with ideal behavioral models. The complete circuit model of the LF is used in the simulation since a behavior model could not be extracted. The simulation is performed in ADS using ac simulation. After feeding an input signal, the open-loop gain can be calculated as the gain from the input to the divider output.
The stability of the loop can be evaluated from the amplitude and phase of the open-loop gain. The phase margin 4 is usually used to define the stability of the PLL. In this PLL design, one effective way to control the phase margin is to tune the load resistor in the LF ( in Fig. 9 ). Fig. 13 (a) and (b) shows the simulated magnitude and phase of the open-loop gain for different load resistors. As the load resistor decreases, the phase margin increases and the loop becomes more stable. However, decreasing also reduces the loop bandwidth and locking range. A 2.5-k load resistor is used in the current design, which gives a good compromise between loop bandwidth and stability.
The closed-loop gain (also called the transfer function)
defines the phase relation between the input and output. The performance of the PLL (locking range, spur suppression, etc.) is mainly determined by the closed-loop gain. Fig. 13(c) shows the simulated closed-loop gain for different load resistors.
B. Phase-Noise Simulation
Phase noise is an important characteristic of a PLL. Ideally, the output phase noise should be the same as the phase noise of the reference signal plus , where is the frequency division ratio, as indicated in the closed-loop gain simulation. However, the noise of other loop components also contributes to the final output phase noise. Each component has its own transfer function. In the locked state, the PLL can be modeled as a linear system, where the total output noise is the sum of each noise source (from each component) multiplied by its own transfer function. In order to achieve the lowest possible phase noise, the noise from other loop components should be lower than the reference noise. The phase noise is simulated (using ac simulation) based on the model in Fig. 12 , where the noise contributions of the individual components ( , , etc.) are added separately. The noise is represented by the root mean square (rms) noise voltage at each frequency. The simulated noise data of the PD, LF, frequency divider, and the measured noise data of the VCO, reference signal (1-GHz signal from Agilent signal source E8254A), as shown in Fig. 14(a) , are used. The PD and the LF are treated as one circuit and their noises are combined.
The simulated total phase noise and contributions from the loop components are given in Fig. 14(b) . The in-band phase noise is mainly dominated by the reference signal and the out-of-band noise is dominated by the VCO. The noise from the PD and LF is well suppressed due to the high gain in the LF. The divider phase noise is much lower and has almost no influence on the output phase noise. The minimum in-band phase noise is mainly determined by the contribution from the PD and LF, which is about 120 dBc/Hz at 1-MHz offset.
C. Transient Simulation
PLL locking is a nonlinear and dynamic process, which cannot be modeled using the linear model in Fig. 12 . Therefore, the complete PLL circuit (VCO, divider, PD, and LF) is simulated in the time domain using transient simulation. Fig. 15 shows the simulated tuning voltage of the VCO when locking to a 1.1-GHz reference signal. The locking time is less than 100 ns. Fig. 16 shows a photograph of the PLL IC. The chip is very compact and occupies an area of only 1.1 1 mm , including the bonding pads.
V. MEASUREMENT RESULTS
The circuit is characterized on-wafer. The output is connected through a ground-signal-ground (GSG) probe to a -band mixer (Agilent 11970V), which extends the frequency range of the spectrum analyzer (Agilent 8563E). A signal generator (Agilent 8254A) is used to generate the reference signal, which is connected through a GSG probe to the input of the PD (the second input is grounded by the ground of the probe). The VCO and frequency doubler are biased with a 4-V supply and draw 60-mA current. The frequency divider and PD are biased with a 3.5-V supply and consume 54-mA current (45 mA for a divider). The supply voltage of the LF varies from 2 to 8 V with a current change from 2 to 11 mA (due to the output EF).
A clean locked spectrum is observed at the output. Fig. 17 shows the locked spectrum at 80 GHz with different span. The filter is biased at 6 V. The reference signal is at 1.25 GHz, with a power of 15 dBm. The suppression of the reference spur is 37 dB. Fig. 18 shows the measured phase noise at different frequencies. The measurements are very close to the simulation results (6 dB are added to simulation results because of frequency doubling). At 1-MHz offset, the measured phase noise is 106 dBc/Hz. The rms phase error (integrated up to 100 MHz) is 4.4 , which is calculated from the measured phase-noise data using the following equation [20] : (3) where is the rms phase error in degrees, is the measured phase noise power relative to carrier (in scalar units), and and are the lower and upper limits for integration. With a 4-V supply at the LF, the PLL can lock from 69.8 to 76.9 GHz, which is limited mainly by the loop bandwidth. By changing the LF supply voltage, the PLL can lock to different frequency bands, as shown in Fig. 19 , covering the complete frequency range from 64 to 84 GHz.
The output power of the PLL is given in Fig. 20 . At 64 GHz, the measured output power is 2.4 dBm. Due to the increased conversion loss of the doubler, the output power decreases to 8.1 dBm at 84 GHz. 5 The leakage of the fundamental signal at the output is below 20 dBm over the whole tuning range, as shown in Fig. 20 . Table I compares this study with other fully integrated -band PLLs. This work achieves the widest frequency tuning range and lowest in-band phase noise.
VI. CONCLUSION
This paper has presented the design, simulation, and characterization of a mm-wave PLL circuit that achieves the widest frequency tuning range (64-84 GHz) among all reported mm-wave PLLs. The circuit is realized in a low-cost 80-GHz SiGe HBT technology, yet still delivers a satisfactory output power of 2.5 dBm at 64 GHz and 8.1 dBm at 84 GHz. The measured phase noise ( 106 dBc/Hz at 1-MHz offset) is the lowest reported in-band phase noise for PLLs in similar frequency ranges. The circuit is well suited as a -band signal source for low-cost applications. 
