ABSTRACT A D-band on-off keying (OOK) transceiver chipset is fabricated in a 65-nm bulk CMOS technology as a low-cost and highly integrative solution to short-distance wireless connectivity. Supplementary transistor modeling is performed for accurate circuit design at mm-wave frequencies. To overcome low transistor f max and reduce dc power consumption, the transmitter employs a frequency-multiplier-based architecture with no power amplifier. The receiver adopts a non-coherent architecture consisting of a dccoupled three-stage differential amplifier and an envelope detector. The OOK transmitter exhibits a measured output power of −9.8 dBm and an on-off level difference of 13.2 dB at 134.1 GHz. The receiver shows a measured average responsivity of 4.1 kV/W and a noise equivalent power of 211.4 pW/Hz 1/2 over all D-band frequencies. The dc power consumption of the transmitter and the receiver is 76 and 32.5 mW, respectively. The transceiver is tested in both on-chip loopback and air-channel configurations and demonstrates data transmission up to 10 and 2 Gb/s at a distance of 0.03 m, respectively.
I. INTRODUCTION
Demand for short-distance high-speed wireless communication has continued to increase due to the explosive growth of data traffic in WPAN and IoT devices [1] , [2] . Microwave frequencies have been predominantly employed for diverse commercial and military purposes. The mm-wave band is now considered an alternative frequency resource that can meet the increased demand. Several on-off keying (OOK) transceivers have been reported at mm-wave bands, especially the V-band [3] - [7] . The OOK architecture is appropriate for massive short-distance communications between chips, boards, and devices due to its simplicity, low DC consumption, and small chip size.
Recently, an upper mm-wave band beyond 100 GHz has attracted considerable attention as a promising spectrum for the OOK transceiver system. Compared to the mm-wave frequencies below 100 GHz, the upper band has several benefits. It offers a continuously available wider bandwidth and thus enables a higher data rate which is directly proportional to the bandwidth in the OOK system. Furthermore, if the carrier frequency increases, the fractional bandwidth is reduced, which enables simpler RF circuit design. This additionally reduces the chip and antenna size, thus allowing for a small form factor of the transceiver system.
Traditionally, transceivers at the upper mm-wave band have been fabricated using compound semiconductor technologies [8] - [10] . Recently, owing to rapid advancement of silicon technologies, several silicon-based OOK transceivers have been reported at the upper mm-wave band [11] - [19] . However, they primarily rely on sophisticated device technologies, such as deeply-scaled CMOS, SOI, or SiGe technologies. Consequently, they suffer from high cost and limited accessibility. It should be noted that a majority of application devices and sensors (such as for the IoT or WPAN) requiring wireless connectivity are still implemented in lessscaled bulk technologies because they do not require a high transistor speed. Therefore, it is worth developing a transceiver using the same low-cost and easily-accessible technologies at the upper mm-wave band. This would enable seamless integration of the transceiver with the application devices and sensors and thus lower the manufacturing cost.
In this paper, a D-band OOK transmitter and receiver chip set is implemented using a 65-nm bulk CMOS technology. Compared to previous D-band transceivers that were fabricated predominantly through highly-scaled processes, such as 32-nm and 40-nm nodes, or by means of SOI technologies [14] - [17] , this work demonstrates a relatively low-cost and highly-integrative solution to short-distance wireless connectivity.
Since the given bulk technology was not originally intended for upper-mm-wave circuit design, several design challenges exist, such as limited model accuracy and low transistor speed. To improve the model accuracy at upper mmwave frequencies, a simple supplementary modeling is performed on top of the original transistor model, as described in Section II. To overcome the low transistor speed, i.e. a low transistor f max , a multiplier-based architecture that has no power amplifier (PA) is adopted in the transmitter. The entire transceiver architecture and link budget are presented in Section III. In Section IV, the design of each circuit block comprising the D-band OOK transceiver is presented with the measurement result. Section V provides the measured performance of the integrated transmitter and receiver, followed by a demonstration of wireless data transmission using the transceiver chipset.
II. SUPPLEMENTARY EMPIRICAL TRANSISTOR MODELING
Although a transistor model is provided by the fabrication foundry, the model accuracy becomes degraded as the operating frequency increases. This is because the extrinsic RLC components distributed in the transistor pads are usually underestimated in bulk CMOS technologies. This results in a non-negligible discrepancy between simulation and measurement in the circuit design at mm-wave frequencies [20] . Therefore, supplementary modeling is performed to the original transistor model to improve the model accuracy. As shown in Fig. 1 , the extrinsic distributed RLC components are simply modeled as six supplementary extrinsic components
, L s , C gs , C gd , and C ds ), and are added to the original transistor model. While not rigorously precise, the proposed empirical model enables a quick and efficient correction of the original model, thus making it suitable for mm-wave circuit design.
The supplementary extrinsic component values are empirically extracted from the measured S-parameters of transistors of various sizes. It is difficult to precisely de-embed the effect of RF probing pads and feed structures at upper mmwave frequencies. Therefore, the supplementary modeling is fulfilled up to 50 GHz and is extrapolated toward D-band frequencies. Nonetheless, the accuracy of the extrapolated model is confirmed by V-band and D-band circuit design described in Section IV. Fig. 2 shows the maximum available gain (MAG) or maximum stable gain (MSG) of a 2 × 10 − µm transistor of the 65-nm bulk CMOS technology employed in this work. Compared to the original model, the corrected model described in Section II predicts that the transistor f max is significantly lower than 200 GHz. The low f max makes it challenging to design active circuit blocks at D-band frequencies. Therefore, the D-band OOK transceiver in this work demands a new architecture that overcomes the low f max . Fig. 3(a) shows a block diagram of the proposed D-band OOK transmitter, consisting of a carrier generator and a modulator. Instead of a D-band fundamental oscillator, a V-band-oscillator-multiplier chain is employed to generate the carrier signal due to the low transistor f max of the given technology. A harmonic oscillator can be considered an alternative solution to D-band carrier generation; nevertheless, it exhibits lower output power and DC-to-RF efficiency than the proposed multiplier-based carrier generator [21] , [22] .
III. TRANSCEIVER ARCHITECTURE AND LINK BUDGET
The OOK modulator can be designed as either an active or passive type. The active type can yield a high gain if implemented using an advanced transistor technology, albeit at the expense of high DC power consumption [16] , [17] . However, in the technology used in this work, this benefit is not available because of the limited transistor MAG. Thus, we employ a passive switched-type modulator that reduces the DC power consumption of the transmitter.
A PA design presents another challenge because a large transistor size further lowers f max and MAG at the operating frequency. For example, transistor f max with a large gate width of 3 × 30µm decreases to 125 GHz. Nonetheless, the OOK modulation scheme demands a lower receive SNR and thus a lower transmit power than advanced modulation schemes, such as QPSK and QAM [23] . Therefore, a PA-less architecture is employed in the transmitter in this work to target shortdistance communication with low DC power consumption.
The D-band OOK receiver is shown in Fig. 3 (b). It adopts a non-coherent architecture consisting of a D-band preamplifier and an envelope detector. Compared to a mixerbased coherent OOK receiver [13] , it reduces the complexity and DC power consumption. The pre-amplifier is used to compensate the high air-channel loss and improve the noise performance of the receiver. To achieve a positive gain at the frequency close to the transistor f max , an inter-stage DC-couple technique is used, as described in Section IV-E. An input balun is included in the amplifier for a single-ended antenna feed. As depicted in Fig. 4 , a link budget for 1-Gbps OOK wireless communication at 0.02 m is calculated to estimate the output power required from the transmitter. In non-coherent OOK architecture, the receiver sensitivity can be expressed approximately as [24] :
where NEP is the noise equivalent power of the receiver, BW is the frequency bandwidth, and SNR min is the minimum SNR required for OOK demodulation. According to the presimulation, receiver NEP is 95.9 pW/Hz 1/2 at 140 GHz. The SNR min with a bit error rate (BER) of 10 −3 is approximately 11 dB [25] . Considering BW of 2 GHz needed for a 1-Gbps data rate, the receiver sensitivity is calculated as −12.7 dBm. According to the Friis transmission formula [26] , free-space path loss over a distance of 0.02 m is 41.4 dB at 140 GHz. The antenna gain is assumed to be 21 dBi as a typical value for commercial D-band horn antennas [27] . Then, the minimum output power demanded from the transmitter is finally calculated as -13.3 dBm.
IV. CIRCUIT BLOCK DESIGN
The complete schematics of the D-band OOK transmitter and receiver are shown in Fig. 5 (a) and (b), respectively. All circuit blocks adopt a differential topology to exploit the virtual ground except the D-band modulator. The transmission lines are implemented using a microstrip line structure with a 1.2-um thick top metal used for signal and M1 for ground. The Q-factor of the microstrip line is 55.2 at 140 GHz. Each circuit block is individually designed and tested before integration into the transceiver chipset.
A. V-BAND OSCILLATOR
The V-band oscillator consists of a cross-coupled oscillation core (M 1 and M 2 ) and a source-following buffer (M 3 and M 4 ) as shown in Fig. 5(a) . The gate width of M 1 and M 2 is determined with consideration of the trade-off between the oscillation startup condition and the inductance required in the LC tank. As shown in Fig. 6(a) , the conductance seen looking into the oscillation core (G in ) has a more negative value as the gate width increases, thus facilitating oscillation VOLUME 7, 2019 startup. On the other hand, the larger gate width increases parasitic capacitance, thus dramatically reducing the inductance value required at the output LC tank. A small inductance (<100 pH) is typically implemented using a short transmission line. If the line is too short, it will suffer from high sensitivity to process variation, and layout difficulty. Therefore, the gate width is determined as 12 µm. The tank inductance, calculated as 122 pH, is implemented by high-impedance transmission lines (TL 1 and TL 2 ) with an electrical length of 21 • at 70 GHz. Fig. 6(b) and (c) show a schematic and a chip micrograph of the oscillator test cut, respectively. One of the differential output ports is terminated by an on-chip 50-resistor for measurement purposes. The measured output power and spectrum are shown in Fig. 6(d) . With a drain bias of 1.2 V, the oscillator exhibits a single-ended output power of 0.2 dBm at 67.8 GHz. The output power will further increase by 3 dB if the output is differentially obtained. It should be noted that the simulation based on the corrected transistor model proposed in Section II is −0.2 dBm at 69.1 GHz. This result agrees well with the measurement, and thus validates the model accuracy. The DC power consumption is 42 mW.
B. V-BAND DRIVE AMPLIFIER
The V-band drive amplifier adopts only a single stage of the common-source transistors (M 5 and M 6 in Fig. 5(a) ) to reduce DC power consumption. Since the amplifier is used to provide sufficient drive power to the subsequent D-band frequency doubler, the maximum output power available from a single transistor is simulated at different gate widths, as shown in Fig. 7(a) . The gate width of M 5 and M 6 is thus determined as 60 µm because a further width increase rather reduces the output power due to larger parasitic capacitance. Fig. 7 (b) and (c) show a schematic and a chip micrograph of the V-band amplifier test cut, respectively. For measurement purposes, a Marchand balun is added to both the input and output. As depicted in Fig. 7(d) , the amplifier exhibits a positive gain over the entire V-band frequencies with a peak gain of 5 dB. De-embedding the back-to-back balun loss of 2 dB, the gain of the ''differential'' amplifier actually used in the transmitter will be approximately 7 dB. Consequently, the amplifier will generate output power of at least 5 dBm if driven by the previous V-band oscillator. The DC power consumption is 33.6 mW.
C. D-BAND FREQUENCY DOULBER
As shown in Fig. 5(a) , the D-band frequency doubler is designed with differential common-source transistors (M 7 and M 8 ). At the drain of the transistors, the secondharmonic currents are added in-phase, whereas the fundamental is suppressed. Fig. 8(a) shows the second-harmonic current normalized to the peak as a function of the gate bias voltage (V GG D BL ). From the simulation, V GG D BL is chosen as 0.5 V to maximize the conversion gain, while consuming minimal quiescent DC power. Input and output matching are performed at the fundamental and second-harmonic frequencies, respectively, using transmission lines (TL 13 -TL 19 ) .
A schematic and a chip micrograph are shown in Fig. 8 (b) and (c), respectively. A Marchand balun is added to the V-band input for measurement purposes. The measured output power and conversion gain are shown in Fig. 8(d) . Assuming that the input drive power provided by the preceding stage is 5 dBm, the output power from the doubler reaches −9 dBm.
D. D-BAND OOK MODULATOR
The OOK modulator is designed with a double-shunt transistor (M 9 and M 10 in Fig. 5(a) ) that switches on or off the D-band carrier signal depending on the baseband input. In Fig. 9(a) , the on-resistance (R on ) and off-capacitance (C off ) of the transistor are simulated at different gate widths. It can be seen that a large gate width results in low R on , leading to high isolation of the switch. On the other hand, a small gate width is desirable for low insertion loss of the switch due to low C off . Therefore, the gate width is determined as 30 µm with consideration of the trade-off.
A series transmission line (TL 20 ) is inserted between the two transistors to absorb C off when the switch is in the 'on' state. Thus, the input and output impedances are matched to 50 . The baseband signal is fed into the gate of each transistor through a quarter-wave transmission line (TL 21 and TL 22 ) at the carrier frequency. Compared to conventional resistive feed methods [28] , this reduces the RC time constant at the baseband feed point, thus improving the modulation speed. Fig. 9 (b) and (c) show a schematic and a chip micrograph of the OOK modulator test cut, respectively. The measured result is shown in Fig. 9(d) . The on-state insertion loss and the off-state isolation are 4.4 dB and 17.4 dB, respectively, at 140 GHz. Therefore, the final output power of the integrated transmitter is expected to be -13.4 dBm when the OOK modulator is driven by the preceding frequency doubler with -9 dBm. This output power meets the power budget requirement described in Fig. 4 .
E. D-BAND PRE-AMPLIFIER
As shown in Fig. 2 , the transistor f max is lower than 200 GHz, which makes it challenging to design a D-band pre-amplifier required in the receiver. In this work, an interstage DC-coupled technique is employed to overcome the low f max and low MAG.
A schematic of the D-band pre-amplifier is shown in Fig. 5(b) . It consists of three DC-coupled differential stages (M 11 -M 16 ) and an input balun. In conventional multi-stage amplifiers, an AC-couple capacitor is usually added between the stages to separate the DC biases. However, the capacitor suffers from a low Q-factor as the frequency increases, thus lowering the MAG of each stage. For example, the Q-factor of a 100-fF MIM capacitor decreases to 0.26 at 140 GHz according to the simulation. In Fig. 10(a) , the MAG of a two-stage transistor cell is compared when the two transistors are AC-coupled with the MIM capacitor or are DC-coupled without the capacitor. The MAG of the AC-coupled cell decreases by 1.6 dB at 140 GHz. Therefore, the D-band amplifier in this work is designed without AC-couple capacitors.
Since the inter-stage is DC-coupled, the gate and drain biases are all tied into a common voltage of 1 V. The gate width is optimized to 1 × 14 µm to obtain the maximum MAG at D-band frequencies. Impedance matching is performed using transmission lines. Especially, the relatively wide transmission lines of TL 43 and TL 44 are employed to extend the operating bandwidth [29] . A rat-race coupler is inserted at the input for converting a single-ended signal into a differential one. Fig. 10(b) , (c), and (d) depict a schematic, a chip micrograph, and measured S-parameters of the D-band amplifier test cut, respectively. The peak gain of 10.4 dB is measured at 117 GHz. The DC power consumption is 32 mW. The simulation agrees well with the measurement even at D-band frequencies owing to the supplementary modeling proposed in Section II.
F. D-BAND ENVELOPE DETECTOR
The D-band envelope detector is designed in a differential common-source topology. According to the simulation of VOLUME 7, 2019 the responsivity and noise-equivalent power (NEP) shown in Fig. 11(a) , the gate bias voltage is chosen as 0.5 V. The input impedance is matched using transmission lines (TL 55 -TL 58 ). The DC output is taken at the common node of the differential pair, so that the D-band carrier is suppressed by itself. The output load resistor (R 2 ) is chosen as 1 k considering the trade-off between responsivity and voltage headroom. Fig. 11(b) and (c) show a schematic and a chip micrograph of the envelope detector, where a D-band rat-race balun is added to the differential input for measurement purposes. The measured average responsivity and NEP from 110 to 170 GHz are 1.2 kV/W and 234 pW/Hz 1/2 , respectively.
V. MEASUREMENTS OF INTEGRATED TRANSCEIVER A. D-BAND INTEGRATED OOK TRANSMITTER
The transmitter chip, integrating the V-band oscillator, drive amplifier, D-band frequency doubler, and OOK modulator, is shown in Fig. 12 . The chip area is 1110 µm × 634µm including probing pads. The measured output power when the baseband input is one and zero are -9.8 dBm and -23 dBm, respectively, at 134.1 GHz. The DC power consumption of the transmitter is 76 mW.
The modulation performance of the transmitter is tested using the measurement setup shown in Fig. 13(a) . A (2 9 − 1) PRBS from an arbitrary waveform generator modulates the transmitter. The D-band modulated output is down-converted to IF of 10 GHz by a sub-harmonic mixer and then is monitored by an oscilloscope. The modulated waveforms at 1, 4, 7, and 10 Gbps are depicted in Fig. 13(b) .
B. D-BAND INTEGRATED OOK RECEIVER
The receiver chip, integrating the D-band pre-amplifier and envelope detector, is shown in Fig. 14 . The chip area is 1420 µm × 620µm including probing pads. The measured responsivity and NEP of the receiver are shown in Fig. 15 . The averaged responsivity and NEP over the entire D-band frequencies are 4.1 kV/W and 211.4 pW/Hz 1/2 , respectively. The responsivity and NEP of the envelope detector alone are superimposed. The integrated receiver exhibits a higher responsivity than the individual envelope detector owing to the pre-amplifier. The receiver consumes DC power of 32.5 mW.
The demodulation performance of the receiver is tested using an on-chip loopback transceiver system that integrates the receiver with an OOK modulator, as shown in Fig. 16(a) . A 140-GHz carrier is externally injected, modulated, and then fed to the receiver for demodulation. Fig. 16(b) shows the measured eye diagrams of the demodulated output at 1, 4, 7, and 10 Gbps. It should be mentioned that measurement at higher data rates is limited by the oscilloscope bandwidth, not by the receiver demodulation capability.
C. WIRELESS DATA TRANSMISSION WITH TRANSCEIVER CHIPSET
A wireless data transmission is demonstrated using the proposed transceiver chipset, as shown in Fig. 17(a) . A D-band horn antenna with 21-dBi gain is connected to each of the transmitter and receiver chips. The air-channel distance between the horn antennas is d. A (2 9 − 1) PRBS data input modulates the D-band carrier that is internally generated in the transmitter chip. Fig. 17(b) shows the measured eye diagrams of the demodulated signal at the receiver output. The data rates are 1 and 2 Gbps at d = 0.01, 0.02, and 0.03 m. To confirm the communication quality, error vector magnitude (EVM) is calculated by post-processing the timedomain data of eye diagram for 1 Gbps at 0.02 m. The calculated EVM is 9.8 %. Note that the on-chip loopback transceiver in Fig. 17 exhibits a modulation/demodulation capability up to 10 Gbps. Therefore, the data rate and channel distance (d) can be further improved by increasing the output power of the transmitter.
In Table 2 , the transceiver in this work is compared with other ASK/OOK transceivers operating above 100 GHz. The proposed transceiver achieves a high data rate of 10 Gbps with relatively low DC power and a small chip area owing to the simple architecture employing no PA, mixer, or LO. Relatively short channel distance is traded for the low DC power. More importantly, the proposed transceiver is fabricated in a low-cost low-speed technology compared with other transceivers [14] - [17] . The low transistor f max and the inaccurate model are overcome by the multiplier-based architecture, supplementary modeling, and DC-coupled design technique. 
VI. CONCLUSION
A D-band OOK transmitter and receiver were demonstrated in a bulk 65-nm CMOS technology. To overcome the limitations of the low-cost low-speed technology for mm-wave circuit design, a new transceiver architecture and supplementary transistor modeling were proposed. The transceiver showed a decent data-communication performance comparable with those of other state-of-the-art transceivers fabricated in highcost highly-scaled technologies. This work thus demonstrates a relatively low-cost, highly-integrative solution for shortdistance wireless connectivity.
