Abstract-Low-power (LP) high-speed serial I/O transmitters which include equalization to compensate for channel frequency-dependent loss are required to meet the aggressive link energy-efficiency targets of future systems. This brief presents an LP serial-link-transmitter design that utilizes an output stage which combines a voltage-mode driver, which offers low staticpower dissipation, and current-mode equalization, which offers low complexity and dynamic-power dissipation. The utilization of current-mode equalization decouples the equalization settings and termination impedance, allowing for a significant reduction in predriver complexity relative to segmented voltage-mode drivers. Proper transmitter series termination is set with an impedance control loop which adjusts the on-resistance of the output transistors in the driver voltage-mode portion. Further reductions in dynamic-power dissipation are achieved through scaling the serializer and local clock distribution supply with data rate. Fabricated in a 1.2-V 90-nm LP CMOS process, the transmitter supports an output swing range of 100-400 mV ppd and up to 6 dB of equalization and includes output-duty-cycle control. The transmitter achieves 6-Gbit/s operation at 1.26-pJ/bit energy efficiency with 300-mV ppd output swing and 3.72-dB equalization.
I. INTRODUCTION
A LARGE percentage of serial-link power is often consumed in the transmitter, which must provide adequate signal swing on the low-impedance channel, maintain proper source termination, and include equalization to compensate for channel frequency-dependent loss. In low-power (LP) designs, the output driver often consumes the majority of the static power due to the low-impedance channel. This leads link architects to consider voltage-mode drivers to improve energy efficiency, as with differential receiver-side termination, these drivers have the potential to consume one-quarter of the outputstage power relative to conventional current-mode drivers [1] .
While obtaining significant improvements in I/O energy efficiency will require improvements in electrical channel loss characteristics [1] , the ability to efficiently include some transmit equalization allows for more loss compensation and increased flexibility in equalization circuitry partitioning. How- ever, the potential power savings of voltage-mode drivers generally degrade with the introduction of transmit equalization and overheads associated with maintaining proper source termination. In order to generate the different output voltage levels for transmit equalization, significant output-stage segmentation is required in voltage-mode drivers which implement resistor divider [2] - [4] and channel shunting approaches [5] . This segmentation increases predriver complexity, resulting in degraded dynamic-power consumption. Additional output-stage segmentation is often implemented to digitally tune the driver termination to match the channel [5] , further degrading energy efficiency. While analog control loops which scale the predriver supply can be utilized to set the driver output impedance [2] , [6] , [7] , this does not allow independent optimization of the predriver supply to minimize dynamic power with data rate. This brief presents a hybrid voltage-mode transmitter with current-mode equalization, which enables independent control over termination impedance, equalization settings, and predriver supply, allowing for a significant reduction in predriver complexity and power. Transmitter equalization techniques are reviewed in Section II, with a comparison of the hybrid transmitter with voltage-and current-mode drivers. Section III details the transmitter architecture, which includes local clocking circuitry with duty-cycle correction, lowcomplexity scalable-supply serialization and predriver, hybrid driver, and global impedance control. Experimental results from a 90-nm LP CMOS prototype are presented in Section IV. Finally, Section V concludes this brief.
II. TRANSMITTER EQUALIZATION TECHNIQUES
Channel frequency-dependent loss, which causes intersymbol interference, is often compensated by equalization implemented at the transmitter in the form of an FIR filter. Fig. 1 shows examples of a two-tap FIR filter, with one maincursor tap and one postcursor tap, in low-swing voltage-mode, current-mode, and proposed hybrid drivers. Assuming a standard two-tap high-pass FIR filter with a negative postcursor tap
and the amount of equalization peaking is
Two techniques to implement FIR equalization in the lowswing voltage-mode driver include a resistive voltage divider [ Fig. 1(a) resistor network [ Fig. 1(b) ] [5] . Here, both techniques utilize segmentation of the output driver to implement the different output voltage levels for equalization. In the design in [2] and [4] , a 1 − α percentage of the output segments is controlled by the main-cursor tap, and an α percentage is controlled by the postcursor tap, with the output segments sized to ensure that all parallel combinations maintain proper source termination. Note that, as shown in Table I , the current drawn from the output driver supply V REF varies with the output level, with all current flowing out into the channel during the maximum output swing and a portion being sunk at the transmitter during the deemphasized level. This current variation can be a problem, as it necessitates more stringent voltage regulation of the V REF supply. The design in [3] sacrifices a constant channel match and modulates the output impedance to achieve a lower output voltage level at a lower current level, while a constant current draw is achieved in [5] by switching a shunt resistor network in addition to the main output transistors.
The main drawback associated with these voltage-modedriver designs involves the overhead in the predrive logic required to distribute the tap weights among the segments, which grows with equalization resolution. In contrast, as shown in Fig. 1 (c), current-mode drivers offer the potential to implement high-resolution equalization without significant predriver complexity by setting the tap coefficients with tail-current-source digital-to-analog converters (DACs). If the output switches of the current steering stages are sized to handle the maximum tap current, only a single predrive buffer is required per equalization tap. However, this reduction in predrive dynamic power is greatly overshadowed by the four-times increase in output-stage static current due to the parallel termination scheme. Fig. 1(d) shows a simplified schematic of the hybrid driver proposed in this brief which combines the low output current levels of a voltage-mode driver to implement the main tap and a parallel current-mode driver to implement the postcursor tap with minimal predriver complexity. While parallel current drivers have previously been implemented with voltagemode drivers as swing enhancers [4] , this implementation improves driver energy efficiency by eliminating the voltagemode-driver segmentation, as the equalization coefficient is set via the current-mode-driver tail-current DAC setting.
For the hybrid two-tap driver, the voltage-mode output-stage reference voltage is reduced to a value of V * ppd.max (1 − α), and the maximum swing is
where R TX is the transmitter impedance and I EQ is the equalization current. The minimum differential voltage swing is
III. TRANSMITTER ARCHITECTURE Fig. 2 shows the block diagram of the serial-link transmitter which utilizes two power supplies, a 1.2-V fixed AV DD, and a scalable DV DD. The local clock distribution, serialization muxes, and predriver buffers are powered from DV DD which is scaled with data rate in order to improve the transmitter power efficiency. While an external supply was used for DV DD in this design, an adaptive switching regulator [8] could efficiently generate this scalable supply. A fixed 1.2-V AV DD supply is used to supply sufficient voltage headroom for the voltagemode output-stage regulator, the current-mode equalizer stage, and the global impedance controller.
Two bits of parallel input data from on-die test circuitry capable of generating either a 2 15 − 1 pseudorandom binary sequence (PRBS) or 16-bit fixed data pattern serves as the input to the half-rate output stage. The output stage includes two sets of 2:1 muxes to implement a two-tap FIR equalization filter, with the top mux driving the main-cursor voltage-mode driver and the bottom mux driving the postcursor current-mode equalizer stage. In order to reduce power consumption for operation when equalization is not necessary, the data in the equalizer path are gated to disable the equalization serializer and any output equalization current.
In order to provide compatibility with low-swing global clock distribution present in LP multichannel link systems, an ac-coupled current mode logic (CML)-to-CMOS local clock distribution stage generates the serializer clocks. The transmitter utilizes inverter-based clock buffers with 4-bit digitally adjustable pMOS/nMOS size ratio in order to tune out errors in input duty cycle and clock distribution network mismatches, allowing the output duty cycle to be corrected to within ±1% over a data rate range of 2-6 Gbit/s.
After serialization with half-rate clocks, the main-cursor data signals drive the switches (M2 and M3) of an nMOS lowswing voltage-mode driver, while the delayed data signals drive the switches of a pMOS differential current-mode driver to implement the postcursor tap, which is shown in Fig. 3 . Here, equalization adjustment is possible with minimal overhead, with the tail current source of the current-mode stage having 4-bit binary control. A reference current switchable between 60 and 120 μA allows for the addition of a total equalization current of 0.9-1.8 mA into the output stage at 4-bit resolution. The equalization current is steered between the driver outputs by switching the pMOS output switches, which are sized to handle the maximum equalization current setting. This allows the use of a single nonsegmented predriver to switch the pMOS output switches, greatly simplifying the output driver predrive complexity relative to other voltage-mode drivers which include equalization taps [2] - [5] . Higher resolution is achievable with ideally no power overhead simply by increasing the tail-current DAC bits. While this design is intended for low-/medium-loss channels, and thus only implemented two taps, the scheme is easily extendable to higher tap values with additional parallel current drivers.
The driver pull-up impedance Z UP is set by the M2 top switches and an additional shared M1 transistor whose gate is controlled by V Z UP , while the pull-down impedance Z DN is set by the M3 bottom switches and an additional shared M4 transistor whose gate is controlled by V Z DN . A global impedance control loop allows for both the driver Z UP and Z DN impedances to be set near the channel impedance by utilizing a replica transmitter with dual feedback amplifiers that forces V Z UP to a value consistent with a high output level of
and sets V Z DN to a value consistent with a low output level of [2] , [6] , [7] , the method implemented in this work allows the predrive swing value (DV DD) to be decoupled from the impedance control, providing a degree of freedom to allow for potential predrive voltage scaling for improved energy efficiency. In order to reduce the gate capacitance of the switch transistors and save power, this design intentionally targets a 60-Ω single-ended output impedance. While not an exact channel match, this still provides a simulated low-frequency return loss of −22 dB, which meets the industry-standard return loss specifications [6] . Simulation results with backplane channels also indicate eye height degradation that is less than 3% relative to a 50-Ω design.
An on-chip linear voltage regulator sets the power supply of the voltage-mode driver to a value V REF , which is equal to the peak-to-peak differential output swing without equalization, and allows for an adjustable output swing from 100 to 400 mV ppd . The driver's low common-mode output voltage allows for the regulator to have a source-follower output stage, which offers improved supply-noise rejection relative to common-source output stages [6] . The low output impedance of the source follower allows for the use of a 40-pF decoupling capacitor to improve the power supply rejection ratio, while still maintaining stability.
IV. EXPERIMENTAL RESULTS
The transmitter was fabricated in a 90-nm LP CMOS process. As shown in the die photograph in Fig. 4 , the total transmitter active area is 250 μm × 140 μm. Fig. 5(a) shows the low-frequency output patterns with a peak output swing near 400 mV ppd and a maximum equaliza- tion value of 6 dB. The measured equalization settings match well with the linear value in decibels predicted by (2) , with a slope of 0.4 dB/code for a 400-mV ppd max swing and a 120-μA reference current setting [ Fig. 5(b) ]. In the hybrid driver, if the current-mode equalization settings are increased beyond 6 dB, the regulator is required to sink a portion of the equalization current. While this is not possible with the current regulator implementation, for LP serial-link transceivers which often also implement efficient receiver-side continuous-time linear equalizers, this level of transmit equalization is generally suitable for channels with 15-20 dB of loss at the Nyquist frequency. For increased equalization settings, the regulator output stage can be modified to sink a portion of the equalization current for equalization settings above 6 dB.
The transmitter transient performance at a maximum of 6-Gbit/s data rate is verified in the 2 15 − 1 PRBS eye diagrams with operation over a 3-in FR4 channel with 4-dB loss at 3 GHz, shown in Fig. 6 . By enabling the current-mode equalization, improvement is achieved in both eye height (127-163 mV) and eye width (106-115 ps). Testing with 3-GHz fixed clock patterns shows no significant degradation in output jitter with equalization enabled, 3.43-ps rms jitter without equalization, and 3.17-ps rms jitter with 6-dB equalization and the same 200-mV ppd output swing. With the addition of an subminiature version A (SMA) cable to the 3-in FR4 channel, the total channel loss increases to 6 dB at 2.4 GHz, and the performance with maximum equalization settings is verified in the 4.8-Gbit/s eye diagram in Fig. 7 . Again, improvement is achieved in both eye height (87-146 mV) and eye width (123-150 ps). Fig. 8(a) shows how Z UP and Z DN vary as the output swing without equalization V REF varies from 100 to 400 mV ppd . Relative to the 60-Ω target output impedance, Z UP and Z DN vary by maximum values of 7% and 10%, respectively. This is due to the driver output impedance increasing because of the reduced amplifier gain at the higher V Z UP and V Z DN output voltages required as the output swing increases. Fig. 7 eye diagrams, transmitter energy efficiency versus data rate for minimum channel outputs of 50-mV eye height and 0.6-UI eye width is shown with and without equalization. For data rates of 4 Gbit/s and lower, equalization is not required for the target eye opening, and a 1.11-pJ/bit optimal energy efficiency is achieved at 4 Gbit/s. Including equalization improves the overall eye margins and is necessary above 4 Gbit/s to achieve 0.6-UI eye width. Activating the equalization circuitry to achieve the target eye margins raises the energy efficiency by less than 0.2 pJ/bit up to 6 Gbit/s. Table II shows a measured power breakdown at different data rates and equalization conditions. For the 6-Gbit/s settings used in the eye diagram in Fig. 6, 1 .26-pJ/bit energy efficiency is achieved, with the largest power consumption from the 1.2-V DV DD supply. As the data rate is dropped to 2 Gbit/s, significant DV DD power savings are achieved by reducing the supply to 0.8 V. However, the total transmitter energy efficiency is dominated by the output-stage power, and 1.36 pJ/bit is achieved with 100-mV ppd output swing and no equalization. Table III shows a comparison of this design with other low-swing voltage-mode transmitters. Relative to the design in [7] which was implemented in a similar process, the presented design allows for higher data rate operation with the efficient inclusion of two-tap FIR equalization and four times the output swing. The efficiency of the equalization is evident by comparing this work with that in [2] , which implemented two-tap output equalization via a segmented resistor divider approach.
V. CONCLUSION
This brief has presented a hybrid voltage-mode transmitter with current-mode equalization, which enables independent control over termination impedance, equalization settings, and predriver supply. By controlling the equalization settings with a tail-current-source DAC in the parallel current-mode driver ,   TABLE II  TRANSMITTER PERFORMANCE SUMMARY   TABLE III  TRANSMITTER PERFORMANCE COMPARISONS segmentation is eliminated in the voltage-mode output stage, allowing for significant reduction in predriver complexity and power. Output impedance control is maintained in a manner that is compatible with supply scaling with additional series transistors in the voltage-mode output stage which are controlled by a global impedance control loop. These techniques allow for efficient transmit equalization over a wide range of data rates, supply voltages, and output swing levels.
