Comparator hysteresis compensation for decision feedback equalisers by Mattia, O. E. et al.
Comparator hysteresis compensation
for decision feedback equalisers
O.E. Mattia✉, D. Guermandi, G. Torfs and P. WambacqELECTHigh-speed comparators are extensively used in serial link receiver
designs. Some comparator architectures can show signiﬁcant hysteresis
that degrade the sensitivity of the receiver, increasing the bit error rate. In
this Letter, a comparator hysteresis compensation strategy that re-uses
the ﬁrst tap of a decision feedback equaliser to shift the comparator
input voltage, increasing the decision margin is proposed. An updated
equaliser coefﬁcient adaptation scheme is also introduced. The proposed
technique can be used for binary and multi-level modulations.Introduction: As serial link data rates are pushed into several tens of
Gbit/s, complex equalisation schemes become necessary at the receiver
side. A common equaliser is the decision feedback equaliser (DFE),
shown conceptually in Fig. 1a and schematically in Fig. 1b. The basic
idea of a DFE is that the comparator makes a decision that represents
an estimate of the data. The coefﬁcients on the feedback path form a
ﬁnite impulse response ﬁlter that emulates the channel impulse response
(CIR) at ITAPS, which is subtracted from the actual channel output IMAIN,
cancelling the inter-symbol interference (ISI). If the CIR estimate is
correct then VCOMP has no ISI.
gm
hN h2
CLKRX
CLK
a
b
IMAIN–
ITAPS+ITAPS–
MTAP,1
ITAP1
DATA<N:0>
SIGN<N:0>
1st TAP
IMAIN+
VCOMP
MAMP
IAMP
VIN
RL
RL
CLKRX
h1 a
m
pl
itu
de
time
+
–
+
–
Fig. 1 Traditional binary decision feedback equaliser
a concept and channel impulse response
b DFE schematics
The comparator’s propagation delay limits the maximum speed at
which the DFE loop can be closed [1]. Comparator hysteresis cancella-
tion schemes usually need a reset phase to erase the previous bit
decision, decreasing the available time for sensing and comparing.
This Letter instead proposes the use of the DFE ﬁrst tap coefﬁcient to
compensate for this hysteresis, as next.
Hysteresis compensation: The basic concept is illustrated in Fig. 2, and
consists in shifting the current analogue input signal based on the pre-
vious bit decision, maintaining the same ‘decision margin’.
In Fig. 2a, the decision margin presented to the ideal comparator is
equal to |VIN|. In Fig. 2 b, this margin has been degraded by the hyster-
esis, and has now become | + VIN − VLH| and | − VIN + VHL|, being
VHL and VLH the high-to-low and low-to-high thresholds, respectively.
In Fig. 2c, the hysteresis is compensated by the ﬁrst DFE tap, shifting
the input levels by VHYST = (VLH + VHL)/2 based on the previous bit
decision to increase the next bit decision margin, as given by
VCOMP[k] = RL(IMAIN[k]− DATA0[k − 1]ITAP1), where k represents
data samples spaced by CLKRX . This results in a higher decision
margin for VCOMP equal to | + VIN + VHYST − VLH| and
| − VIN − VHYST + VHL| for the cases where the incoming data sequence
is 01 and 10, respectively. A higher decision margin has a further advan-
tage of decreasing the comparator delay time.RONICS LETTERS 13th December 2018 Vol.CLKRX
CLKRXCLKRX
CLKRX
+VIN + VHYST 
–VIN + VHYST 
+VIN – VHYST 
–VIN – VHYST 
+VIN+VIN
–VIN–VIN
VLH
VLH
VHL
VHL
+VIN
+VIN
–VIN
–VIN0
0
0
0 tCLK tCLK
tCLK
0 tCLK
a b
c
Fig. 2 Hysteresis compensation by shifting the input voltage to maintain the
same decision margin
a No hysteresis
b With hysteresis
c First tap compensated
Coefﬁcient adaptation: In practice, the DFE taps must be continuously
adapted to changes in the environment that result in variations of the
CIR. A common way to implement such adaptation consists of an auxili-
ary comparator CAUX with threshold voltage VIN, that monitors the input
eye diagram and correlates its output with the data using a Sign-Sign LMS
algorithm [2], as shown in Fig. 3a. Suppose the DFE is adapting to a CIR
that contains only one tap of post-cursor ISI k1. According to the pre-
viously described hysteresis compensation scheme, the correct value for
the tap coefﬁcient should be overestimated to h1 = k1+ VHYST.
However, since the threshold ofCAUX is set to VIN, the DFE will converge
to the tap value that minimises the error around VIN, resulting in the wrong
value of h1 = k1 and the equalised eye diagram of Fig. 3b.
CAUX
CAUX
C0
C0
CLKRX
CLKRX
CLKRX
CLKRX
CLKRX
CLKRX
tCLKtCLK
VIN
+VIN
–VIN
+VIN + VLH
+VIN – VHL
–VIN + VLH
–VIN – VHL
VIN
VHL VLH
+
– LMS
engine
h1 = k1
LMS
engine
h1 = k1 + VHYST
00
a c
b d
0 0
Fig. 3 DFE adaptation for hysteresis compensation
a Traditional auxiliary comparator with ﬁxed threshold
b Resulting eye-diagram
c Proposed auxiliary comparator with variable threshold
d Resulting eye-diagram with hysteresis compensation
Instead, we propose the use of a novel adaptive threshold auxiliary
comparator, as shown in Fig. 3c. The main comparator C0 output is
used to shift the adaptive comparator’s threshold to that of
VIN+ VHL/VLH, resulting in the equalised eye diagram of Fig. 3d and
thus compensating for the hysteresis of comparator C0, as described pre-
viously at Fig. 2c. This scheme assumes that the auxiliary comparator
CAUX does not have hysteresis. Since the adaptation engine can
operate at a fraction of the speed of the data path, the addition of a
reset phase is less impactful here.
Simulation results: The proposed hysteresis compensation and tap
adaptation scheme was implemented in a behavioural model using
Matlab, both for NRZ and PAM4 signal modulations. A 5-tap CIR
was assumed, to be compensated by a 5-tap DFE. Shown in Fig. 4a is54 No. 25 pp. 1421–1422
the actual and estimated CIR, while Fig. 4b shows the coefﬁcient evol-
ution over time for the NRZ case. The PAM4 case produces similar
results. Note that the ﬁrst tap is adapted to a higher value than the
CIR to compensate for a normalised hysteresis of 0.15.
0.1
real CIR
estimated CIR
0.8
0.6
0.4
a
m
pl
itu
de
0.2
–0.2 –0.2
–0.1
0
0.1
0.2
0.3
0.1
0.5
0.6
0 0 1000 2000 3000
tap 1
tap 2
tap 3
tap 4
tap 5
2
sample
a b
sample
4 6
0
Fig. 4 DFE adaptation results
a real and estimated CIR
b coefﬁcient evolution over time
Fig. 5 shows the comparator input voltage during DFE adaptation, for
the (a) NRZ and (b) PAM4 cases. Note how initially there is no visible
eye margin, while after about 500 samples the input of the comparator
converges to the two eye levels described in Fig. 2. In the PAM4 case
each DFE tap produces two adittional levels due to the previous data
extra possibilities, meaning the hysteresis is fully for half of the data
bits, and only partially compensated for in the other half.
1.5
comparator input
1.0
0.5
0
–0.5
–1.0
–1.5
0 1000
sample
a
m
pl
itu
de
a
m
pl
itu
de
a b
2000 3000
1.5
comparator input
1.0
0.5
0
–0.5
–1.0
–1.5
0 1000
sample
2000 3000 4000 5000
Fig. 5 Comparator input during DFE adaptation, with hysteresis
compensation
a NRZ
b 4PAM
12
10
8
6
o
cc
u
re
n
ce
s
o
cc
u
re
n
ce
s
4
2
0
20
0
10
20
30
40
50
60
9.5 10.0 10.5
hysteresis, mV
11.0 11.5
30 40 50
threshold voltage, mV
a
b
60 70 80
VLH
VHL
Fig. 6 Variability results for 100 Monte Carlo runs
a comparators’ threshold voltage
b hysteresisELECTRONICS LETTERS 13th DExperimental results: A hardware implementation of the proposed tech-
nique has been demonstrated in a 18/36 Gbit/s QPSK/16-QAM 5-tap
DFE fabricated in 28 nm CMOS for mmWave wireless communication.
After down-conversion on a homodyne RX, the signal is presented as
two I/Q components each containing a NRZ/4-PAM signal [3]. In this
design each comparator had a hysteresis of about 10% of the
maximum eye level, due to kickback of the CML slave latch and there-
fore independent of frequency. Shown in Fig. 6 is 100 Monte Carlo
simulation runs for both average global effects and local mismatch.
Note that even though the offset has a standard deviation of 10 mV
the hysteresis is practically insensitive to fabrication variations.
An on-chip PRBS9 generator and checker was used to measure the
BER bathtub curves. Shown in Fig. 7 are the experimental results for
QPSK and 16QAM signals at the maximum 9 GHz clock frequency.
18 Gbit/s QPSK BER vs. UI
10–1
10–3
10–5
10–7
10–9
0 20
BE
R
BE
R
40
no errors
for 109 bits
sampling instant, ps
a b
60
95 ps
(0.86 UI)
80 100
10–1
10–3
10–5
10–7
10–9
0 20 40
no errors
for 109 bits
sampling instant, ps
60
35 ps
(0.32 UI)
80 100
uncomp offset comp offset & hyst comp
36 Gbit/s 16 QAK BER vs. UI
Fig. 7Measurement results for 18 Gbps GPSK and 36 Gbps 16 QAM DFE in
28 nm CMOS
After the comparators’ offset has been manually tuned the ﬁrst tap of
the DFE was used to compensate for the expected 10 mV hysteresis. The
horizontal eye opening is increased for both cases, demonstrating the
effectiveness of the technique using the ﬁrst DFE tap. Only offset com-
pensation was not enough to receive error-free LSBs on the 16QAM
mode due to limited bandwidth of the PRBS generator at the
maximum speed. The same measurement was repeated with the same
ﬁrst tap setting but at lower clock speeds of 4 and 1 GHz, to avoid
this bandwidth limitation, and including different channel CIRs [3].
Conclusion: This work presented a novel comparator hysteresis com-
pensation scheme for DFE-based serial-link receivers, that allows for
a higher receiver sensitivity and lower BER. A novel DFE coefﬁcient
adaptation scheme is also introduced to account for the comparator hys-
teresis. It is demonstrated for NRZ and PAM4 modulations by using a
behavioural model and a hardware implementation in 28 nm CMOS.
© The Institution of Engineering and Technology 2018
Submitted: 30 July 2018 E-ﬁrst: 8 November 2018
doi: 10.1049/el.2018.6485
One or more of the Figures in this Letter are available in colour online.
O.E. Mattia, D. Guermandi and P. Wambacq (IoT Unit, IMEC,
Kapeldreef 75, Leuven, Belgium)
✉ E-mail: oscar.elisio.mattia@imec.be
G. Torfs (IDLab, Department of Information Technology, Ghent
University – IMEC, Gent, Belgium)
O.E. Mattia and P. Wambacq: Also with ETRO Department, Vrije
Universiteit Brussel, Brussels, Belgium
References
1 He, A., Gai, W., and Tang, L.: ‘40 Gbps 4-level pulse amplitude
modulation closed-loop decision-feedback equaliser with high-speed
comparator in 55 nm CMOS technology’, Electron. Lett., 2018, 54,
(12), pp. 741–742, doi: 10.1049/el.2018.1112
2 Stojanovic,́ V., Ho, A., Garlepp, B.W., et al.: ‘Autonomous dual-mode
(PAM2/4) serial link transceiver with adaptive equalization and
data recovery’, J. Solid-State Circuits, 2005, 40, pp. 1012–1025,
doi: 10.1109/JSSC.2004.842863
3 Mattia, O.E., Guermandi, D., Torfs, G., et al.: ‘An up to 36 Gbps 5-tap
16QAM DFE for mmWave wireless communication in 28 nm CMOS’.
IEEE Custom Integrated Circuits Conf. (CICC), Austin, TX, USA,
April 2017, pp. 1–4, doi: 10.1109/CICC.2017.7993666ecember 2018 Vol. 54 No. 25 pp. 1421–1422
