ABSTRACT
INTRODUCTION
For the burst-mode applications such as passive optical network (PON) [1, 2] , the CDR circuit in the receiver must lock to each burst data packet very quickly. In a PON system, the optical line termination (OLT) receives data packets from various optical network units (ONUs) and these data packets have different phase and amplitude, so immediate clock extraction and data retiming in the CDR circuit is necessary.
The conventional PLL-based CDR circuit requires a long sequence of bits to settle, hence is not suitable for burst-mode applications. Fast locking capability of open-loop CDR circuits [3] - [8] makes them attractive solutions for these applications. However, open loop CDRs suffer from limited frequency tracking range and high jitter transfer [9] . Jitter transfer is not a serious issue for applications where no repeater is required [3] , hence, open-loop CDRs are appropriate for such applications.
In open-loop CDR circuits, a gated voltage-controlled oscillator (GVCO) [3] - [5] or an injectionlocked oscillator (ILO) [6] - [8] can usually be used. GVCOs have higher phase noise than LC VCOs because of their ring structure. So, using GVCO, in general, leads the recovered clock showing larger jitter than using ILO. Also the operation speed of GVCOs is lower than ILOs [5] . An injection-locked oscillator uses an LC type VCO which has better phase noise and higher operation speed in comparison with ring oscillator.
A CDR circuit using injection-locking technique is described in [6] , in which using two cascaded voltage-controlled LC oscillators as a full-rate CDR forces the use of an edge detector circuit. Both VCOs are injection-locked to the input data and the edge detector design needs high speed current mode logic (CML) XOR and delay buffer circuits. Also the reference PLL which is used to overcome the process, supply voltage, and temperature (PVT) variations consists of a VCO which must be an exact duplicate of the main VCOs, otherwise the PLL provides an inaccurate control voltage which is not suitable to use for main VCOs. In this paper, a high speed injectionlocked CDR is proposed which has several advantages over current designs. Utilizing a novel injection-locked oscillator structure and using an appropriate injection method in the proposed CDR circuit leads to higher speed operation by eliminating the edge detector circuit and by sampling the data with a half rate clock. In addition, a new approach instead of a reference PLL is proposed to accommodate PVT variations which forces the free-running frequency of the ILO to track the data rate by adding or removing some of the utilized capacitors to the ILO structure. A 20 Gb/s open-loop CDR circuit is designed and simulated in 0.18 µm CMOS technology for burst-mode applications, which consumes less than 55.3 mW from a 1.8 V supply.
The rest of the paper is organized as follows. Section II describes the CDR topology and architecture. Section III presents the building blocks utilized in the circuit. Section IV describes the simulations and summarizes the results, and finally section V concludes the paper. Figure. 1 shows the proposed CDR architecture. The circuit is composed of an injection-locked LC oscillator, a clock buffer and two retiming flip flops. In addition, to track the frequency of oscillator a divider chain, a frequency detector (FD), and a counter are used in this structure. Input data is injected to the ILO to provide an aligned clock. The recovered clock is applied to a clock buffer which in turn drives two flip flops in the decision circuit to recover the data.
THE PROPOSED CDR
As the injection locking technique has narrow locking range, the ILO may not lock to the frequency of the incoming data stream if it is not properly controlled. Furthermore, the PVT variations may also cause ILO to lose lock. In the proposed architecture a frequency locked loop is used to compensate for these problems. The clock frequency is divided by 40 and then applied to a frequency detector as shown in Fig. 1 . In the frequency detector, this signal is compared to a reference frequency of 250 MHz. The frequency detector produces an output based on whether f ref > f divider or not. This signal is then applied to a counter as to determine up or down counting. The frequency detector circuit also produces another signal whose frequency is dependent on the difference between f ref and f divider . This signal is applied to the counter as its clock and it determines the speed of counting. Thus, counter can count up or down depending on the frequency of ILO in comparison to the reference frequency and then add some capacitors to ILO or remove them from it. The counter does this through the control of a few switches which are connected to the capacitors. By using this method the lock range of the circuit is increased considerably. 
Frequency tracking
As locking range of the ILO is narrow, a frequency locked loop (FLL) is required to set the oscillator frequency in the locking range. We propose a method to track the oscillator frequency and bring it within the appropriate range. The proposed FLL uses a divider chain, a frequency detector and an up/down counter. The output frequency of the oscillator is divided by 40 and then is applied to a frequency detector. The frequency detector circuit is shown in Figure. 2 which is a rotational frequency detector circuit [10] . It compares the divider output frequency with a reference frequency of 250 MHz and produces an up/down signal and a clock for the counter. In this circuit the ILO output clock which is composed of CK i and CK q , are sampled by the reference clock, producing two outputs Q 1 and Q 2 . Leading or lagging of Q 1 from Q 2 shows the polarity of the frequency error ( fig. 2(b) ). The FD's output, Q 3 , is applied to the counter and forces the counter to count up or down depending on the frequency error's polarity. Fig. 2(c) shows the states of Q 1 and Q 2 , where the sign of beat frequency is indicated by the rotating direction. Fig. 3 shows the designed super harmonic injection-locked LC oscillator which is used as a half rate CDR circuit in this paper. In this structure the data is injected into the ILO directly, instead of applying it to an edge detector. The M1 and M2 create an incident current at the rising and falling edge of the data, respectively. As a result, at every edge of the data the differential output of the oscillator will be shorted. The main advantages of this structure are low power consumption and the reduction of the maximum operational frequency to half of the other state of the art structures in the literature.
BUILDING BLOCKS

ILO
As mentioned before, capacitors with switches are incorporated in the ILO to overcome PVT variations. By switching these capacitors in or out of the ILO structure, the ILO's output frequency is brought to its locking range. So, the ILO can lock to the injected input data. Since the lock range for this ILO is around 150MHz, the frequency shift in ILO's output frequency for each added or removed capacitor is designed to be 100 MHZ. The capacitor values can be calculated from the following equation.
Where f out is the ILO output frequency, f 0 is the ILO free-running frequency, C eq is the total capacitance, C 0 is parasitic capacitance of ILO, and C ∆ is the capacitance value change for each step of counting. For example, when the counter has the value of three, the transistors q 0 , q 1 are on and the capacitors C 1 , C 2 are connected to the ILO. When the counter counts up and changes to four, the transistors q 0 , and q 1 turn off and q 2 turns on hence, the capacitors C 3 are connected to the ILO. So, for this step of counting ∆C=C 3 -(C 1 +C 2 ). The values of C 1 , C 2 , C 3 , C 4 are chosen such that the change in the value of capacitors connected to the ILO for each step of counting, ∆C, to be constant.
It is clear that the capacitors values can be calculated easily for a certain frequency shift and a given free-running frequency of ILO.
Clock buffer
In order to isolate the capacitive loading of the next stage on the ILO, we use the clock buffer which is shown in Fig. 4 . The bandwidth limitation in conventional CML buffers with resistive loads leads to limited output voltage swings. In order to increase signal swing and to solve the bandwidth limitation problem, we can use the inductive peaking technique. We choose the inductor and resistor values to adjust the buffer delay such that the phase difference between the input data and the clock to be equal to 90 ̊ and, as a result the clock edges align with the center of the input data eye.
Figure. 
Flipflop
We can classify flipflops into two categories: static and dynamic. Although the static latches can operate in higher speeds, they consume more power in contrast to their dynamic counterparts. We use the static flipflops only in situations that we need the fast operation, such as decision circuits and first stages of divider. The dynamic flipflops are used in all other situations, such as next stages of the divider and frequency detector circuit, to reduce the power consumption. The details of these two types of flipflops are described as follows.
Static Flipflop
The CML topology is used in our design because it can speed up the operation to some extent. The CML circuit operation is fast, because they have lower output voltage swing compared to the static CMOS circuits. We also use inductive peaking technique similar to [11, 12] which improves the speed of CML circuits by extending the bandwidth. Fig.5 shows the designed CML latch circuit. 
Dynamic Flipflop
As mentioned before, the dynamic flipflop shown in Fig. 6 is used to reduce the power consumption in situations where we do not need fast operation speed [13] . We have also depicted a dynamic divider in Fig. 7 which is used in the next stages of the divider chain. This dynamic divider consumes less power than CML ones. This structure is made of two consecutive D flipflops which are dynamic latches that sample the data in every rising edge of the clock. 
Counter circuit
For turning the switches, used in the injection locked oscillator, on and off, a 4-bit up/down counter is utilized which is shown in Fig. 8 [14] This counter receives its clock and control inputs, which determines the up or down counting state, from the frequency detector circuit. 
SIMULATION RESULTS
The proposed circuit has been designed and simulated in a 0.18 µm CMOS technology. This circuit consumes 55.3mW from a 1.8V supply voltage, where the most of it is consumed in the CML circuits. The input data is shown in Fig. 9 and the recovered data in response to a 2 7 -1 continuous mode PRBS is shown in Fig. 10 . The simulated jitter values for the recovered data are 6 ps, pp and 1.3 ps, rms. The recovered clock is shown in Fig. 11 . The simulated peak-to-peak and rms jitters for the recovered clock are 4.8 ps and 1.1ps, respectively. The ILO locking range of 150MHz is achieved. By using the frequency locked loop, the locking range of the CDR circuit is extended to 1.5 GHz. The CDR circuit recovers the data in less than 50 ps which is equal to 1 bit. Fig. 12 shows this fast locking time. Figure. 12 Simulated input and output waveforms Fig.13 depicts the FD outputs, and it shows that when the ILO locks to the input data the FD does not create any clock pulses for the counter. It means that after lock, the CLK becomes constant, hence the counter does not count anymore and it shows that the CDR circuit is locked to the input data. 
CONCLUSIONS
This work proposes a 20 Gb/s CDR circuit for burst mode applications which is designed and simulated in a 0.18µm CMOS process. The CDR employs a super harmonic injection-locked oscillator, hence frequency of the oscillator is reduced to the half of the bit rate and two flip flops are needed for sampling the data with the half rate clock. With the help of the proposed frequency tracking technique, the circuit accommodates frequency deviations caused by PVT variations. In fact, the simulation results demonstrate that the CDR circuit provides a truly wide operation range (1.5GHz) with reasonably low power consumption. 
