Abstract: This paper presents the design and implementation of a new digital-to-time converter (DTC). The obtained resolution is 1.02 ps, and the dynamic range is about 590 ns. The experimental results indicate that the measured differential nonlinearity (DNL) and integral nonlinearity (INL) are −0.17~+0.13 LSB and −0.35~+0.62 LSB, respectively. The DTC uses coarse and fine Vernier delay lines built from programmable delay lines (PDLs) to ensure a high-performance delay. Thanks to the closed-loop feedback mechanism of the PDLs' control module, the presented DTC has excellent voltage and temperature stability. Moreover, the proposed DTC can be implemented in a single field programmable gate array (FPGA) chip.
Introduction
A digital-to-time converter (DTC) is similar to a digital-to-analog converter (DAC), except that a DTC converts digital values to time interval signals. DTCs are widely used in atomic frequency standards, high-precision positioning [1, 2], time-correlated single-photon counting (TCSPC) instruments, and measurement instruments [3] [4] [5] [6] such as the very large scale integration (VLSI) functional tester [7].
The counting method is a widely used DTC method, which is realized by counting periods of a reference clock. It is simple to implement and offers a large dynamic range. However, its resolution is limited to several nanoseconds, which is determined by the frequency of the reference clock. Taking advantage of Application Specific Integrated Circuit (ASIC) technology, DTCs implemented in ASIC devices (ASIC-based DTCs) have been proposed [1] [2] [3] [4] [7]. The best resolution of an ASIC-based DTC is better than 1 ps [3], which is realized by combining the ASIC technique and the capacitor charging principle. However, these DTCs are sensitive to variations in temperature and voltage owing to the use of analog devices. Moreover, DTCs implemented in ASIC devices are high in cost and have a long time to market.
With the development of commercial integrated circuits, DTCs implemented in FPGA (field programmable gate array) devices (FPGA-based DTCs) have attracted an increasing amount of research interest in recent years. Compared with ASIC-based DTCs, FPGA-based DTCs have the advantages of a low cost and a short time to market. Many DTCs implemented in the FPGA are realized by phase shifting [6, 8, 9], and the obtained resolution is about 8 ps. FPGA-based DTCs are mostly based on a Vernier delay line, a tapped delay line [10], and a Vernier ring oscillator [5]. A DTC with a resolution of 11 ps was implemented in a Xilinx Kintex-7 FPGA device based on a tapped delay line [10]. A DTC based on a Vernier ring oscillator and the Altera Stratix-III FPGA device was proposed [5], whose theoretical resolution was 1.58 ps.
DTCs with high resolution are mainly implemented in ASIC devices, which have the advantages of fully customized circuits and reasonably precise control of the internal propagation delay. However, it is difficult for FPGA-based DTCs to obtain high resolutions due to the additional delay introduced by the unpredictability of the routing strategy. Therefore, FPGA-based DTCs with a 1 ps resolution are very valuable for practical applications, but also very challenging for research.
In this paper, a novel realization of a DTC based on an FPGA device is proposed and tested. The obtained resolution is 1.02 ps, and the dynamic range is 590 ns. The proposed DTC benefits from the coarse and fine Vernier delay lines constructed from programmable delay lines (PDLs), which ensure a high-performance delay. Meanwhile, the control module of the PDL adopts a programmable closed-loop feedback mechanism to adjust the tap delay of the delay lines, which improves the nonlinearity and stability of the DTC. Compared with the normal DTCs implemented in FPGA/ASIC devices, the proposed DTC achieves a higher resolution and improved nonlinearity and stability.
The remainder of the paper is organized as follows. Section 2 describes the principle and structure of the proposed DTC. Section 3 presents the realization of the DTC system. Section 4 shows the experimental results, and Section 5 summarizes the paper.
Principle of the Proposed DTC

Principle of the Vernier Delay Line
The Vernier delay line is widely used in time-to-digital converters (TDCs) [11]. The schematic of a TDC based on a Vernier delay line is shown in Figure 1. In the figure, the time interval signals (Start and Stop) are delayed by two delay lines, respectively. The Start signal is delayed by a delay line with a cell delay of τ1, and the Stop signal is delayed by another delay line with a cell delay of τ2 (τ1 < τ2). When edge coincidence is detected, the measurement result can be calculated from Equations (1) and (2).
Appl. Sci. 2017, 7, 52
Figure 1. Schematic of a time-to-digital converter (TDC) based on a Vernier delay line.
The principle of the Vernier delay line can also be applied to DTCs, which generate time interval signals (Start and Stop). The schematic of a DTC based on a Vernier delay line is shown in Figure 2. Delay Line A and Delay Line B are used, with cell delays of τA and τB, respectively. M is loaded into the counters as an input number, which is calculated according to the required time interval. The output of the M-th delay tap in Delay Line A is the Start signal, and the output of the N-th delay tap in Delay Line B is the Stop signal. Thus, the generated time interval can be calculated from Equation (3).
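As a rough illustration of this principle, the sketch below computes the interval between the Stop and Start edges when both delay lines are driven by the same input edge. Equation (3) is not reproduced in this text, so the expression `n * tau_b - m * tau_a` is an assumption derived from the tap description; the function name and the cell delays (77 ps and 78 ps) are hypothetical.

```python
def vernier_interval_ps(m, n, tau_a_ps, tau_b_ps):
    """Interval between Stop (N-th tap of line B) and Start (M-th tap of line A).

    Assumes both lines are triggered by the same edge (an assumption, not
    stated in the text), so each output is delayed by taps * cell delay.
    """
    if m < 0 or n < 0:
        raise ValueError("tap indices must be non-negative")
    start_ps = m * tau_a_ps  # arrival time of the Start edge
    stop_ps = n * tau_b_ps   # arrival time of the Stop edge
    return stop_ps - start_ps


# Hypothetical cell delays, chosen only to make the arithmetic easy to follow.
TAU_A_PS = 77
TAU_B_PS = 78

# Equal tap counts step the interval by (tau_b - tau_a) = 1 ps per tap.
fine_steps = [vernier_interval_ps(k, k, TAU_A_PS, TAU_B_PS) for k in range(4)]
print(fine_steps)  # [0, 1, 2, 3]
```

With equal tap counts on both lines the step size is the difference of the two cell delays, which is why the achievable resolution can be much finer than either cell delay alone.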
Vernier Delay Line Realized by PDL
Herein, we use the programmable delay line (PDL) to realize the Vernier delay line. Figure 3a shows the schematic diagram of a PDL, which is a fully controllable, voltage-controlled, wrap-around delay line with a calibrated tap resolution. The control module is used to quantize a period of the reference clock and to stabilize the time delay of the delay taps. If there are N taps in the delay line, the time delay of a single tap can be obtained from Equation (4). A control module can calibrate all the PDL modules within its clock region.
The delay taps in the delay line are voltage-controlled delay taps. In the control module, a phase detector (PD) is used to compare the output phase of the delay line with the reference clock. If the phase difference is not equal to half a period of the reference clock, the output of the phase detector changes the control voltage of the voltage-controlled oscillator (VCO), and thus the time delay of the delay taps, until the phase difference reaches that value. The closed-loop feedback mechanism reduces the influence of voltage and temperature variations on the delay line. If variations of the voltage and the temperature lead to changes in the time delay of the delay taps, a phase difference will be detected, and the closed-loop structure will compensate for the changes.
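The compensation behavior described above can be illustrated with a toy model. The sketch below is not the control module's actual circuit: it assumes that the tap delay scales linearly with a drift factor (standing in for voltage and temperature effects) and with a control factor (standing in for the control voltage), and it uses a simple integrating update. The function name, the gain, and the numbers (32 taps locked to 2500 ps) are illustrative assumptions.

```python
def lock_tap_delay(target_total_ps, n_taps, base_tap_ps, drift, gain=0.5, steps=60):
    """Iteratively trim a control factor until n_taps * tap delay hits the target.

    Toy model (assumption): tap delay = base_tap_ps * drift * control, where
    `drift` stands in for voltage/temperature effects and `control` for the
    effect of the control voltage.
    """
    control = 1.0
    for _ in range(steps):
        total_ps = n_taps * base_tap_ps * drift * control
        error_ps = target_total_ps - total_ps  # plays the role of the phase detector
        control += gain * error_ps / (n_taps * base_tap_ps)  # integrate the error
    return base_tap_ps * drift * control


# Illustrative numbers only: 32 taps locked to 2500 ps (half of a 5 ns period).
for drift in (0.9, 1.0, 1.1):
    tap_ps = lock_tap_delay(2500.0, 32, 78.125, drift)
    print(f"drift={drift:.1f} -> locked tap delay {tap_ps:.3f} ps")
```

In this model the locked tap delay depends only on the reference period and the number of taps, not on the drift factor, which is the property the closed-loop structure is meant to provide.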
The time delay of the delay taps in a PDL is determined by the frequency of the reference clock. For an N-tap delay line, the time delay of a tap is τA when the frequency of the reference clock is fA, and τB when the frequency of the reference clock is fB. Therefore, the resolution of a Vernier delay line built from PDLs can be calculated from Equation (5). The schematic diagram of a Vernier delay line built from PDLs is shown in Figure 3b. The delay of such a Vernier delay line can be programmed in two ways. First, the tap delay of the PDLs can be programmed by changing the frequency of REFCLK A and REFCLK B. Second, the number of working delay taps can be specified by TAPCNT, which is a 5-bit value (0~31). Because of this programmability, the delay line is called a programmable delay line.
DTC Based on a Programmable Delay Line
This paper uses a Vernier delay line based on PDLs to realize the DTC. In order to increase the dynamic range, a modified structure for the DTC is proposed, as shown in Figure 5. In the modified design, PDL A0~PDL A236 and PDL B0~PDL B236 act as the first-stage delay line (coarse delay line), and PDL A237~PDL A239 and PDL B237~PDL B239 act as the second-stage delay line (fine delay line). In the first-stage delay line, the number of working delay taps of PDL A is set to 0, and the number of working delay taps of PDL B is set according to Equation (6). In the second stage, PDL A and PDL B use the same number of delay taps, which is represented by CNTB. For a given time interval Tx, CNTA and CNTB can be calculated from the equations below. In the modified design, the frequencies of CLKA and CLKB are still 202 MHz and 200 MHz, respectively. The effective range of the DTC is enlarged to 590 ns.
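One way to picture the coarse and fine split is the sketch below. The equations for CNTA and CNTB are not reproduced in this text, so the model used here is an assumption: the coarse stage advances the interval by one PDL B tap delay per tap (PDL A taps being 0), and the fine stage advances it by the Vernier resolution τB − τA per tap. The names `coarse` and `fine` stand in for the roles the text assigns to CNTA and CNTB, and the step sizes (78 ps and 1 ps), the limit of 96 fine taps, and the optional fixed offset are hypothetical.

```python
def split_interval(t_x_ps, coarse_step_ps, fine_step_ps, max_fine_taps, offset_ps=0.0):
    """Split a target interval into coarse and fine tap counts.

    Assumed model: t_x = offset + coarse * coarse_step + fine * fine_step,
    where coarse_step is the PDL B tap delay (PDL A taps are 0 in the coarse
    stage) and fine_step is the Vernier resolution tau_b - tau_a.
    """
    remaining_ps = t_x_ps - offset_ps
    if remaining_ps < 0:
        raise ValueError("target is smaller than the fixed offset")
    coarse = int(remaining_ps // coarse_step_ps)
    fine = round((remaining_ps - coarse * coarse_step_ps) / fine_step_ps)
    if fine > max_fine_taps:
        raise ValueError("fine stage cannot cover the residual")
    return coarse, fine


def realized_interval(coarse, fine, coarse_step_ps, fine_step_ps, offset_ps=0.0):
    """Interval produced by the given tap counts under the same assumed model."""
    return offset_ps + coarse * coarse_step_ps + fine * fine_step_ps


# Hypothetical step sizes: 78 ps coarse, 1 ps fine, 96 fine taps available.
coarse, fine = split_interval(5000.0, 78.0, 1.0, 96)
print(coarse, fine)                                # 64 8
print(realized_interval(coarse, fine, 78.0, 1.0))  # 5000.0
```

For the split to work for every target, the fine stage has to span at least one coarse step, which the hypothetical numbers above satisfy (96 fine taps of 1 ps against a 78 ps coarse step).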
Existing DTCs with a large dynamic range are mostly realized by combining coarse and fine methods. The coarse method is commonly realized by counting the periods of a clock signal [3, 9, 10], which is very simple to implement. However, the counting circuit may introduce additional errors into the circuit and decrease the conversion accuracy, especially the error caused by the trigger of the counter. This paper uses a coarse and a fine delay line to increase the dynamic range. The coarse delay line and the fine delay line are the same, except that, in the coarse delay line, the number of working delay taps of PDL A0~PDL A236 is set to 0. In this way, the dynamic range is increased without decreasing the conversion accuracy.
Circuit Implementation
The DTC shown in Figure 5 is implemented in the Xilinx Virtex-6 FPGA chip. Part of the place and route of the Vernier delay line constructed from PDLs is shown in Figure 6. In the figure, the delay line on the left is the Start delay line, and the delay line on the right is the Stop delay line. The cyan lines are the propagation paths between the PDLs. In the figure, 1.459 ns is the path delay between PDL A68 and PDL A69, and 1.446 ns is the path delay between PDL B68 and PDL B69, which corresponds with [...]
One of the most important delays that may affect the performance of the DTC is the delay of the propagation paths between the PDLs. The time delay of the propagation paths (ta and tb in Figure 5) is shown in Figure 7a, and the data are obtained from the FPGA Editor (a design tool of the Xilinx ISE design suite). As the figure shows, most of the propagation paths on the Start delay line and the Stop delay line are equal and cancel each other. The largest delay difference is 31 ps, as shown in Figure 7b. The integral delay difference between the Start delay line and the Stop delay line is 202 ps. The measurement error introduced by the propagation paths is a fixed error and can be compensated.
Another delay is the intrinsic delay of the PDL, which is about 400 ps [12, 13]. The intrinsic delay is the time delay of a PDL when the number of working delay taps is set to 0. For a delay line with 240 PDLs, the total intrinsic delay is about 100 ns (240 × 400 ps = 96 ns). However, the intrinsic delay is identical on both lines. In the design shown in Figure 5, the time interval signals (Start and Stop) are generated at the last PDL of the delay lines, so no multiplexers are needed to collect and select the output signal. In this way, the measurement error caused by the intrinsic delay of the PDLs can be offset completely.
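The cancellation argument can be checked with a small calculation. The sketch below sums the routing and intrinsic delays on both lines with all working taps set to 0: the intrinsic term is the same on both sides and drops out, leaving only the routing mismatch as a fixed offset. Only the first pair of path delays (1459 ps and 1446 ps) and the 400 ps intrinsic delay come from the text; the other values and the function name are hypothetical.

```python
def fixed_offset_ps(start_paths_ps, stop_paths_ps, intrinsic_ps=400):
    """Fixed Stop-minus-Start offset with all working taps set to 0.

    Each PDL contributes the same intrinsic delay on both lines, so that term
    cancels; only the routing-path mismatch is left.
    """
    if len(start_paths_ps) != len(stop_paths_ps):
        raise ValueError("both lines need the same number of hops")
    n_pdl = len(start_paths_ps) + 1  # hops sit between consecutive PDLs
    start_total = sum(start_paths_ps) + n_pdl * intrinsic_ps
    stop_total = sum(stop_paths_ps) + n_pdl * intrinsic_ps
    return stop_total - start_total


# Hypothetical per-hop routing delays in ps (the text quotes 1459 ps and
# 1446 ps for one hop; the remaining values are made up).
start_paths = [1459, 1440, 1451]
stop_paths = [1446, 1462, 1460]

offset = fixed_offset_ps(start_paths, stop_paths)
worst_hop = max(abs(b - a) for a, b in zip(start_paths, stop_paths))
print(offset, worst_hop)  # 18 22
```

Because the offset does not depend on the programmed tap counts, it can be measured once and subtracted from the requested interval.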
Experiments
Setup of Experiments
For function verification and performance evaluation, the Xilinx Virtex-6 FPGA chip is adopted for circuit realization. The evaluation board ML605 (designed by Xilinx Inc., San Jose, CA, USA) is used. The reference clock of the board is a crystal oscillator with a frequency of 200 MHz. CLKA and CLKB are generated by the Mixed-Mode Clock Managers (MMCMs), which multiply or fractionally divide the reference clock. CLKA and CLKB are connected to global clock buffers, which drive dedicated interconnect networks specifically designed to reach all clock inputs of the various resources inside an FPGA chip. These networks feature low skew, low duty cycle distortion, low power, and improved jitter tolerance.
The DTC evaluation board communicates with the PC via a PCI Express (Peripheral Component Interconnect Express) interface. The generated time interval signal is output through an SMA (SubMiniature version A) connector, which has the advantage of low noise. The test system of the DTC is shown in Figure 8. The evaluation board is powered by a RIGOL DP832A DC power supply (RIGOL Technologies Inc., Beijing, China), and a digital oscilloscope (Keysight DSAZ592A, Keysight Technologies, Santa Rosa, CA, USA) is employed to measure the generated time interval. Additionally, a thermal chamber (ESPEC SH642, designed by ESPEC Corp., Osaka, Japan) is used to evaluate the temperature stability of the DTC.
Analysis of Clock Jitter and Phase Noise
In this paper, CLKA and CLKB are generated by the MMCMs. The input clock is from an oven-controlled crystal oscillator (OCXO) with a frequency stability of ±0.3 ppb. In the implementation, the MMCM (mixed-mode clock manager) parameter is enabled to minimize output jitter. Experiments are conducted to measure the jitter of CLKA and CLKB. Figure 9 shows the random jitter (RJ) and period jitter (PJ) histograms of CLKA and CLKB, which are obtained from more than 6 × 10^6 transitions. The RJ of CLKA is 8.52 ps, and the PJ of CLKA is 9.79 ps. The RJ of CLKB is 8.92 ps, and the PJ of CLKB is 7.60 ps. All the values are root mean square (RMS) values.
The influence of the jitter on the performance of the proposed DTC is analyzed in two aspects. First, the difference in RJ between CLKA and CLKB is −0.4 ps, and that of PJ is 2.39 ps. The proposed DTC is based on the Vernier delay line structure, which is symmetrical, and the clock jitter exists on both delay lines. Therefore, the influence of the jitter of CLKA and CLKB on the DTC is offset by the two delay lines to some degree. Second, the control module of the PDL is a closed-loop structure, which can reduce the jitter of the clocks.
The period jitter of CLKA and CLKB is related to the phase noise. The relation between the period jitter and the phase noise can be calculated from Equation (9) [14]. In Equation (9), JPER is the period jitter, fc is the clock frequency, and L(f) is the phase noise spectrum. The period jitter of CLKA and CLKB is shown in Figure 9, and both values are below 10 ps. The phase noise and jitter of CLKA and CLKB can propagate to the control modules of the PDLs and affect the time delay of the delay taps. The largest jitter on the tap delay of a PDL is ±5 ps [13]. Although the theoretical resolution of the proposed DTC is 1 ps, the measurement accuracy is lower, at about 10 ps (RMS value).
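Equation (9) is not reproduced in this text. As a hedged illustration, the sketch below uses a commonly cited relation between RMS period jitter and single-sideband phase noise, JPER² = (8/ωc²) ∫ L(f) sin²(πf/fc) df, which may differ in form or integration limits from the equation in [14]. The function name, the flat −130 dBc/Hz noise floor, and the choice of fc as the upper integration limit are assumptions made only to keep the example checkable.

```python
import math


def period_jitter_rms_s(phase_noise_lin, f_c_hz, f_max_hz, n_steps=20000):
    """RMS period jitter from a phase-noise profile by trapezoidal integration.

    Uses J_PER^2 = (8 / w_c^2) * integral_0^f_max L(f) * sin^2(pi * f / f_c) df,
    with L(f) in linear units (1/Hz), i.e. 10 ** (dBc_per_Hz / 10).
    """
    w_c = 2.0 * math.pi * f_c_hz
    df = f_max_hz / n_steps
    total = 0.0
    for i in range(n_steps + 1):
        f = i * df
        weight = 0.5 if i in (0, n_steps) else 1.0  # trapezoid end weights
        total += weight * phase_noise_lin(f) * math.sin(math.pi * f / f_c_hz) ** 2
    return math.sqrt(8.0 / w_c**2 * total * df)


# Hypothetical flat phase-noise floor of -130 dBc/Hz on a 200 MHz clock,
# integrated up to the clock frequency.
F_C = 200e6
FLOOR = 10 ** (-130 / 10)
jitter_s = period_jitter_rms_s(lambda f: FLOOR, F_C, F_C)
print(f"{jitter_s * 1e12:.3f} ps RMS")
```

For a flat floor L0 integrated from 0 to fc, the integral has the closed form JPER = sqrt(L0 / (π² fc)), which gives a quick sanity check on the numerical result. A measured phase-noise curve would replace the lambda.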
Average Resolution and Nonlinearity Tests
The performance of the Virtex-6 DTC is measured within the range of 0~500 ns. When CNTA and CNTB are both 0, the obtained time interval is 205 ps. When CNTA and CNTB are both 96, the obtained time interval is 303 ps. Therefore, the obtained average resolution is (303 − 205)/96 = 1.02 ps. According to Figure 7 [...]
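The average-resolution arithmetic above, and the way DNL and INL are usually derived from a code sweep, can be sketched as follows. The end points (205 ps and 303 ps over 96 codes) come from the text; the five-point sweep, the function name, and the end-point definition of the LSB are assumptions, since the measurement procedure for the nonlinearity figures is not described in this text.

```python
def dnl_inl(intervals_ps):
    """DNL and INL in LSB from intervals measured at consecutive codes.

    The LSB is taken as the average step (end-point fit), the same way the
    average resolution is derived from the first and last measurement.
    """
    if len(intervals_ps) < 2:
        raise ValueError("need at least two measurements")
    n_steps = len(intervals_ps) - 1
    lsb = (intervals_ps[-1] - intervals_ps[0]) / n_steps
    steps = [b - a for a, b in zip(intervals_ps, intervals_ps[1:])]
    dnl = [step / lsb - 1.0 for step in steps]  # deviation of each step from 1 LSB
    inl = []
    running = 0.0
    for d in dnl:  # INL is the running sum of the DNL
        running += d
        inl.append(running)
    return lsb, dnl, inl


# End points quoted in the text: 205 ps at code 0 and 303 ps at code 96.
average_lsb_ps = (303 - 205) / 96
print(round(average_lsb_ps, 2))  # 1.02

# Hypothetical five-point sweep, for illustration only.
lsb, dnl, inl = dnl_inl([205.0, 206.1, 207.0, 208.2, 209.0])
print(lsb, [round(d, 2) for d in dnl], [round(i, 2) for i in inl])
```

With an end-point fit the INL returns to zero at the last code by construction, so the reported INL range reflects how far the transfer curve bows between the end points.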
Temperature and Voltage Stability Tests
The temperature stability of the proposed DTC is tested on the evaluation board with the help of a thermal chamber. A constant time interval (5 ns) is generated at different temperatures in the range of 10~70 °C. The time interval is measured by the digital oscilloscope, and the measurement at each temperature is repeated 50 times. The standard deviation at each temperature is shown in Figure 12a, which indicates that the proposed DTC is less sensitive to variations of temperature than other DTCs implemented in a similar FPGA device [10].
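The statistic behind this kind of stability plot can be sketched as below: a standard deviation is computed for the repeated measurements at each condition, and the conditions are compared by the spread between the largest and smallest value. The data are hypothetical (three readings per temperature instead of 50), and the function name is an assumption; the same calculation applies to a sweep over supply voltage.

```python
import statistics


def stability_summary(samples_by_condition):
    """Standard deviation per condition and the largest spread between them."""
    sigma = {
        cond: statistics.stdev(vals) for cond, vals in samples_by_condition.items()
    }
    spread = max(sigma.values()) - min(sigma.values())
    return sigma, spread


# Hypothetical readings in ps of a nominal 5 ns interval at two temperatures.
readings = {
    10: [5000.0, 5001.0, 4999.0],
    70: [5000.0, 5002.0, 4998.0],
}
sigma, spread = stability_summary(readings)
print(sigma, spread)
```

A small spread across conditions indicates that the measurement scatter, and hence the generated interval, is largely independent of the swept quantity.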
The voltage tests are performed using the DC power supply RIGOL DP832A (RIGOL Technologies Inc., Beijing, China). The nominal operating voltage of the DTC board is 1 V, which is [...] measured by the digital oscilloscope, and the measurement at each voltage is repeated 50 times. The standard deviation of the measurement results is shown in Figure 12b. As the supply voltage changes, the largest difference in standard deviation is less than 0.5 ps, which indicates that the proposed DTC is not sensitive to variations of supply voltage.
Comparison with Other DTCs
Table 1 compares the performance of the proposed DTC with previous work. Compared with other DTCs implemented in an FPGA device, the proposed DTC achieves the best resolution. The dynamic range and the linearity are also good among the DTCs in the table. The jitter and skew of the proposed DTC are larger than those of ASIC-based DTCs. This is mainly because the circuit and architecture of an ASIC-based DTC are fully customized, while the FPGA is a commercial device whose architecture is not specialized for DTCs.
Conclusions
This paper presents a novel realization of a DTC based on the principle of the Vernier delay line. The proposed DTC is designed based on a coarse and fine Vernier delay line constructed from PDLs, which are controlled by a closed feedback delay loop and can ensure the linearity and stability of the DTC. The proposed DTC is implemented and tested on the Xilinx FPGA evaluation board. The obtained resolution is 1.02 ps, and the differential nonlinearity and integral nonlinearity of the DTC are −0.17~+0.13 LSB and −0.35~+0.62 LSB, respectively. Compared with other DTCs, the realized DTC is higher in resolution and less sensitive to variations in voltage and temperature.
