I. INTRODUCTION
T HE readout of Micro-Pattern Gaseous Detectors (MPGDs) by means of bare pixel chips instead of the conventionally used pad-planes was pioneered in 2004 [1] , [2] . Since then, this concept has been successfully applied in TPC-prototypes [3] , [4] , photoelectric X-ray polarimetry [5] and rare particle searches [6] .
The maximal detector integration is achieved in GridPix-detectors utilizing Micromegas [7] for signal amplification in the gas-phase. The operation principle of such detectors is illustrated in Fig. 1(a) ): Electron-ion pairs are created in the drift region by incident ionizing radiation. An electric field of up to several kV/cm causes the ions and electrons to move towards the cathode and the readout anode, respectively. The latter is made of the integrated amplification grid, the so-called InGridstructure [8] , [9] depicted in Fig. 1(b) ), and a pixelized readout ASIC. Underneath the amplification grid, the applied electric [4] . (b) Scanning microscope picture of an InGrid structure [10] . The amplification grid is supported by SU-8 pillars which are hidden inbetween the grid-holes. The latter are aligned with the pixels of the ASIC underneath.
field reaches several kV cm. Hence, an electron avalanche is triggered by each primary electron entering this region, thus, enabling signal detection by the pixels of the ASIC. Since the distance covered by the drifting electrons is directly proportional to the drift-time, the 3-dimensional coordinates of the primary ionizations are easily reconstructed from the positions of the hit pixels and the measured arrival time of the hit. In transverse direction, the achievable single-point resolution is predetermined by the pixel size, while it is limited by the accuracy of the drift-time measurement in longitudinal direction.
Improving the circuitry tested on GOSSIPO-3 [11] , [4] , the GOSSIPO-4 demonstrator discussed in this paper is the final prototype for Timepix3 [12] -the full featured successor of the Timepix ASIC [13] which was utilized in several previous studies validating the GridPix detector concept. The paper is organized as follows: The TDC-technique is described in Sec. II. The circuitry implemented on the demonstrator-chip is presented in Sec. III, the focus is here on the Phase Locked Loop (PLL) which ensures the high TDC accuracy. Before the conclusions are drawn in Sec. V, the achieved PLL-and TDC-performance are discussed in Sec. IV.
II. THE TDC-TECHNIQUE
In order to provide the required single point resolution in longitudinal direction, the Time of Arrival (ToA) of individual charge depositions has to be measured with a precision of a few nanoseconds in GOSSIP-detectors (acronym for: Gas On Slimmed Silicon Pixels, denoting GridPix detectors made of slimmed silicon chips. This concept is evaluated for the ATLAS upgrade [14] .). In principle, this can be achieved with the TDC-0018-9499 © 2014 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission.
See http://www.ieee.org/publications_standards/publications/rights/index.html for more information. technique known from the Timepix ASIC by simply increasing the clock frequency to several 100 MHz. However, due to the high demands on signal routing and power supply as well as on cross-talk suppression, a chip wide high frequency clock network is highly undesirable on a mixed-signal ASIC.
The TDC-technique introduced in [15] circumvents these issues by creating the high-frequency clock locally in the pixels only when it is needed, see Fig. 2 for a timing diagram. When the discriminator in a pixel senses a hit, a 640 MHz ring-oscillator located next to that pixel is enabled. The number of fast clock cycles until the occurrence of the first rising edge of the chip wide slow-clock (40 MHz) is counted with a Linear Feedback Shift Register (LFSR). Then the ring-oscillator is stopped and the number of slow clock cycles until the end of the acquisition window is counted. The ToA with respect to the end of the acquisition window can be calculated from the number of counted fast and slow clock cycles ( and ) and the corresponding frequencies ( and ):
Since the ring-oscillators are operated only for short time intervals, neither synchronization between the individual oscillators nor long term stability are an issue. However, as can be seen in [15] , the absolute oscillation frequency has to be tuned due to frequency variations caused by inaccuracies in the production process. In [4] , it has been found that tuning Voltage Controlled Oscillators (VCOs) by an on-chip Low DropOut Regulator (LDO) as done on GOSSIPO-3 is a feasible but not the ideal technique for such frequency tuning. On GOSSIPO-4, the VCOs are, consequently, replaced by the circuit described in Section III-B.
III. GOSSIPO-4 CIRCUITRY
GOSSIPO-4 is a demonstrator IC meant to verify that above described TDC-technique performs well when RC-controlled oscillators that are distributed across a pixel matrix are automatically tuned by a PLL in the chip periphery. The chip is produced in a 130 nm technology and operated at 1.5 V. For testing purposes, it carries a PLL which provides the oscillator control voltage and the digital part of 8 pixels organized in a m-pitch forming a so-called super-pixel. This layout resembles the structure of the succeeding Timepix3 which will feature pixels organized in 8192 super-pixels. 
A. The Super-Pixel
The logic implemented in the pixels forming the GOSSIPO-4 super-pixel is sketched in Fig. 3 . The 8 individual pixels of the super-pixel share one ring-oscillator which is enabled upon pixel activation. One of four available test-inputs can be logically interconnected with the counting-and synchronizationlogic which controls the operation of the two ToA-and the Time-over-Threshold-(ToT-) counters of a pixel. These test-inputs mimic the signal of the pixel front-end in a fully operational ASIC and allow activation of different pixels at arbitrary times.
Neglecting the parasitic influence of the time-walk effect introduced by front-end electronics, this setup allows to determine the accuracy which can be maximally achieved with the proposed TDC-technique. In case of the Timepix3 ASIC, the limitations introduced by the time-walk effect are addressed by fast analog circuitry and, using the ToT information on the input charge, by a time-walk compensation technique during data analysis.
The implemented counters with states are LFSRs following the De Bruijn technique [16] . The counting logic is a synchronous 3 bit finite-state machine which controls the counters during the data taking and the readout phase. Besides by the 40 MHz clock, it is externally controlled by three signals: Reset, Trigger and Token.
The synchronization logic is an asynchronous state machine that creates a gate-signal starting with the hit-signal and stopping at the first rising edge of the system clock. This gate-signal enables the local oscillator if necessary and synchronizes the fast clock signal with the system clock. Thus, glitches compromising correct counter operation are suppressed.
B. The RC-Controlled Ring-Oscillators
The schematic of the GOSSIPO-4 ring-oscillators is shown in Fig. 4 . Variation of the control voltage allows for tuning of the oscillation frequency since the varactor capacitances and, hence, the RC-times of the delay elements are voltage dependent. Since it is basically an unloaded static potential, distribution of the control voltage across the pixel matrix is easier than in the previous approach discussed in [11] , [4] .
The target frequency of 640 MHz can be reached in all process corners, see Fig. 5 . As seen from Monte Carlo simulations, the output frequencies vary by less than 1% from one oscillator to another on the same die due to the good resistor and capacitor matching properties.
C. The PLL Architecture
The oscillator control voltage is generated by means of a PLL and will be broadcasted across the super-pixel matrix of a fullyfletched pixel chip for adjustment of the oscillation frequencies, as shown in Fig. 6 . This approach allows to compensate in all pixels for frequency fluctuations caused by supply voltage or process parameter variations as well as for temperature changes with a single PLL in the chip periphery.
The developed PLL is based on the work presented in [17] . It is made of a Phase-Frequency-Detector (PFD), a charge-pump, a third-order passive loop-filter, a copy of the RC-controlled oscillator which is instantiated in the super-pixel and a clock divider. The functional blocks are described in some detail in the following.
Phase Frequency Detector: The PFD detects phase and frequency differences between the input reference clock ( ) and the feedback clock signal ( ). As shown in Fig. 7 , the PFD consists of two resettable, positive edge triggered D-flipflops, an AND-gate and a delay cell. The inputs of both flip-flops are tied to the -level. The functionality is that of a state machine with three states: If the reference clock leads the feedback clock, the -signal goes high while the -signal remains low. If the feedback clock leads the reference clock, a high pulse appears in the -signal while the -signal remains low. If input phase and frequency are equal the PLL is locked at the correct frequency. In this case, very narrow pulses appear at the -and -signals. A dead-lock can occur when the phase-error becomes small in comparison with the combined response time of PFD, chargepump, and the loop-filter. This causes the phase detector gain to drop to zero, which in turn prevents further frequency adjustments. In order to avoid this dead-lock, the delay on the PFD reset path is made larger than the switching time of the charge pump currents. Besides, the delay time has to be optimized for an ideal trade-off between the linearity and in-band noise of the charge pump.
The output of the other two D-flip-flops ( and ) track the -and the -signals with a delay , hence, realizing a lock-detection circuit whose output is made available off-chip. The value of the delay time determines the sensitivity of the loss of lock detection, it has to be chosen relatively large with respect to in order to give an accurate representation of the lock status.
Charge-Pump: The charge pump uses a differential architecture with a dummy branch (see Fig. 8 ). The two switched current sources charge or discharge the capacitor of the loop-filter with (configurable), thus, converting the phase and frequency differences into a control-voltage. The branches are controlled by the -and -, the -and -signals, respectively. The transistors (M5, M6 and M8) form current mirrors providing the biasing current for the charge pump. For charge-sharing minimization, the switch transistors (M1-M4) in the charge pump are minimum size.
Loop-Filter: As discussed in [18] , the third-order loop-filter shown in Fig. 8 eliminates undesired high frequency noise signals and, therefore, ensures a stable control voltage. It consists of two resistors ( and ) and three capacitors ( , and ) that create a three-pole one-zero network. Adequate selection of these parameters allows for optimization of the noise performance and circuit stability. Based on system-level simulations, , introducing low-pass characteristic to , has been fixed at 710 fF. and , largely determining the PLL stability, were fixed at 7.4 pF and k , respectively. The third branch of the loop filter ( and ) forms another non-dominant frequency pole that further filters high frequency noise on the control voltage . In order to meet the requirements on area and suppression performance, and have been chosen to be 320 fF and k , respectively. Frequency-Divider: The frequency divider is made of a chain of four D-flip-flops dividing the oscillator output frequency of 640 MHz by 16, hence, yielding the needed 40 MHz feedbackclock.
SLVDS-Driver: A Scalable Low Voltage Differential Signal (SLVDS) driver as shown in Fig. 9 has been designed enabling off-chip investigation of a 320 MHz clock signal which can be used for characterization of the the PLL. The chosen differential transmission scheme is insensitive to common-mode signal disturbances and crosstalk. Therefore, it allows to operate at small signal levels without a reduction of the noise margin.
As proposed in [19] , the driver implementation is based on a four-transistor switching scheme arranged in a bridge configuration. When the transistors M10 and M12 are switched off, the transistors M11 and M13 are switched on and the polarity of the output current is positive and vise versa. The choice of the dimensions of the switching transistors M10-M13 is made trading on-resistance for power consumption. A small on-resistance is 
IV. MEASUREMENT RESULTS
For measurements on the PLL-and the TDC-performance, the chip was bonded to a standard chip package and mounted on a printed circuit board, as shown in Fig. 10 . The logic stimuli needed for operation as well as the readout of the data are provided by a Spartan-3 FPGA hosted on the multipurpose S3-MultiIO-board developed at the University of Bonn [20] .
A. PLL Performance
For the measurements on the PLL performance, the 40 MHz reference clock is provided by an Agilent 81134A signal generator. The outputs of the on-chip LVDS driver are directly connected to a differential probe of a Tektronix MS-O70804 oscilloscope ( GS s, 8 GHz bandwidth). The S3-MultiIO-board is used only as power supply.
The Time Interval Error (TIE) defines how far each active edge of the output clock varies from its ideal position. A jitter histogram for a TIE measurement is shown in Fig. 11 . Fitting by a Gaussian distribution yields a width ( ) of only 23.4 ps. Fig. 12 shows an eye diagram of the 320 MHz output clock. Besides the central branch, two side-branches caused by crosstalk from the positive and the negative edge of the reference clock are observed in this plot. This is no issue of the PLL itself, but only of the present test-chip on which the reference-clock and the LVDS-wire-bond pads are located next to each other. The total eye-opening is 1.37 ns on the time axis and about 340 mV on the voltage axis. The main PLL characteristics are listed in Table I .
B. TDC Delay-Scan
A diagram of the setup used for TDC characterization is shown in Fig. 13 . The Qt-based DAQ controls an Agilent 81110A pulse generator and the S3-Multi-IO-board which provides the necessary logic stimuli for operation and read-out of the super-pixel.
The most important measurement is a delay-scan performed by activating the test-inputs at well defined times relative to the The individual TDC-bins are characterized by their widths , given by which are start-and end-point of a time-bin, as well as by , the steepness of the edges which is proportional to the time jitter. These parameters are obtained by matching (complementary) Gauss error functions: (2) to the data. See Fig. 15 for an example plot of a typical TDC-bin. The differential non-linearity (DNL) is calculated from the actual and the ideal TDC-bin width, and ns, respectively:
The DNL quoted for a specific TDC is the maximum of all individual absolute values of this TDC. The integral non-linearity (INL) is given by the maximum deviation from the ideal TDC-characteristics:
Characterization of Individual Pixels: The spread of the observed TDC-bin sizes is visualized in Fig. 16 . The error-bars represent the jitter of the individual bins. It is rather low since only digital circuitry is investigated.
DNL and INL of all pixels on chip 1 are listed in Table II . It has to be noted that both DNL and INL are driven by the width of the second TDC-bin which shows the largest deviation from the ideal value. Since this is the first non-zero value, it is CHIP 1 assumed that this effect is introduced by the start-up behavior of the ring-oscillator.
For all 80 investigated pixels on 10 different chips, similar results have been obtained. The mean DNL of all investigated pixels is 300 ps with a RMS of 80 ps. The INL is distributed around a mean of 380 ps with a RMS of 130 ps.
Characterization of Multiple Pixels in the Same Super-Pixel: In any application, it has to be anticipated that several pixels of one super-pixel are activated within a few nano-or even picoseconds. Whether or not this deteriorates the TDC performance is investigated by enabling all pixel in a super-pixel during the delay-scan. The result of this delay-scan performed with chip 1 is shown in Fig. 17 . Some events which do not match the expectation are observed. This is only observed on chip 1 and the number of outliers is negligible. Consequently they are not taken into account for analysis. TDC-bin width and jitter are displayed in Fig. 18 . Measured DNL and INL are listed in Table III. The mean DNL of all investigated pixels is 330 ps with a RMS of 90 ps. The INL is distributed around a mean of 300 ps with a RMS of 100 ps. Concluding, it is stated that the DNL is not influenced if multiple pixels in one super-pixel are activated at the same time. The INL is even improved since the impact of the ring-oscillator start-up behavior does not affect pixels which start counting when the oscillator is already running.
Dependence on Power Supply Stability: Considering application in a full featured GOSSIP-detector, it has to be ensured that the TDC performance is not affected by perturbations on the power supply rail.
As intended, the PLL compensates for global variations of the supply voltage by readjusting the control voltage of the ringoscillators. See Fig. 19 for a display of the control voltages in a 
V. CONCLUSION
The GOSSIPO-4 demonstrator IC, fabricated in a 130 nm process, has been designed for verification of a novel PLL-based TDC-technique with nanosecond accuracy: A 640 MHz clock signal, providing high TDC accuracy, is created by a ring oscillator in the pixel matrix. The frequency of all oscillators in a full scale pixel chip is automatically tuned by a PLL in the chip periphery. In this paper, this approach has been verified: In each pixel, the TDC bin size is found to be about 1.56 ns. Differential and integrated non-linearity are less than 400 ps. At an 320 MHz test-output, the TIE jitter performance of the implemented PLL is below 24 ps, the duty cycle is 50.75%. Based on this success, Timepix3 featuring charge sensitive pixels is designed using the central building blocks of the GOSSIPO-4 IC for time of arrival measurements.
