## IEEE Photonics Journal

# Gated SPAD Arrays for Single-Photon Time-Resolved Imaging and Spectroscopy 

Volume 11, Number 6, December 2019

Enrico Conca, Student Member, IEEE Iris Cusini<br>Fabio Severini<br>Rudi Lussana<br>Franco Zappa, Senior Member, IEEE<br>Federica Villa, Member, IEEE



# Gated SPAD Arrays for Single-Photon Time-Resolved Imaging and Spectroscopy 

Enrico Conca © , Student Member, IEEE, Iris Cusini © Fabio Severini © , Rudi Lussana ${ }^{(\bullet)}$ Franco Zappa © , Senior Member, IEEE, and Federica Villa ${ }^{\bullet}$, Member, IEEE<br>Dipartimento di Elettronica, Informazione e Bioingegneria-Politecnico di Milano, Milano 20133, Italy

DOI:10.1109/JPHOT.2019.2952670
This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see https://creativecommons.org/licenses/by/4.0/


#### Abstract

Manuscript received September 13, 2019; revised October 31, 2019; accepted November 5, 2019. Date of publication November 11, 2019; date of current version December 17, 2019. This work was supported by the European Union's Horizon 2020 research and innovation programme under Grant 801060 ("Quantum-enhanced on-chip interference microscopy", Q-MIC). Corresponding author: Enrico Conca (e-mail: enrico.conca@polimi.it).


#### Abstract

In this paper, we present the architecture and the experimental characterization of an improved version of a previously developed $32 \times 32$ Single Photon Avalanche Diodes (SPADs) and Time to Digital Converters (TDCs) array, and two new arrays (with $8 \times 8$ and $128 \times 1$ pixels) with the additional capability of actively gating the detectors with subnanosecond rise time. The arrays include high performance SPADs ( $0.04 \mathrm{cps} / \mu \mathrm{m}^{2}, 50 \%$ peak PDE) and provide down to 410 ps Full-Width at Half-Maximum (FWHM) single shot precision and excellent linearity. We developed a camera to exploit these imagers in timeresolved, single-photon applications.


Index Terms: Single photon avalanche diode (SPAD) array, time to digital converters (TDC), SPAD gating.

## 1. Introduction

Single Photon Avalanche Diodes (SPADs) are single photons detectors that, after being used for several decades in research applications, are recently gaining interest also in industrial, automotive and consumer electronics. Key parameters that favor the SPAD commercial exploitation are small size, possibility to be integrated in CMOS processes, ruggedness to high intensity light, room temperature operability, low power supply required to bias the detector and the possibility of being rapidly enabled or disabled. Instead, other single photon detectors, such as Photomultiplier Tubes (PMTs), Superconducting Nanowire Single Photon Detectors (SNSPDs) and Hybrid Photon Detectors (HPDs), do not feature these useful properties all at once.

The main strengths of SPAD arrays with respect to Charge Coupled Devices (CCDs) and CMOS Active Pixel Sensors (APSs) are the absence of readout noise (which allows down to one photon counting within each integration time), the possibility to precisely time stamp the photon arrival time, and to rapidly gate on and off the detector. CMOS SPADs do however suffer from some limitations, namely a limited detection efficiency, especially in the near-infrared, the relatively large pixel pitch and the higher power consumption with respect to conventional image sensors. The new SPAD
array designs presented in this paper aim to exploit technology-specific opportunities, in particular photon time-tagging.

We will focus on the architecture and on the experimental characterization of the improved version of a $32 \times 32$ SPADs and Time to Digital Converters (TDCs) array already presented in [1] and of brand new arrays with $8 \times 8$ and $128 \times 1$ pixels. The new designs, in addition to the photon timestamping capabilities of the former, also allow to actively gate the detector with sub-nanosecond rise time.

Applications for the $32 \times 32$ SPAD and TDC array include Light Detection and Ranging (LiDAR) from satellites [2], first photon imaging [3], under-water obstacle identification [4], coincidence detection in quantum applications [5], multiple time constants Fluorescence Lifetime Imaging (FLIM) [6], Diffuse Optical Tomography (DOT) [7] and quantum physics [8]. The new $32 \times 32$ array overcomes the limitations which hindered the exploitation of the previous version, maintaining the same SPAD size $(30 \mu \mathrm{~m})$ and pixel pitch ( $150 \mu \mathrm{~m}$ ), whereas the new $8 \times 8$ and $128 \times 1$ arrays halve the pixel pitch $(75 \mu \mathrm{~m})$ and include active gating of the detector. Applications like non-line of sight 3D ranging and time domain DOT can benefit from such a feature to discard the effect of the strong first reflection that would saturate the SPADs and prevent to measure the arrival time of the late photons carrying the useful information. The linear array is perfectly suited for spectroscopy applications, with time-gating and time-tagging capabilities being desirable for advanced Raman techniques [9].

The rest of the paper is structured as follows: Section 2 describes the architecture of the three arrays, Section 3 shows the experimental characterization in terms of SPAD and TDC performance, while Section 4 summarizes the results and provides conclusions.

## 2. SPAD Arrays Architectures

The three arrays have been designed and fabricated in a $0.35 \mu \mathrm{~m}$ high-voltage CMOS technology; despite it being quite an old technology node, it offers state-of-the-art SPADs [10] and thus represents an excellent technology choice when the pixel pitch can be relaxed. All the arrays are based on the same architecture, including an array of pixels (SPAD, front end circuit, counter and TDC, internal memories and output buffers) and the TDC and readout global electronics, very similar to the one described in [1].

### 2.1 Timing Electronics and Pixel Architecture

The timing electronics consists of a 16-phases clock interpolation scheme, with separate "START" and "STOP" interpolators to adopt the sliding scale technique. The "START" interpolator is shared by the whole array, while each pixel includes the STOP interpolator, triggered by a photon detection, with an 8 -bit counter to extend the Full Scale Range; the counter can be repurposed to operate the detectors in photon-counting mode. Double-buffering allows global shutter operation and concurrent acquisition and readout to reduce dead time.

Differently from the previous $32 \times 32$ array [1], the TDC works with START and STOP in "direct" configuration, where the START is a global synchronization signal and the STOP is the in-pixel photon detection. The "reverse" configuration has clear advantages for a single TDC operating with a stable laser at high frequencies. In fact, in this case the START is provided only when a photon is detected and the STOP is provided by the subsequent trigger signal, thus the TDC converts only the useful signal reducing the power dissipation. In case of low frequency or not stable lasers, the "reverse" configuration requires the synchronization signal to be provided to the chip at the end of the measurement cycle, which means that, unless the laser can be externally triggered by the camera, a long delay (potentially as long as the TDC full scale range) needs to be introduced on the laser sync by means of a delayer or long cables. The "direct" configuration overcomes this requirement because the sync is provided at the beginning of the measurement, allowing to easily exploit the arrays also in applications with not constant laser repetition rate. Furthermore, in an array of TDCs the power consumption does not increase significantly with the "direct" approach


Fig. 1. Layouts of the three arrays: (a) $32 \times 32$, (b) $8 \times 8$, (c) $128 \times 1$. The chip sizes are $8.7 \times 8.0 \mathrm{~mm}^{2}, 3.6 \times 3.0 \mathrm{~mm}^{2}$ and $10.0 \times 3.0 \mathrm{~mm}^{2}$, respectively.
with respect to the "reverse" one, because most of the dissipation is related to the distribution of the clocks signal, with just a minor contribution from the running in-pixel counters.

The TDC performance has been improved, reaching $1 \mu$ s Full Scale Range (FSR) and 260 ps resolution, corresponding to 150 m and 4 cm respectively in LiDAR measurements, by redesigning the clock generation and distribution circuits (which allowed to improve the achievable resolution to 260 ps , down to 312.5 ps of the previous chips, by means of a higher operating frequency) as well as by extending the in-pixel coarse counter (increasing the full scale range).

The layouts of the three arrays are shown in Fig. 1. Alignment marks for micro-lenses mounting have been implemented in the chip design, as we expect to improve the overall equivalent fill-factor (FF) up to a theoretical limit of about $78 \%$ (given by the fact that the micro-lens is circular whereas the SPAD pixel is square), using micro-lens arrays (MLAs) already developed and tested [12] or by means of new MLAs developed by Micro Photon Devices [13].

## $2.232 \times 32$ SPADs and TDCs Array

The main feature that sets apart the revised version of the $32 \times 32$ array from its predecessor is the ability to increase the measurement duty cycle in photon timing mode both when operating with low repetition rate lasers (thanks to the extended TDC range) and with higher repetition rate lasers (by allowing multiple detection windows within the same frame). In fact, the measurement duty cycle is set by the ratio between the time within which the SPADs are active over the frame-time; in time-tagging mode, the limit is set by the TDC FSR. However, if multiple excitation windows can be opened within the same frame, the duty cycle (D) can increase to (Fig. 2):

$$
\mathrm{D}=\frac{N_{G A T E S} \cdot F S R}{T_{\text {FRAME }}}
$$

The previous array could approach a unity duty cycle by opening $\sim 30$ gate windows per frame of acquisition, requiring a minimum sync frequency of 3 MHz , due to the 320 ns FSR and $10 \mu \mathrm{~s}$ frame-time. The new revision, thanks to a longer FSR ( $<1 \mu \mathrm{~s}$ ) and shorter frame-time ( $5 \mu \mathrm{~s} \mathrm{~min}$, thanks to a redesigned readout circuit) can obtain the same duty cycle with a sync frequency as low as 1 MHz , with the advantage of a doubled throughput and potentially allowing to distinguish even faster variations in the imaged scene.

Multiple detection windows are allowed by an out-of-pixel counter that, per each frame, stores the ID of the detection window where the photon is detected; this value is appended to the 12 bit TDC conversion and, together with a 64-entry global START interpolator memory, allows to open multiple detection windows while still providing the correct time tag of the photon arrival time; the


Fig. 2. Timing diagram of the multi-gate operation. The duty cycle of the measurement is set by the SPAD gate-ON time over the frame time.


Fig. 3. Left: block schematic of the gated SPAD front-end circuitry implemented in the $8 \times 8$ and $128 \times 1$ arrays. The SPAD gating is performed by transistors M3 and M4, respectively, $V_{G A T I N G}$ being equal to the excess bias plus the undervoltage. Right: a portion of the $8 \times 8$ array active area, which includes the SPADs and the front-end transistors.
conversion rate is still limited to one TDC conversion per pixel per frame. We experimentally verified that it is possible to keep SPADs and TDCs active for about $80 \%$ of the frame, operating at 200 kfps , which represents a significant improvement in respect to the $35 \%$ of the first implementation of the $32 \times 32$ array [11].

Despite the additional counters are outside the imaging area, the FF is only $3.14 \%$; however, it will be recovered by means of an array of micro-lenses, whose effectiveness has been already proved for f-\# larger than 16 [12]; recent developments in microlenses developed by Micro Photon Devices (MPD) [13] have shown a concentration factor larger than 20 for $\mathrm{f}-\mathrm{\#}$ as small as 5 , allowing to approach the $78 \%$ fill-factor recovery limit for round microlenses.

Although recently many SPAD arrays for photon timing have been developed by several research groups [14]-[18], with different trade-offs and recommended applications, we believe that the array we present provides remarkable flexibility with its per-pixel TDCs with extended range and high duty cycle, coupled with extremely low-noise SPADs.

## $2.38 \times 8$ and $128 \times 1$ Gated SPADs and TDCs Array

The developed $8 \times 8$ and $128 \times 1$ SPAD arrays have a main additional feature with respect to the larger array, namely the possibility to be actively gated on and off, bringing the SPAD bias voltage above or below the breakdown voltage. Such active gating is performed by the same circuit that quenches the SPAD and senses the avalanche, as shown in the simplified schematic in Fig. 3 left. In particular, transistors M1 and M2 sense the avalanche, "event detection" block masks spurious events synchronous with the SPAD disabling, "signal generation + holdoff" block assures the correct


Fig. 4. DCR CDF of the $32 \times 32$ array, which is representative also for the $8 \times 8$ and the $128 \times 1$ arrays (left) and PDE of the detectors (right) at three different excess biases ( 4,5 and 6 V ), measured with the camera at room temperature operating in counting mode.
hold-off duration after each triggering event and drives transistors M3 and, through a level shifter, M4, which reset and disable the SPAD when operated in gated mode.

In order to improve the FF of the $8 \times 8$ array, only the front-end transistors M1, M2, M3, M4 have been laid out close to the SPAD active region, whereas the remaining part of the circuitry has been located outside this area, as shown in Fig. 3 right. Given the $75 \mu \mathrm{~m}$ pitch and $30 \mu \mathrm{~m}$ diameter SPADs for both the arrays, this allowed to achieve a fill-factor of $12.5 \%$ for both $8 \times 8$ and $128 \times 1$ variants.

## 3. Experimental Characterization

In order to characterize the arrays and exploit them in final applications, a camera able to host each one of the three chips has been developed. Three interchangeable chip carriers have been designed to allow the connection of the different arrays to the camera. The camera provides the power supplies, manages the communication with the SPAD arrays and sends the acquired data to a remote computer through a USB 3.0 link. It has been optimized to facilitate the heat dissipation of the chip, specifically for the $32 \times 32$ array, whose power consumption may reach 5 W when operating for $80 \%$ of the frame duration. To this aim, the $32 \times 32$ array is directly glued on a copper heat sinker to move the heat towards the housing; cooling is completely passive to avoid the introduction of a fan which may introduce unwanted vibrations in optical setups. Only the $32 \times 32$ pixel camera at the moment has been fitted with the cooling system.

### 3.1 SPAD DCR and PDE

All the arrays include $30 \mu \mathrm{~m}$ diameter SPADs with the same performance presented in [10]. The Dark Counting Rate (DCR) Cumulative Distribution Functions (CDFs) of the three arrays present the same trend, with 60 cps median DCR at 5 V excess bias and $5 \%$ of hot pixels. Fig. 4 left shows the DCR CDF of the $32 \times 32$ array at $4 \mathrm{~V}, 5 \mathrm{~V}$ and 6 V excess bias.

The Photon Detection Efficiency (PDE) at $4 \mathrm{~V}, 5 \mathrm{~V}$ and 6 V excess bias is shown in Fig. 4 right, with no appreciable differences among the three arrays. The peak PDE is about $50 \%$ at 450 nm and $4 \%$ at 850 nm .

### 3.2 Optical Crosstalk

We measured the optical crosstalk as in [19]: the arrays were operated in photon-timing mode while keeping them in a dark environment. SPADs were activated with 5 V excess bias over 350 ns gate windows and the arrival time of one ignition per frame per pixel was recorded in each window, collecting a total of $5 \cdot 10^{9}$ measurements. Then, a pixel was arbitrarily selected as an "aggressor", and for each "victim" pixel a histogram of the difference between "aggressor" and "victim" arrival


Fig. 5. "Aggressor" and "victim" arrival time difference histogram used for crosstalk computation. The red line is the expected triangular cross-correlation in case of no crosstalk.


Fig. 6. Crosstalk probability color map for the $32 \times 32$ (a), $8 \times 8$ (b), and $128 \times 1$ (c) arrays. Color bar in a $\log _{10}$ scale; white pixel is the "aggressor".


Fig. 7. Example of single shot precision histogram for a representative pixel of the $32 \times 32$ SPAD array (left). Single shot precision (FWHM) of the three arrays versus resolution (right).
times was built. The resulting distribution for a pixel adjacent to the "aggressor" is shown in Fig. 5. In order to get rid of spurious coincidences introduced by dark counts, the expected triangular cross-correlation in absence of crosstalk was subtracted (Fig. 5, red line); the resulting histogram contains only the $\mathrm{N}_{\mathrm{xy}}$ counts due to crosstalk, which were used to compute the crosstalk probability as:

$$
X=N_{x y} /\left(N_{x}+N_{y}\right)
$$

where $N_{x}$ and $N_{y}$ are the total counts accumulated in "aggressor" and "victim" pixels, respectively.


Fig. 8. DNL of the $32 \times 32$ (a), $8 \times 8$ (b), and $128 \times 1$ (c) arrays. The better linearity of the $32 \times 32$ array is achieved thanks to the absence of active gating.


Fig. 9. Gate shape for the $8 \times 8$ (left) and $128 \times 1$ (right) arrays, showing fast rising edge transitions of 750 and 780 ps , respectively.


Fig. 10. Micrographs of the three arrays: (a) $32 \times 32$, (b) $8 \times 8$, (c) $128 \times 1$.

The results for each array are shown in Fig. 6. For the smaller arrays, crosstalk is higher for adjacent pixels ( $10^{-3.8}$ ), decreases for the diagonal ones in the $8 \times 8$ array $\left(10^{-4.7}\right)$ and is negligible for the farther away pixels, resulting in a total crosstalk probability of $6.6 \cdot 10^{-4}$ for the $8 \times 8$ array, dropping to $5.6 \cdot 10^{-4}$ for the $128 \times 1$. The crosstalk for the $32 \times 32$ array is lower because of its larger pitch $(150 \mu \mathrm{~m})$, reaching $10^{-4.3}$ for adjacent pixels in the horizontal direction and resulting in a total crosstalk probability of $2.3 \cdot 10^{-4}$. Unexpectedly, the crosstalk in the horizontal direction is much stronger than on the vertical, probably due to the nonsymmetric layout of the pixel and of the metal lines.

TABLE I
Summary of the Performance Metrics for the Arrays Presented in This Paper

|  | $32 \times 32$ array <br> (previous version) [1] | $32 \times 32$ array <br> (this paper) | $8 \times 8$ array <br> (this paper) | $128 \times 1$ array <br> (this paper) |
| :--- | :---: | :---: | :---: | :---: |
| Active gating | NO | NO | Yes, 750 ps rising edge | Yes, 780 ps rising edge |
| TDC resolution / FSR | $312.5 \mathrm{ps} / 320 \mathrm{~ns}$ | $260 \mathrm{ps} / 1.06 \mu \mathrm{~s}$ | $260 \mathrm{ps} / 1.06 \mu \mathrm{~s}$ | $260 \mathrm{ps} / 1.06 \mu \mathrm{~s}$ |
| Single shot precision | $600 \mathrm{ps} \mathrm{(FWHM)}$ | $420 \mathrm{ps} \mathrm{(FWHM)}$ | $450 \mathrm{ps}(\mathrm{FWHM})$ | $410 \mathrm{ps}(\mathrm{FWHM})$ |
| Framerate $(\max )$ | 100 kfps | 200 kfps | $>1 \mathrm{Mfps}$ | $>1 \mathrm{Mfps}$ |

TABLE II
Comparison Between SPAD Arrays Presented in This Work and Other SPAD Arrays With Time-Tagging Capability Presented in Literature
$\left.\begin{array}{|c|c|c|c|c|c|c|c|}\hline \text { Ref. } & \text { Pixel \# } & \begin{array}{c}\text { Pixel fill } \\ \text { factor }\end{array} & \text { Process } & \text { TDC } & \begin{array}{c}\text { Photon } \\ \text { Counting }\end{array} & \begin{array}{c}\text { Peak PDE } \\ \text { @ operating VEX } \\ \text { [cps/ }\end{array} \\ \hline \text { This work }\end{array}\right]$

### 3.3 TDC Precision and Linearity

The TDC single shot precision has been estimated as the Full-Width at Half-Maximum (FWHM) of the histogram shown in Fig. 7 left. The dependence of the measured TDC precision versus resolution is shown in Fig. 7 right for the three arrays, which present similar results. A single shot precision as good as 410 ps FWHM can be achieved with 260 ps resolution. In order to reduce power dissipation, it is possible to decrease the TDC clock frequency, with a maximum LSB of 625 ps and a correspondingly longer FSR, resulting in about 1 ns single shot precision FWHM.

The TDC non-linearity has been measured through a code density test and expressed in terms of Differential-Non-Linearity (DNL) and Integral-Non-Linearity (INL). The root mean square (rms) DNL is $0.61 \%, 1.4 \%, 2.2 \%$ of the Least Significant Bit (LSB), while the rms INL is $10.1 \%, 16.9 \%, 13.1 \%$ of the LSB, respectively for the $32 \times 32,8 \times 8,128 \times 1$ arrays. As the $32 \times 32$ array is not designed for gated mode operation, the first four bins of the histogram have been discarded for DNL and INL computation, whereas for the $8 \times 8$ and $128 \times 1$ also the enabling and disabling transitions have been considered and they provide the dominant contribution to non-linearity, as visible in Fig. 8, which shows the DNL for representative pixels. The very good TDC linearity performance has been achieved by implementing the sliding scale technique in the TDC architecture [20], which has two separate interpolators for START and STOP signals.

### 3.4 Active Gate

The shape of the active gate has been characterized by illuminating the $8 \times 8$ and $128 \times 1$ arrays with a pulsed laser and by shifting the laser pulse with 50 ps steps (by means of a programmable delayer) while operating the arrays in photon counting mode. The results for a representative pixel
are presented in Fig. 9, which shows enabling edges of $750 \mathrm{ps}(8 \times 8$ array $)$ and $780 \mathrm{ps}(128 \times 1$ array), considering $10 \%-90 \%$ transitions.

Falling edges are faster than rising ones, since they are given by the masking operation of the "event detection" block (Fig. 3, left) whereas rising edges are representative of the actual excess bias provided to the SPADs. The result is almost comparable to state of art single pixel fast gating circuits based on the SPAD-dummy approach, such as the one described in [21], which achieves 430 ps transition edges. In both the curves an overshoot, whose amplitude is lower than $10 \%$ of the average counts and who is caused by the bond wire parasitic inductance, is visible at the beginning of the activation window. Note that the different background levels before and after the gate period in Fig. 9 (left) are caused by trends on the chip temperature, which varied during the measurement causing a variation in the SPADs DCR.

## 4. Conclusions

Three SPAD arrays for photon timing applications have been presented and the micrographs are shown in Fig. 10. The arrays have a different number of pixels ( $32 \times 32,8 \times 8$ and $128 \times 1$ ) and different FF ( $3.14 \%$ the $32 \times 32$ array and $12.5 \%$ the $8 \times 8$ and $128 \times 1$ arrays). The $30 \mu \mathrm{~m}$ diameter SPADs integrated in the arrays present low DCR ( $0.12 \mathrm{cps} / \mu \mathrm{m}^{2}$ at operating temperature) and $50 \%$ peak PDE at 450 nm . The crosstalk among adjacent pixels is kept below $10^{-3.8}$ in all the arrays. The TDCs have a long FSR of $1 \mu \mathrm{~s}$ (corresponding to 150 m in LiDAR measurements) and a resolution as low as 260 ps, which leads to 410 ps single shot precision (FWHM). Very good linearity performance has been achieved by exploiting the sliding scale technique, limiting the rms DNL to few \% of LSB and the INL well below $20 \%$. The two smaller arrays can also be operated with sub-nanosecond edges active gating. Excellent performance has been achieved in all the most important parameters for SPAD imagers, enabling their exploitation in many single-photon time-resolved applications.

Table I summarizes the main performance figures of the SPAD arrays presented in this paper, namely in terms of temporal resolution, gating capability and maximum framerate. Table II compares this work with respect to other CMOS SPAD imagers with time-tagging capabilities presented in literature; it can be noticed that the arrays presented in this work compare favorably in terms of TDC FSR and, although limited in terms of temporal resolution by the old technology node, obtain best-in-class detector noise and detection efficiency.

## References

[1] F. Villa et al., "CMOS imager with 1024 SPADs and TDCs for single-photon timing and 3D time-of-flight," IEEE J. Sel. Topics Quantum Electron., vol. 20, no. 6, pp. 364-373, Nov/Dec. 2014.
[2] P. Foglia Manzillo et al., "ALART: A novel lidar system for vegetation height retrieval from space," Proc. SPIE, vol. 9645, 2015, Art. no. 96450E.
[3] D. Shin et al., "Photon-efficient imaging with a single-photon camera," Nature Commun., vol. 7, no. 24, 2016, Art. no. 12046.
[4] D. McLeod, J. Jacobson, M. Hardy, and C. Embry, "Autonomous inspection using an underwater 3D LiDAR," in Proc. OCEANS - San Diego, 2013, pp. 1-8.
[5] M. Unternährer et al., "Coincidence detection of spatially correlated photon pairs with a novel type of monolithic time-resolving detector array," in Proc. Conf. Lasers Electro-Opt. Europe Eur. Quantum Electron. Conf., 2017, p. 218.
[6] M. J. Cole et al., "Fluorescence lifetime imaging for biomedicine and spectroscopy," in Proc. Conf. Dig. Conf. Lasers Electro-Opt Europe, 2000, p. 1.
[7] J. Bouchard et al., "A low-cost time-correlated single photon counting system for multiview time-domain diffuse optical tomography," IEEE Trans. Instrum. Meas., vol. 66, no. 10, pp. 2505-2515, Oct. 2017.
[8] F. Piacentini et al., "Determining the quantum expectation value by measuring a single photon," Nature Phys., vol. 13, no. 12, pp. 1191-1194, 2017.
[9] I. Nissinen, J. Nissinen, P. Keränen, D. Stoppa, and J. Kostamovaara, "A $16 \times 256$ SPAD line detector with a 50-ps, 3-bit, 256-channel time-to-digital converter for Raman spectroscopy," IEEE Sens. J., vol. 18, no. 9. pp. 3789-3798, May 2018.
[10] F. Villa et al., "CMOS SPADs with up to $500 \mu \mathrm{~m}$ diameter and $55 \%$ detection efficiency at 420 nm ," J. Modern Opt., vol. 61, no. 2, pp. 102-115, Jan. 2014.
[11] R. Lussana, F. Villa, A. Dalla Mora, D. Contini, A. Tosi, and F. Zappa, "Enhanced single-photon time-of-flight 3D ranging," Opt. Exp., vol. 23, no. 19, pp. 24962-24973, Sep. 2015.
[12] G. Intermite et al., "Fill-factor improvement of Si CMOS single-photon avalanche diode detector arrays by integration of diffractive microlens arrays," Opt. Exp., vol. 23, no. 26, pp. 33777-33791, 2015.
[13] 2019. [Online]. Available: http://www.micro-photon-devices.com/
[14] M. Perenzoni, D. Perenzoni, and D. Stoppa, "A $64 \times 64$-pixels digital silicon photomultiplier direct TOF sensor with 100-MPhotons/s/pixel background rejection and imaging/altimeter mode with $0.14 \%$ precision Up to 6 km for spacecraft navigation and landing," IEEE J. Solid-State Circuits, vol. 52, no. 1, pp. 151-160, Jan. 2017.
[15] S. Lindner, C. Zhang, I. M. Antolović, M. Wolf, and E. Charbon, "A $252 \times 144$ SPAD pixel FLASH LiDAR with 1728 dual-clock 48.8 ps TDCs, integrated histogramming and 14.9-to-1 compression in 180 nm CMOS technology," in Proc. IEEE Symp. VLSI Circuits, 2018, pp. 69-70.
[16] R. K. Henderson et al., "A $192 \times 128$ time correlated single photon counting imager in 40 nm CMOS technology," in IEEE 44th Eur. Solid State Circuits Conf., 2018, pp. 54-57.
[17] G. Acconcia, A. Cominelli, I. Rech, and M. Ghioni, "High-efficiency integrated readout circuit for single photon avalanche diode arrays in fluorescence lifetime imaging," Rev. Sci. Instrum., vol. 87, no. 11, 2016, Art. no. 113110.
[18] R. M. Field, S. Realov, and K. L. Shepard, "A 100 fps, time-correlated single-photon-counting-based fluorescence lifetime imager in 130 nm CMOS," IEEE J. Solid-State Circuits, vol. 49, no. 4, pp. 867-880, Apr. 2014.
[19] F. Villa et al., "High fill-factor $60 \times 1$ SPAD array with 60 sub-nanosecond integrated TDCs," Photon. Technol. Lett., vol. 27, no. 12, pp. 1261-1264, 2015.
[20] B. Markovic, S. Tisa, F. A. Villa, A. Tosi, and F. Zappa, "A high-linearity, 17 ps precision time-to-digital converter based on a single-stage delay Vernier loop fine interpolation," IEEE Trans. Circuits Syst. I, vol. 60, no. 3, pp. 557-569, Mar. 2013.
[21] A. Ruggeri, P. Ciccarella, F. Villa, F. Zappa, and A. Tosi, "Integrated circuit for sub-nanosecond gating of InGaAs/InP SPAD," IEEE J. Quantum Electron., vol. 51, no. 7, Jul. 2015, Art. no. 4500107.
[22] L. H. Campos Braga et al., "An $8 \times 16$-pixel 92 k SPAD time-resolved sensor with on-pixel 64 ps 12 b TDC and $100 \mathrm{MS} /$ s real-time energy histogramming in $0.13 \mu \mathrm{~m}$. CIS technology for PET/MRI applications," in Proc. IEEE Int. Solid-State Circuits Conf., Dig. Tech. Papers, 2013, pp. 486-487.
[23] C. Veerappan et al., "A $160 \times 128$ single-photon image sensor with on-pixel 55 ps 10 b time-to-digital converter," in Proc. IEEE Int. Solid-State Circuits Conf., Dig. Tech. Papers, 2011, pp. 312-314.

