A CMOS 8×8 SPAD array for Time-of-Flight measurement and light-spot statistics by Vornicu, Ion et al.
A CMOS 88×  SPAD Array for Time-of-Flight 
Measurement and Light-Spot Statistics 
 
I. Vornicu, R. Carmona-Galán, A. Rodríguez-Vázquez 
Institute of Microelectronics of Seville (IMSE-CNM), CSIC-University of Seville (Spain) 
E-mail: ivornicu@imse-cnm.csic.es 
 
 
Abstract - The design and simulation of a CMOS 88 × single 
photon avalanche diode (SPAD) array is presented. The chip has 
been fabricated in a 0.18μm standard CMOS technology and 
implements a double functionality: measuring the Time-of-
Flight with the help of a pulsed light source; or computing focal-
plane statistics in biomedical imaging applications based on a 
concentrated light-spot. The incorporation of on-chip processing 
simplifies the interfacing of the array with the host system. The 
pixel pitch is 32μm, while the diameter of the quasi-circular 
active area of the SPADs is 12μm. The 113μm2 active area is 
surrounded by a T-well guard ring. The resulting breakdown 
voltage is 10V with a maximum excess voltage of 1.8V. The pixel 
incorporates a novel active quenching/reset circuit. The array 
has been designed to operate with a laser pulsed at 20Mhz. The 
overall time resolution is 115ps. Focal-plane statistics are 
obtained in digital format. The maximum throughput of the 
digital output buffers is 200Mbps. 
I. INTRODUCTION 
Last developments in the integration of single photon 
avalanche diodes (SPADs) in standard CMOS technology [1] 
have opened new research lines in the field of 3D vision and 
medical applications. Time-of-flight (ToF) estimation has 
been successfully employed for distance measurement and 
generating a depth map of the scene [2] . In addition to this, it 
is employed in enhanced imaging techniques for nuclear 
medicine, such as positron emission tomography (PET) [3]. 
ToF measurement requires the control of the illumination 
source. Pulse modulation is the most straightforward 
technique [4], [5]. Also this method has some benefits when 
compared to continuous-wave modulation. It is unambiguous 
and ensures a higher SNR with lower average optical power 
[6]. Obviously, regarding 3D imaging, the most important 
parameters are the overall time resolution and the uniformity 
of the array.  
In the field of nuclear medicine imaging, several 
techniques can be classified as single photon imaging. One of 
them is detecting the high-energy photons emitted directly by 
a radioactive nucleus (SPECT: single-photon emission 
computed tomography). Another one is detecting pairs of 
511-keV photons emitted in exactly opposite directions (PET: 
positron emission tomography) and evaluating their 
coincidence [7]. ToF measurement has started to be used in 
this type of applications in order to reduce the uncertainty 
error along the line of response (LOR) [8]. Basically, a PET 
detector is composed of a scintillator crystal, which convert 
those high-energy photons into visible light photons [9], and 
an array of single-photon detectors, in this case SPADs, close 
to the imaging volume [10].  Because of conversion into 
visible photons can occur at different depths within the 
scintillator crystal —depth of incidence (DOI)—, there is 
uncertainty in the determination of the actual LOR, what 
results in a blurred reconstruction. In order to enhance the 
spatial resolution of the detection, statistics on the light-spot 
can help to precisely determine the DOI [11].  
The chip presented in this paper has a double 
functionality: it can be employed for ToF imaging by 
multiplexing each SPAD to an external time-to-digital 
converter; in addition, each photon detection event is 
accounted for at each column and row of the array in order to 
realize focal-plane statistics.              
II. ARRAY OF SPADS 
The test chip is composed of an array of 88 × SPAD cells. 
Each one of them includes the photosensitive device, the 
active quenching/reset (AQR) circuitry and some peripheral 
circuits to independently select and connect each pixel to the 
output, to create statistics with the actual spot position, and to 
store and serialize the data in the output. The block diagram 
of the chip is presented in Fig. 1.   
 
Figure 1.  Block diagram of the chip 
The size of the chip core is 256256 × μm2 and can be 
easily scaled to a larger array size.  
978-1-4673-5762-3/13/$31.00 ©2013 IEEE 2626
A. ToF configuration  
The digital output of each pixel is multiplexed to one 
single output channel such that ToF measurements are 
performed off-chip. This scheme is more appropriate 
compared to column level buffering due to the fact that a 
bank of digital buffers can present a large mismatch on the 
delay, which can seriously affect the time resolution. The 
time accuracy of one individual SPAD is given by the jitter of 
the avalanche current spike while in the case of arrays the 
uniformity of the back-end circuitry is also an important 
factor. Therefore the variability of the delay, if it is larger 
than the jitter of the SPAD, can compromise the desired time 
resolution. A constant delay could be considered in this case a 
systematic error and can be removed through calibration. 
Finally, the overall distance uncertainty can be divided by 
M  by merely averaging M  ToF measurements [5]. 
B. Light spot statistics configuration 
Fig. 2 sketches the block diagram of the array. Each row 
and column is connected to a different 12b counter through a 
pulled-up line. Therefore each individual event that is 
detected at pixel level is counted once by the row and once by 
the column counters. Obviously events that are detected at the 
same time on the same column or row cannot be individually 
counted, but this scheme is thought for applications in which 
this situation is not common. From the area and power 
consumption point of view, the proposed scheme is a better 
alternative than having a counter integrated at pixel level. 
When the accumulation time window ends the content of the 
counters can be serially read out. An additional output flag is 
used to signal a counter overflow.              
V
D
D
V
D
D
E
N
R
ST
E
N
R
S
T
O
V
F
B
12
B
1
O
V
F
B
12
B
1
 
Figure 2.  Block diagram of an array subset 
C. Digital pixel  
The proposed AQR circuit is presented in Fig. 3. It is 
based on the same principle as the circuit reported in [12], but 
employs a much simpler hold-off circuit. Compared to 
passive QR, AQR has the advantage of achieving fast 
transitions in both phases, an increased sensitivity that avoids 
count losses, and an adjustable hold-off time [13]. The model 
of the SPAD that has been used in the simulations [14] 
merges both the DC characteristic of the model in [15] and 
the statistical model in [16]. The cross section of the 
implemented SPAD is depicted in Fig. 4. Basically, the 
SPAD is a typical P+-diffusion/N-well diode with an 
additional T-well guard ring, i. e. an additional P-well layer. 
Its purpose is equalizing the breakdown voltage (VBD) [17].     
 
Figure 3.  Proposed pixel schematic including the AQR circuit 
The quenching circuit operates as follows: before a 
photon is detected there is no current flowing through the 
reverse-bias diode. The anode is connected to ground and M4 
is turned off, meaning that the SPAD is turned on. In other 
words, the avalanche is ready to be triggered. Whenever a 
photon event is detected a large current flows through the 
SPAD and, consequently, the anode voltage (node A) is 
pulled up. Therefore the output of inverter Inv1 is pulled 
down, M2 and M3 transistor are turned off and on, 
respectively, which means that the anode terminal is 
connected to VDD quenching the avalanche and disabling the 
SPAD by setting up VE to be at most equal to zero. 
Furthermore, with the output of Inv1 set to zero, the MOS 
capacitor starts to charge through M6 and M5. The time 
constant is controlled by Vhold_off signal. When the voltage of 
the capacitor M8 reaches the threshold of Inv3 then Vreset is set 
high turning on M4 transistor. Further, the anode of the SPAD 
is pulled down for a short time, until the output of Inv1 goes 
to VDD turning off M3, a large current will flow through M3 
and M4. Because of this reset mechanism, a large current 
spike is required, sometimes of the same magnitude as the 
current through the SPAD. In order to avoid this 
inconvenience the width of M1, 2 should be larger than M4. On 
the other hand M4 should be stronger than M3 in order to be 
able to pull down the anode terminal.  
 
Figure 4.  Schematic cross section of the implemented SPAD 
Inv2 is used to drive the readout circuitry, that consists in 
a transmission gate to connect the output signal to a column 
bus (signal Vout in Fig. 3), and two transistors M9 and M10 to 
pull down the column (Vcol) and row (Vrow) detection signals, 
respectively. Each SPAD can be asynchronously reset 
through RST signal. Fig. 5 depicts the layout of the pixel.          
2627
 
Figure 5.  Pixel layout  
We have designed an integrated SPAD array in a 0.18μm 
standard CMOS technology. It has a quasi-circular shape 
having a diameter of the active area of 12μm. Thanks to the 
fact that we are interested only in detecting the impinging 
photons without having a linear ISPAD versus VE characteristic, 
the maximum current through the P+-diffusion/N-well diode 
can be limited to 3.5mA by means of M1, 2 transistors. 
III. SIMULATION RESULTS 
The 88 × -SPAD array still needs to be measured and 
characterized together with individual SPAD devices. In the 
following post-layout simulation results regarding ToF 
measurement and AQR circuit functionality are presented. 
A. ToF measurements  
Each SPAD is multiplexed to the output node. Therefore, 
besides the jitter of the current pulse, the time accuracy is 
also given by the uniformity of the signal path. Fig. 6 shows 
the jitter of the digital output due to non-uniformities of the 
readout path. Thus if the maximum jitter of the SPADs is 
small compared to those 115ps, then the overall time 
accuracy is mainly limited by the signal paths non-
uniformity. Notice that for pixel level ToF measurement, with 
a TDC per pixel, the time resolution is defined only by the 
maximum jitter of the sensors and TDC non-uniformity.   
295 300 305 310
0
0.5
1
1.5
Digital output of the array
Time (ns)
Vo
lta
ge
 (V
)
115ps
 
Figure 6.  Post-layout simulations of the time accuracy 
Fig. 7(a) sketches the worst case signal path delay, where 
the contributions of the different elements at pixel, array and 
pad level, can be inferred from the accumulated delay. 
Voltage Vout pixel is the output of one pixel, Vout array is the 
output before row/column multiplexing scheme and Vout PAD 
is the output of the array taken at pad level. This last 
component is the same for all the pixels of the array, thus it 
can be cancelled by calibration without jeopardizing the 
actual ToF. Fig. 7(b) illustrates the operation of the AQR 
circuit, VA is the anode voltage and Vreset is the reset signal 
that turn-on the SPAD. Fig. 7(c) shows the range within 
which the dead time (DT) can be continuously tuned, in order 
to lower the dark-count rate as much as possible. 
0
0.5
1
1.5
2 /Vout pixel /Vout array /Vout PAD
Vo
lta
ge
 (V
)
 
 
348 350 352 354 356 358
0
0.5
1
1.5
/Anode current spike
Cu
rr
e
n
t (m
A)
time(ns)
 
 
pixel
array
PADT1=374ps
T2=2.8ns
 
(a) 
0
0.5
1
1.5
2
/V
reset /VA
Vo
lta
ge
 
(V
)
 
 
248 249 250 251 252 253 254 255 256 257
0
0.5
1
1.5
/I(A)-anode current spike
C
u
rr
e
n
t (m
A)
time(ns)
 
 
V
reset
VA
Active quenching
Active reset
Due to the current that flows
through M3 and M4 
 
(b) 
0
0.5
1
/V
cap
0
1
2
/V
out
Vo
lta
ge
 (V
)
 
 
250 255 260 265 270 275 280 285 290
0
0.5
1
1.5
/I(A)-anode current spike
C
u
rr
e
n
t (m
A)
time(ns)
DT=5.6ns DT=41ns
DT=5.6ns
DT=41ns
 
(c) 
Figure 7.  Post-layout transient simulations: (a) Worst case single 
path delay, (b) AQR circuit transient, (c) DT range 
B. Light spot statistics  
After an appropriate accumulation time, the number of 
counts stored in each row/column counter can be used to 
2628
compute the center of mass and the dispersion of the light-
distribution. Thus the center of mass (μ1) and the variance 
(μ2) can be written as: 
∑∑
==
−==
8
1
2
12
8
1
1 )(,
i
i
i
i piip μμμ
           
 (1) 
where pi is the probability of the i-th row/column of the array 
of being hit. According to [11] the standard deviation of the 
light-distribution ( 2σ μ= ) is directly related with the DOI 
(z) as follows:  
          20 0( )z z σ σ− ∝ −  (2) 
where z0 and σ0 are obtained by experimental data fitting. 
Fig. 8 depicts the signals for light-spot statistics calculation. 
All the SPADs receive the same illumination in this 
simulation; therefore all of them will detect photons at the 
same rate. The signal labeled DIGITAL OUTPUT shows the 
output of one of them. The programmed EN allows counting 
only 10 pulses. After that, the content of the counters is 
serially downloaded, through pad labeled COLUMN 
OUTPUT/ROW OUTPUT. The waveform shows how code 
10 is delivered for every row and column. 
Characterization of the statistical magnitudes like dark-
count rate (DCR) and after-pulsing (AP) for different excess 
voltages (VE) and hold off time and the variation of DCR 
with temperature, the linearity of the sensor and the 
estimation of the power consumption depending on the 
number of the events detected will be reported after full test 
of the chip. 
0
1
2
0
1
2 WRITE/ SHIFT
0
1
2 RESET
0
1
2 EN
Vo
lta
ge
 
(V
)
0
1
2 DIGITAL OUTPUT
0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1
0
1
2 COLUMN OUTPUT/ ROW OUTPUT
time(us)
CLK
 
Figure 8.  Transient simulation with all the SPADs detecting photon events 
IV. CONCLUSION 
A 88× perfectly scalable array of SPADs has been 
designed and sent to fabrication in a 0.18μm standard CMOS 
technology. The pixel pitch is 32μm. Each pixel incorporates 
a SPAD with quasi-circular shape and an active 
quenching/reset circuitry with tunable hold-off time. The chip 
is able to perform ToF measurements and to estimate the 
actual position of the center and the dispersion of a light-spot 
as needed in PET applications. Each pixel can be 
successively connected on a single digital output to measure 
the overall time accuracy. The non-uniformity of the 
multiplexing scheme limits the time resolution to 115ps. In 
addition, maximum spot detection is implemented using a 
column and row level counters scheme.  
ACKNOWLEDGEMENTS 
This work has been funded by the Spanish Ministry of 
Economy and Competitivity through projects IPT-2011-1625-
430000 and IPC-20111009, co-funded by the European Fund 
for Regional Development, and by the US Office of Naval 
Research through grant N000141110312. 
REFERENCES 
[1] H. Finkelstein, M. J. Hsu, and S. C. Esener, "STI-bounded single-
photon avalanche diode in a deep-submicrometer CMOS technology", 
IEEE Electron Device Letters, Vol. 27, 2006, pp. 887-889. 
[2] C. Niclass et al., “A 100m-range 10-frame/s 340x96-pixel time-of-
flight depth sensor in 0.18μm CMOS”, Proc. of the European Solid-
State Circuits Conference (ESSCIRC), Sept. 2011, pp. 107-110. 
[3] E. Auffray, et al. “Towards a time-of-flight positron emission 
tomography system based on multi-pixel photon counter read-out”, 
Nucl. Science Symp. Conf. Rec., 2010, pp. 1050-1055.  
[4] R. Lange and P. Seitz, “Solid-state time-of-flight range camera”, IEEE 
J. of Quantum Electronics, Vol. 37, No. 3, pp. 390-397, March 2001.  
[5] C. Niclass, A. Rochas, P.-A. Besse, and E. Charbon, “Design and 
characterization of a CMOS 3-D image sensor based on single photon 
avalanche diodes”, IEEE J. of Solid-State Circuits, Vol. 40, No. 9, Sept. 
2005, pp. 1847-1854. 
[6] S. B. Gokturk, H. Yalcin, C. Bamji, “A time-of-flight depth sensor – 
system description, issues and solutions”, Computer Vision and Pattern 
Recognition Workshop (CVPRW), June 2004,pp. 35. 
[7] M. Conti, “State-of-the-art and challenges of time-of-flight PET”. 
Physica Medica, Vol. 25, No. 1, March 2009, pp. 1-11. 
[8] T. K. Lewellen, “Recent developments in PET detector technology”,  
Phys. Med. Biol. ,Vol. 53, 2008, pp. R287–R317. 
[9] A. Kuhn, S. Surti, J. S. Karp, G. Muehllehner, F. M. Newcomer, and R. 
VanBerg, “Performance assessment of pixelated LaBr3 detector 
modules for time-of-flight PET”, IEEE Trans. on Nuclear Science, Vol. 
53, No. 3, June 2006, pp. 1090-1095. 
[10] M. Conti, “Improving time resolution in time-of-flight PET”, Nucl. 
Instruments and Methods in Physics Research A, Vol. 648, 2011, pp. 
S194–S198. 
[11] C. W. Lerche et al. "Depth of γ-ray interaction within continuous 
crystals from the width of its scintillation light-distribution," IEEE 
Trans. on Nuclear Science, Vol.52, No.3, pp. 560- 572, June 2005 
[12] R. Mita and G. Palumbo, "High-Speed and Compact Quenching Circuit 
for Single-Photon Avalanche Diodes," Instrumentation and 
Measurement, IEEE Transactions on, vol. 57, pp. 543-547, 2008. 
[13] S. Tisa, F. Zappa, A. Tosi, and S. Cova, "Electronics for single photon 
avalanche diode arrays," Sensors and Actuators A: Physical, vol. 140, 
pp. 113-122, 2007. 
[14] M. Moreno-García et al. “CMOS SPADs Selection, Modeling and 
Characterization Towards Image Sensors Implementation”. IEEE Int. 
Conf. Electronics, C&S (ICECS), pp. 332-335, Dec. 2012 
[15] F. Zappa, A. Tosi, A. Dalla Mora, S. Tisa, “Spice modeling of single 
photon avalanche diodes”, Sensors and Actuators A, Vol. 153, 2009, 
pp. 197-204. 
[16] G. Giustolisi, R. Mita and G. Palumbo, “Behavioral modeling of 
statistical phenomena of single-photon avalanche diodes”. Int. J.  Circ. 
Theor. and Appl., DOI: 10.1002/cta. 748, 2011. 
[17] N. Faramarzpour, M. Jamal Deen, S. Shirani, and Q. Fang, “Fully 
integrated single photon avalanche diode detector in standard CMOS 
0.18-μm technology”, IEEE Trans. on Electron Dev., Vol. 55, No. 3, 
March 2008, pp. 760-767. 
2629
