A monolithic ASIC demonstrator for the Thin Time-of-Flight PET scanner by Valerio, Pierpaolo et al.
P. Valerio,a R. Cardarelli,b G. Iacobucci,a L. Paolozzi,a E. Ripiccini,a D. Hayakawa,a S.
Bruno,b A. Caltabiano,b M. Kaynak,c H. Rücker,c M. Nessi,a,d
aDPNC, Département de physique des particules et corpusculaire,
Geneva
bINFN, Sezione di Roma Tor Vergata,
Roma
cIHP, Leibniz-Institut für innovative Mikroelektronik,
Frankfurt (Oder)
dCERN,
Geneva
E-mail: pierpaolo.valerio@unige.ch
Abstract: Time-of-flight measurement is an important advancement in PET scanners to improve
image reconstruction with a lower delivered radiation dose. This article describes the monolithic
ASIC for the TT-PET project, a novel idea for a high-precision PET scanner for small animals. The
chip uses a SiGe BiCMOS process for timing measurements, integrating a fully-depleted pixel
matrix and a low-power BJT-based front-end per channel on the same 100 µm thick die.
The target timing resolution is 30 ps RMS for electrons from the conversion of 511 keV photons.
A novel scheme based on a patent-pending TDC synchronizes 1.6 million channels
across almost 2000 different chips at the picosecond level. A full-featured
demonstrator chip with a 3×10 matrix of 500×500 µm² pixels was produced to validate each block.
Its design and experimental results are presented here.
Keywords: Analogue electronic circuits, Digital electronic circuits, Front-end electronics for
detector readout, Timing detectors, Pixelated detectors and associated VLSI electronics
arXiv:1811.10246v1 [physics.ins-det] 26 Nov 2018
Contents
1 The TT-PET project
2 System design aspects
3 The TT-PET small-size demonstrator chip
3.1 Specifications
3.2 Front-end design
3.3 Readout logic and other blocks
4 Results
5 Conclusions
1 The TT-PET project
Conventional PET imaging techniques use scintillating crystals to detect two back-to-back photons
produced by a positron-electron annihilation to determine where the annihilation occurred. Without
additional information, the event is placed anywhere on the line of response between the two acquired
signals and, with enough statistics, an accurate image can be reconstructed. The addition of a Time
of Flight (TOF) measurement can restrict the initial placement of the interaction point on the line
of response, reducing it to a segment. More precise timing information corresponds to a shorter
segment, resulting in a less noisy image or in a reduced dose to the patient, since fewer
events are required. To extract useful information on the position of the annihilation point,
a high TOF precision is required (200 ps or better), as the photons travel at the speed of light.
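The link between timing precision and segment length can be illustrated with a short calculation. The sketch below assumes the usual relation Δx = c·Δt/2 between the uncertainty on the time difference of the two photons and the position uncertainty along the line of response; the function name is illustrative, and treating the quoted figures as uncertainties on the time difference is an assumption of this example.

```python
# Speed of light expressed in millimetres per picosecond.
C_MM_PER_PS = 0.299792458


def tof_segment_mm(dt_ps: float) -> float:
    """Position uncertainty along the line of response for a given
    uncertainty dt_ps on the arrival-time difference of the two photons.

    Moving the annihilation point by dx lengthens one photon path by dx
    and shortens the other by dx, hence the factor 1/2.
    """
    return C_MM_PER_PS * dt_ps / 2.0


for dt in (200.0, 30.0):
    print(f"{dt:5.0f} ps -> {tof_segment_mm(dt):5.1f} mm")
```

Under these assumptions, 200 ps corresponds to a segment of about 30 mm, while 30 ps corresponds to about 4.5 mm.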
The goal of the TT-PET (Thin TOF-PET) project is to build a novel small-animal PET scanner with
a target of 30 ps RMS time resolution for photon detection[1]. This value is well beyond the
state-of-the-art for time-of-flight PET systems[2], and is obtained by a radically different approach
compared to traditional scanners. Multiple layers of monolithic silicon pixel detectors and high-Z
photon-converters are stacked to convert incoming photons and digitize hits, providing their 3D
position and timing. Data are reconstructed off-line to correct for systematic offsets, discriminate
coincidences and reconstruct the acquired image.
The TT-PET project is funded by the Swiss National Science Foundation. The front-end design
was carried out by the University of Geneva and the INFN Rome Tor Vergata.
2 System design aspects
The TT-PET scanner is formed by 16 identical wedges, called towers, containing the detector stack,
the mechanical support structures, the cooling and the interconnections (figure 1). Each detection
layer is composed of two 100 µm thick monolithic silicon pixel detectors placed side by side, a
50 µm lead converter and dielectric glue layers, as shown in figure 2. Pixels have an area of 500 µm
by 500 µm, which corresponds to an input capacitance for the front-end of about 500 fF including
routing.
Figure 1. CAD image of the TT-PET scanner, with
the 16 towers and the cooling blocks between them
represented in blue. The wedge-shaped towers are
formed by ASICs of three sizes, with larger ones at
larger radii.
Figure 2. Two detection layers, including a monolithic
detector, a lead converter and glue. The 60
detection layers of a tower are divided into 12 stacks
of 5 layers each (called “super-modules”). The lead
and silicon layers are glued together with 5 µm and
50 µm thick adhesive tape.
Detectors are grouped in sets of 5 layers, called "super-modules", which share services and interconnections.
The chips in a super-module are all connected to the same flex cable with stacked wirebonds and
are daisy-chained to minimize the number of connections needed for the readout.
Cooling is provided by a microchannel liquid flow in the blocks between the towers. This solution
minimizes the dead area, but it can only dissipate a limited amount of power. Heat transfer
simulations by FEA, confirmed by measurements on a mechanical mock-up, were used to calculate
the power budget of the detectors, which was set to 200 µW per channel.
Three different chip sizes (25 mm long and 7, 9 or 11 mm wide) are implemented to form the wedges.
The number of chips was optimized with GEANT4 simulations that allowed the calculation of the
scanner sensitivity and efficiency.
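The system-level numbers quoted so far can be cross-checked with simple arithmetic. The sketch below uses only figures given in the text (16 towers, 60 layers per tower, two chips per layer, 1.6 million channels, 200 µW per channel); the total power is an upper bound implied by the per-channel budget, not a value quoted by the authors, and the variable names are illustrative.

```python
TOWERS = 16
LAYERS_PER_TOWER = 60          # 12 super-modules of 5 layers each
CHIPS_PER_LAYER = 2            # two detectors placed side by side
CHANNELS = 1_600_000           # channel count quoted in the abstract
POWER_PER_CHANNEL_UW = 200     # power budget per channel, in microwatts

# Number of chips in the full scanner.
chips = TOWERS * LAYERS_PER_TOWER * CHIPS_PER_LAYER

# Power implied by the per-channel budget, converted from microwatts to watts.
total_power_w = CHANNELS * POWER_PER_CHANNEL_UW / 1e6
power_per_tower_w = total_power_w / TOWERS

print(f"chips: {chips}")
print(f"total power budget: {total_power_w:.0f} W")
print(f"power budget per tower: {power_per_tower_w:.0f} W")
```

The chip count of 1920 matches the "almost 2000" chips of the abstract, and the budget corresponds to at most 320 W for the scanner, or 20 W per tower, to be removed by the cooling blocks.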
3 The TT-PET small-size demonstrator chip
After some small-scale test structures, a 3×10 matrix of fully-featured pixels (shown in figure 3)
was submitted in an MPW run in spring 2017. The chip has been fully characterized with radioactive
sources and at the SPS beam test facility at CERN.
Each of the 500×500 µm² pixels includes a BiCMOS preamplifier, a fast discriminator and an 8-bit
calibration DAC for threshold equalization, placed in a column next to the active collection area. In
the periphery a single TDC is used to digitize timing information, with all the pixels multiplexed
to it. A digital logic block encodes the digitized data along with the hit position and implements
a simple I/O protocol for chip readout and configuration. Other blocks include tunable biasing
structures for the analog circuits. A block diagram of the pixel electronics can be found in figure 4.
Figure 3. Layout of the TT-PET demonstrator chip, with a 3×10 pixel matrix. On the left are five guard-ring
test structures, which were submitted to test the high-voltage insulation of the pixels independently.
[Figure 4 block labels: Pixel, Analog Front-End (PMOS-based resistors), MOS Discriminator, threshold, CAL DAC, HIT LATCH, To Logic]
Figure 4. Block diagram of the pixel electronics. The pixel is shown as a diode, connected to the BJT-based
preamplifier. Its output is discriminated by an open-loop MOS amplifier, controlled by a local DAC to adjust
its threshold. The digitized output is sampled by a latch and sent to the periphery to the TDC.
3.1 Specifications
The main specifications for the front-end are shown in table 1.
Parameter                                                Value
Power supply                                             1.8 V
Gain                                                     90 mV/fC
Equivalent Noise Charge (for a 1 pF input capacitance)   600 e−
Power consumption                                        135 µW
Peaking time                                             1.3 ns
Simulated ToA jitter (for a 1 fC signal)                 82 ps
Table 1. Main specifications of the simulated analog front-end
The pixel size is a compromise between input capacitance and power consumption. Having smaller
pixels would lead to better spatial resolution of the scanner, but since a PET image has an intrinsic
resolution of about 500 µm[3], the image quality would not improve. A smaller pixel would result in
a smaller input capacitance for the amplifier, and thus lower noise, leading to more accurate timing.
On the other hand, more channels would be required to cover the same area, so power consumption
would increase.
Noise is the main contributor to the timing resolution. Given an accurate enough TDC (TDCs with
precision of a few ps can be found in literature[4]), the uncertainty is dominated by the effect of
the analog front-end. This includes different factors, such as the pixel-to-pixel threshold variation,
the intrinsic electronic noise of the preamplifier and the distribution of charge collection time in the
substrate.
3.2 Front-end design
The front-end features a preamplifier using a Silicon-Germanium Heterojunction Bipolar Transistor
(specifically, IHP 130 nm SiGe-HBT technology), which was chosen to minimize the series noise,
the main contribution to the noise performance[5]. This front-end was already
tested and found to perform well, with the capability of achieving a 100 ps jitter for up to 1 pF input
capacitance[6] (see footnote 1).
The amplifier is connected to the input diode, which is integrated in the electronics substrate since
the chip is monolithic. The chip has a 1 kΩ substrate and is thinned to 100 µm in order to optimize
the charge collection time and increase the electric field uniformity. Ground reference is provided
to the cathode through a back-plane metallization, while the anode is capacitively coupled to the
front-end input. Figure 5 shows the I-V characteristic of the pixel matrix up to a voltage of 200 V.
The leakage current is less than 0.6 nA per channel, and it is mostly due to the implantation process
performed on the backplane. Since the front-end is capacitively coupled to the sensor, the dark
current is filtered out and it has a negligible impact on the chip performance.
Figure 5. I-V curve of the 3×10 pixel matrix, connecting the backplane to ground and HV through a resistive
distribution network. The current going through the diode is in blue (measured when both increasing and
decreasing the voltage to show the hysteresis), while the current flowing through the guard ring is in orange.
The preamplifier schematic is shown in figure 6. The BJT is used in a simple common-emitter
configuration, with an active PMOS load and a MOSFET feedback, which can be tuned to adjust
the equivalent feedback impedance.
[Figure 6 schematic labels: Q1, CC, CDET, M1 to M5, In, Out, VCC, Bias Preamp, Bias Feedback]
Figure 6. Schematics of the preamplifier. The left block is a common-emitter configuration capacitively
coupled to the sensor, while the right one emulates a floating MOS-based feedback resistor which can be
tuned from the periphery with a current DAC.
Footnote 1: This value is compatible with the target of 30 ps for 511 keV photons. Detailed GEANT4 simulations showed that the
average charge deposited by a PET photon would be more than three times larger than the one deposited by a minimum
ionizing particle.
The choice of a common-emitter configuration comes from the need to minimize the input and
output capacitances to achieve a high gain while keeping the rise time as short as possible. Indeed,
the time resolution is directly proportional to the rise time and inversely proportional to the
signal-to-noise ratio[5]. This implementation features a 20%-80% rise time of about 600 ps. The total charge
integration time is about 1.3 ns, which is compatible with the charge collection time in silicon. A
plot of the simulated output of the preamplifier is shown in figure 7. Because the peaking
time is much larger than the target time resolution, time walk must be taken into account and compensated
when calculating the time of arrival: different input charges can change the time stamp
by hundreds of ps, as shown in figure 8. This is done by estimating the charge with a
time-over-threshold measurement and then correcting the time-walk error off-line. Figure 9 shows
the Equivalent Noise Charge referred to the input of the preamplifier for different values of input
capacitance.
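The proportionality between time resolution, rise time and signal-to-noise ratio can be turned into an order-of-magnitude estimate. The sketch below assumes a linear leading edge, so that the jitter is the noise divided by the slope, and takes the 20%-80% rise time to span 60% of the amplitude; it uses the ENC of table 1 and is not a substitute for the simulations quoted in this paper. The function and parameter names are illustrative.

```python
# Number of electrons in 1 fC (1e-15 C divided by the elementary charge).
ELECTRONS_PER_FC = 6241.5


def jitter_ps(rise_time_ps, charge_fc, enc_electrons, swing_fraction=0.6):
    """Rough jitter estimate: noise divided by the slope of the leading edge.

    The slope is approximated as swing_fraction * amplitude / rise_time, so
    the jitter becomes rise_time / (swing_fraction * SNR), where SNR is the
    ratio of the signal charge to the Equivalent Noise Charge.
    """
    snr = charge_fc * ELECTRONS_PER_FC / enc_electrons
    return rise_time_ps / (swing_fraction * snr)


# Inputs from the text: 600 ps rise time (20%-80%), 600 e- ENC at 1 pF.
for charge in (1.0, 3.0):
    print(f"{charge:.0f} fC -> {jitter_ps(600.0, charge, 600.0):.0f} ps")
```

With these inputs the estimate is slightly below 100 ps at 1 fC, the same order as the simulated value in table 1, and it scales inversely with the charge. This is why a deposit about three times larger brings the jitter close to the 30 ps target.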
Figure 7. Typical output of the preamplifier for
an input charge of 1 fC, extracted from a Cadence
Spectre simulation.
Figure 8. Effect of time-walk on the measurement,
showing the difference in timing for different input
charges, ranging from 1 to 20 fC.
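An off-line time-walk correction based on time over threshold can be sketched as a lookup with linear interpolation. The calibration table below is made up for illustration; in practice it would come from a dedicated calibration, and the actual correction procedure used for this chip is not described in the text. All function names are hypothetical.

```python
from bisect import bisect_right


def make_time_walk_correction(calib_points):
    """Build a delay lookup from (tot_ns, delay_ps) calibration pairs."""
    pts = sorted(calib_points)
    tots = [p[0] for p in pts]
    delays = [p[1] for p in pts]

    def delay_for(tot_ns):
        # Clamp outside the calibrated range, interpolate linearly inside it.
        if tot_ns <= tots[0]:
            return delays[0]
        if tot_ns >= tots[-1]:
            return delays[-1]
        i = bisect_right(tots, tot_ns)
        frac = (tot_ns - tots[i - 1]) / (tots[i] - tots[i - 1])
        return delays[i - 1] + frac * (delays[i] - delays[i - 1])

    return delay_for


# Hypothetical calibration: small signals (short ToT) cross the threshold later.
delay_for = make_time_walk_correction(
    [(2.0, 400.0), (4.0, 150.0), (8.0, 50.0), (16.0, 0.0)]
)


def corrected_toa_ps(toa_ps, tot_ns):
    """Subtract the charge-dependent delay from the measured time of arrival."""
    return toa_ps - delay_for(tot_ns)


print(corrected_toa_ps(1000.0, 3.0))
```

A lookup table is used here because the shape of the time-walk curve does not need to be known analytically; a fitted function would serve equally well.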
[Figure 9 plot: Equivalent Noise Charge [e-] (200 to 600) versus input capacitance [pF] (0.2 to 1).]
Figure 9. Equivalent Noise Charge of the preamplifier as a function of the input capacitance of the detector.
This plot does not include the contribution of the discriminator, which filters part of this noise due to its limited
bandwidth.
Each preamplifier is connected to a 3-stage MOS discriminator with a 4 mV hysteresis to compare
its output with a fixed threshold. In order to minimize the load capacitance of the amplifier, the input
stage of the discriminator uses very small NMOS transistors, leading to a significant pixel-to-pixel
threshold mismatch (simulations showed a 3σ value of 100 mV). To compensate for this effect, an
8-bit calibration DAC is included in each front-end. It is a binary-weighted, current-steering DAC
connected to the first stage of the discriminator; it unbalances the current flowing in
the two branches and thereby moves the effective threshold of the discriminator. This can also compensate for
other pixel-to-pixel effects, due for example to the DC output of the preamplifier. The total current
produced by the DAC can be tuned to change the calibration range.
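Threshold equalization with a per-pixel calibration DAC amounts to choosing, for each pixel, the code that brings its effective threshold closest to a common target. The sketch below assumes that mid-scale corresponds to zero shift and that each LSB moves the threshold by a fixed step; both are assumptions of this example, as are the numerical values and the function name.

```python
def equalize(thresholds_mv, target_mv, lsb_mv, n_bits=8):
    """Return one DAC code per pixel that moves its threshold towards target_mv.

    Assumes code 2**(n_bits - 1) gives no shift and each LSB shifts the
    effective threshold by lsb_mv (hypothetical transfer function).
    """
    mid = 2 ** (n_bits - 1)
    max_code = 2 ** n_bits - 1
    codes = []
    for thr in thresholds_mv:
        code = mid + round((target_mv - thr) / lsb_mv)
        # Clamp to the range the DAC can actually produce.
        codes.append(min(max(code, 0), max_code))
    return codes


# Made-up measured thresholds for three pixels, equalized to 890 mV.
print(equalize([880.0, 905.0, 850.0], target_mv=890.0, lsb_mv=1.0))
```

Pixels whose required shift exceeds the DAC range end up at code 0 or 255, which is one reason why a tunable calibration range is useful.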
3.3 Readout logic and other blocks
Given the low hit rate expected in any of the TT-PET chips, all pixels are multiplexed to the
same TDC, so the chip cannot detect simultaneous particles. Since such events are
very rare[7], this approach was chosen to simplify the design and reduce the power consumption of
the chip. A single TDC with 50 ps binning is placed in the chip periphery and all pixels are connected
to it through a balanced ladder of NAND/NOR gates. The TDC measures both time of arrival and
time over threshold of the signal, the latter being used to compensate for time-walk effects. A separate set of row
and column lines is used to extract the pixel address and store it in a readout buffer. Pixel-to-pixel
delay, while minimized by the balanced multiplexing network, is still larger than the time resolution,
so it requires off-line calibration. The contribution of the digital chain to the time resolution was
measured using a test-pulse injection circuit and found to be of the order of 1 ps.
The chip features a simple serial interface for both readout and programming; configuration data are shifted into
the pixel configuration memories, which are connected as one long shift register. Since the chip can only
store a single hit at a time, there is a dead time of about 5 µs (this value is much […]) after every hit to
allow for the readout of the TDC data. A trigger signal, produced by a fast OR of all the pixels, is
also available as an output to implement a trigger logic or for debugging purposes.
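The effect of the 5 µs dead time on efficiency can be estimated with the standard non-paralyzable dead-time model, in which hits arriving while the chip is busy are lost without extending the busy period. Treating the chip this way is an assumption of this sketch, and the hit rates below are hypothetical, since the text only states that the expected rate is low.

```python
def recorded_fraction(true_rate_hz, dead_time_s=5e-6):
    """Fraction of hits recorded under a non-paralyzable dead-time model."""
    return 1.0 / (1.0 + true_rate_hz * dead_time_s)


# Hypothetical per-chip hit rates, for illustration only.
for rate in (1e3, 1e4, 1e5):
    print(f"{rate:8.0f} Hz -> {100.0 * recorded_fraction(rate):5.1f} % recorded")
```

At a hypothetical rate of 10 kHz per chip, about 95% of the hits would still be recorded, which illustrates why a single-hit buffer is acceptable when the rate is low.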
4 Results
The demonstrator chip was thoroughly tested with a 90Sr source at the University of Geneva. For
testing purposes the fast trigger signal was very useful, as it allowed the pixel front-end and the
TDC to be characterized and debugged separately. The chip is fully working at the nominal
power consumption.
Noise scans were performed by sweeping the threshold and looking at the real-time output of the
fast-OR with an oscilloscope. S-curves (figure 10) were produced and fitted to extract the electronic
noise at the output of the preamplifier. The error function fitting the experimental data corresponds
to a Gaussian distribution with a standard deviation of 2.35 mV, corresponding to an input-referred noise
of less than 400 electrons. It should be noted that the discriminator has an important impact on
this measurement, as it acts as a band-pass filter for the noise. According to Cadence Spectre
simulations, the noise standard deviation at the output of the discriminator is 30% lower
than that at the output of the preamplifier.
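The extraction of the noise from an S-curve can be illustrated on synthetic data. Instead of the error-function fit used for the measurement, the sketch below differentiates the S-curve and takes the mean and standard deviation of the resulting distribution, which is a simpler stand-in technique that needs no fitting library. The scan range follows figure 10, while the 887 mV midpoint is an arbitrary value chosen for the example.

```python
import math


def s_curve(vth_mv, mu_mv, sigma_mv):
    """Ideal falling S-curve for Gaussian noise around mu_mv."""
    return 0.5 * (1.0 - math.erf((vth_mv - mu_mv) / (math.sqrt(2.0) * sigma_mv)))


def noise_from_s_curve(vth_mv, occupancy):
    """Estimate the 50% point and the noise sigma from a falling S-curve.

    The step between consecutive points is the probability content of each
    threshold bin, so its mean and standard deviation describe the noise.
    """
    mids, weights = [], []
    for i in range(len(vth_mv) - 1):
        mids.append(0.5 * (vth_mv[i] + vth_mv[i + 1]))
        weights.append(occupancy[i] - occupancy[i + 1])
    total = sum(weights)
    mean = sum(m * w for m, w in zip(mids, weights)) / total
    var = sum(w * (m - mean) ** 2 for m, w in zip(mids, weights)) / total
    return mean, math.sqrt(var)


# Synthetic threshold scan from 875 to 900 mV in 0.25 mV steps.
scan = [875.0 + 0.25 * i for i in range(101)]
data = [s_curve(v, 887.0, 2.35) for v in scan]
mu_est, sigma_est = noise_from_s_curve(scan, data)
print(f"midpoint = {mu_est:.2f} mV, sigma = {sigma_est:.2f} mV")
```

On noiseless synthetic data this recovers the input values; on measured data a fit is more robust, because the derivative amplifies statistical fluctuations.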
[Figure 10 plot: normalized output (0 to 1) versus Vth [mV] (875 to 900).]
Figure 10. S-curve noise measurement at the output of the discriminator.
Time-of-flight measurements were performed with a 90Sr source. Two chips were placed on top of each
other and the time differences between them were recorded and analyzed. The time-of-flight distribution
between the two chips is shown in figure 11. The measured time resolution of 130 ps for the core
of the distribution is a very promising result, far better than what was previously achieved by
monolithic particle detectors.
Combined simulations of the sensor and the electronics showed an expected resolution of 92 ps.
The larger value measured can be attributed to a non-ideal correction for time-walk and to an added
input capacitance due to pixel routing, in addition to possible system-level cross-talk from the
readout system.
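The size of the unexplained contribution can be gauged by subtracting the simulated resolution from the measured one in quadrature. This assumes that the additional effects are statistically independent of those included in the simulation, which the text does not state; the function name is illustrative.

```python
import math


def excess_in_quadrature_ps(measured_ps, expected_ps):
    """Extra term that, added in quadrature to expected_ps, gives measured_ps."""
    if measured_ps < expected_ps:
        raise ValueError("measured value is smaller than the expected one")
    return math.sqrt(measured_ps ** 2 - expected_ps ** 2)


extra = excess_in_quadrature_ps(130.0, 92.0)
print(f"additional contribution: {extra:.0f} ps")
```

Under this assumption the effects listed above would together account for roughly 90 ps, comparable to the simulated resolution itself.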
[Figure 11 plot: number of events (0 to 20000) versus Δt [ns] (−3 to 3). Histogram statistics: 253045 entries, mean −0.00563, std dev 0.32, χ²/ndf = 2069/194. Double-Gaussian fit: N_core = 9097 ± 48.0, µ_core = 0.004546 ± 0.000541, σ_core = 0.1807 ± 0.0007, N_tail = 3019 ± 44.7, µ_tail = −0.03561 ± 0.00223, σ_tail = 0.4734 ± 0.0031.]
Figure 11. Time-of-flight between two chips obtained with a 90Sr source. This distribution is fitted with
a double Gaussian; the standard deviation σcore = 180 ± 0.7 ps points to a time resolution of approximately
130 ps, assuming equal performance for the two chips.
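The step from the width of the time-difference distribution to the single-chip resolution follows from the assumption of two independent chips with equal performance, whose variances add. A minimal sketch of that arithmetic, with an illustrative function name:

```python
import math


def single_chip_resolution_ps(sigma_diff_ps, n_equal=2):
    """Resolution of one chip when n_equal identical, independent chips
    contribute equally to the measured time-difference width."""
    return sigma_diff_ps / math.sqrt(n_equal)


print(f"{single_chip_resolution_ps(180.0):.1f} ps")
```

For σcore = 180 ps this gives about 127 ps, which is rounded to 130 ps in the text.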
5 Conclusions
The design of the demonstrator of a monolithic pixel detector for the TT-PET project was presented,
together with test results. The chip includes a novel SiGe BiCMOS-based front-end to achieve
better than state-of-the-art time resolution. A time resolution of 130 ps was measured with a 90Sr
setup, with a power consumption as low as 135 µW per channel.
Acknowledgments
We would like to thank the Electrical Engineering team of the University of Geneva as well as the
colleagues of the University of Bern and INFN Tor Vergata for their help with the readout system.
This study was funded by the SNSF SINERGIA grant CRSII2_160808.
References
[1] P. Valerio et al., A high-Precision Timing ASIC for TOF-PET Applications, PoS TWEPP (2017) 043,
doi:10.22323/1.313.0043.
[2] S. Vandenberghe, E. Mikhaylova, E. D’Hoe, P. Mollet and J. S. Karp, Recent developments
in time-of-flight PET, EJNMMI Physics, vol 3, issue 1, 2016
[3] W. W. Moses, Fundamental limits of spatial resolution in PET, Nuclear Instruments and Methods in
Physics Research Section A: Accelerators, Spectrometers, Detectors and Associated Equipment,
Volume 648, Supplement 1, 21 August 2011
[4] M. Zhang, H. Wang, Y. Liu, A 7.4 ps FPGA-Based TDC with a 1024-Unit Measurement Matrix,
Sensors (Basel). 2017;17(4):865. Published 2017 Apr 14. doi:10.3390/s17040865
[5] L. Paolozzi, Development of particle detectors and related Front End electronics for sub-nanosecond
time measurement in high radiation environment, PhD thesis at University of Roma Tor Vergata, 2014
[6] M. Benoit et al., 100 ps time resolution with thin silicon pixel detectors and a SiGe HBT amplifier,
Journal of Instrumentation, vol 11, issue 3, 2016
[7] E. Ripiccini et al., Expected performance of the TT-PET scanner, publication in preparation
