Report on power, thermal and reliability prediction for 3D
  Networks-on-Chip by Dang, Khanh N. et al.
Technical Report 
1 
Report on power, thermal and reliability prediction for 3D 
Networks-on-Chip 
Khanh N. Dang*1, Akram Ben Ahmed2, Abderazek Ben Abdallah3, Xuan-Tu Tran1 
1VNU-key Laboratory for Smart Integrated System (SISLAB), VNU University of Engineering and 
Technology, Vietnam National University, Hanoi, Vietnam  
2National Institute of Advanced Industrial Science and Technology (AIST), Tsukuba, 305–8568, Japan 
3University of Aizu, Aizu-Wakamatsu, Japan 
 
Abstract 
By combining Three Dimensional Integrated Circuits with the Network-on-Chip infrastructure to obtain 3D 
Networks-on-Chip (3D-NoCs), the new on-chip communication paradigm brings several advantages on lower 
power, smaller footprint and lower latency. However, thermal dissipation is one of the most critical challenges for 
3D-ICs where the heat cannot easily transfer through several layers of silicon. Consequently, the high-temperature 
area also confronts the reliability threat as the Mean Time to Failure (MTTF) decreases exponentially with the 
operating temperature. Apparently, 3D-NoCs must tackle this fundamental problem in order to be widely used. 
Therefore, in this work, we investigate the thermal distribution and reliability prediction of 3D-NoCs. We first 
present a new method to help simulate the temperature (both steady and transient) using traffics value from realistic 
and synthetic benchmarks and the power consumption from standard VLSI design flow. Then, based on the 
proposed method, we further predict the relative reliability between different parts of the network. Experimental 
results show that the method has an extremely fast execution time in comparison to the acceleration lifetime test. 
Furthermore, we compare the thermal behavior and reliability between Monolithic design and TSV-based TSV.  
We also explorer the ability to implement the thermal via a mechanism to help reduce the operating temperature. 
Keywords: Thermal dissipation, Reliability, 3D-ICs, 3D-NoCs. 
 
   
1. Introduction* 
3D Networks-on-Chip (3D-NoCs), as a result of combining Networks-on-Chip (NoCs) [1] with 3D 
Integrated Circuit (3D-ICs) [2], is considered as one the most promising technologies for IC design [3]. 
By providing parallelism and scalability of the NoCs to 3D-ICs, we even obtain lower power 
consumption, shorter wire length while reduce the design area cost by several times. Among several 3D-
ICs, Through-Silicon-Via which constitutes as inter-layer wire is one of the near-future technologies. 
Monolithic 3D ICs is another method to implement the 3D-ICs [30][34].  With both technologies, we 
expect to have multiple layers of the system. To support communication within the system, 3D-NoCs 
offer router-based infrastructure where the 3D mesh topology is used. 
Despite several advantages, 3D-ICs and 3D-NoCs have to confront the thermal dissipation issue. 
The temperature variation between the two layers has been reported to reach up to 10°C in [4]. Cuesta 
et al. [5] also conduct an experiment of four-layer and 48 cores which gives the temperature variation 
up to 10°C between a single layer. The main reason of thermal dissipation difficulty in 3D-ICs is the top 
layers act like obstacles that prevent the heat could be dissipated by the heatsink. To solve this problem, 
fluid cooling [5] or thermal cooling TSV [6] have been proposed. 
________ 
* Corresponding author. E-mail.: khanh.n.dang@vnu.edu.vn 
 
  Technical Report 
 
By having a higher operating temperature, it is apparent that 3D-NoCs easily encounter thermal 
throttling. Moreover, in terms of reliability, there is an expected acceleration in the failure rate (or 
reduction in Mean-time-to-Failure). For semiconductor devices, one of the most well know model of 
thermal impact in reliability is the Black’s model [7] where the fault rate acceleration πT is: 
𝜋𝑇 = 𝐴(𝐽)
𝑛 × 𝑒
− 
𝐸𝑎 
𝑘𝐵𝑇 (1) 
where 𝐴 is constant, 𝐽 is the energy, 𝑘𝐵 is Boltzmann constant, 𝐸𝑎 is activation energy and 𝑇 is the 
temperature in Kelvin. Here, we would like to note that the activation energy of Copper is much higher 
than CMOS material which makes TSV more vulnerable than the normal gate. Since TSV can act as a 
cooling device, TSV-based NoC has a lower operating temperature than Monolithic; however, TSV also 
has lower reliability. Therefore, the reliability differences between Monolithic and TSV-based 3D-ICs 
also need to be investigated. 
While the thermal behavior could be extracted by performing the real-chip, reliability cannot be 
directly measured. Most industrial methods based on  Black’s model [7] in Equation 1 by baking the 
chip under high temperature to accelerate the failure [31][32][33]. 
In this work, we  investigate the impact of the thermal dissipation difficulty of 3D-ICs by proposing 
a method to predict the temperature and MTTF of each region of the system. We first use the commercial 
EDA tools [8][9] to design and analyze the power and energy per data bit of 3D-NoC router. Then, we 
extract the number of bit and operating time of synthetic and PARSEC benchmarks to obtain the average 
power consumption of each router inside the network. We then use a thermal emulation tool named 
Hotspot 6.0 [10] to obtain the steady grid temperature of the system. By adopting the Black’ model of 
reliability, the tool follows up with a reliability prediction of the system. By following the method, the 
designer can fast extract the potential hotspots inside the 3D-ICs and predict the potential of the 
vulnerable region due to high operating temperatures. The results also suggest the possible mapping of 
fluid cooling or thermal TSV insertion [5][21][22]. The contribution of this work is as follows: 
 A platform to model the power, temperature, and reliability of any NoC systems. Here, we specify 
for 3D-NoC but the technique is general can be applied for the traditional planar NoC system. 
 Analyze the reliability of Monolithic and TSV-based NoC. While TSV-based NoC has a lower 
operating temperature, TSV’s material (Copper) has lower reliability. 
 Exploration and comparison between different layout strategies and cooling methods. 
The rest of this work is organized as follows. Section 2 first survey the existing works. Section 3 
describes the proposed method in detail. Experimental results are discussed in Section 4. Finally, Section 
5 concludes this work. 
2. Related works 
In this section, we summary the literatures related to our proposed method. We start with the power 
model and then present the work on thermal estimation. Finally, the reliability estimations for 3D-NoCs 
are presented. 
2.1. Power modeling for 3D Network-on-Chip 
To measure the power consumption of a 3D-IC, the straight forward method is to fabricate and set 
up a measuring system [15]. However, it is difficult to obtain such a system, especially designing and 
fabricate the chip is expensive and designers want to estimate the value before sending to production. 
Therefore, modeling the power consumption is a necessary step. 
To model the power of any digital IC system, two major parts which are static and dynamic power 
is considered as follows: 
P.T. Nam et al. / VNU Journal of Science: Comp. Science & Com. Eng., Vol. 31, No. 3 (2015) 1-9 13 
𝑃 = 𝑃𝑑𝑦𝑛𝑎𝑚𝑖𝑐 + 𝑃𝑠𝑡𝑎𝑡𝑖𝑐 = 𝑠𝑓𝑐𝐶𝐿𝑉𝐷𝐷
2 + 𝐼𝑜𝑓𝑓𝑉𝐷𝐷 (2) 
where 𝑠 is the switching probability (or activity ratio), 𝑓𝑐 is the clock frequency, 𝐶𝐿 is the load 
capacitance, 𝐼𝑜𝑓𝑓 is the leakage current and 𝑉𝐷𝐷 is the supply voltage. Based on Equation 2, common 
EDA tools [8][9] can estimate the power consumption based on the parameter of the library and the 
switching activity. In fact, power estimation tool such as Primetime requires switching activity to obtain 
the most accurate result. 
Using Equation 2 can estimate the power consumption of any circuit; however, for a fast prediction, 
the power consumption of NoCs can be obtained by its switching activity. By obtaining the number of 
flits went through the router during simulation, it can estimate the dynamic power consumption. 
Meanwhile, the static power consumption is constant for the same configuration (voltage, frequency, 
design). For instance, ORION 2.0 [16] model power consumption as dynamic and static power. Physical 
parameters such as wire length and leakage current are calculated to estimate the static power. In [17], 
the authors use regression to estimate the power consumption of the system based on the existing values. 
Other works in [18][19] also consider dynamic voltage frequency scaling in power consumption.  
While these works can help estimate the power consumption of our system, we observe it is not the 
most accurate once because of the differences in design choice and library. Therefore, in this work, we 
propose our own power extraction method. We used the EDA tools to estimate the dynamic and static 
power and then combine with the switching of the routers in the used benchmarks. 
2.2. Thermal behavior prediction for 3D Network-on-Chip 
Once we obtain the power consumption of modules within a system, we can estimate the 
temperature the chip. HotSpot [20] is one of the first tool to help estimate the temperature grid. The 6th 
version of HotSpot now can estimate the temperature of 3D-ICs. There are also different tools such as 
3D-ICE [21] and MTA [22]. While MTA performs a similar task as Hotspot by using the finite element 
method, 3D-ICE focuses on the potential of liquid cooling. Cuesta et al. [5] also explored different 
layout strategies and liquid cooling for 3D-ICs. 
2.3. Reliability prediction for 3D Network-on-Chip 
By having the temperature of the system, we now can estimate the potential reliability. As we 
previously have shown, Black’s model [7] in Equation 1 is one of the first models for CMOS. MIL-
HDBK-217F of the US Military [12] also released its own model of reliability acceleration related to 
temperature. HRD4 from industry [13] and RAMP from academics [14] are the other two models to 
estimate the reliability of the system. 
Among these model, HRD4 consider the reliability as the same for the chip bellow 70°C. The rest 
of the models follows the exponential acceleration with operation temperature (in Kelvin). 
On the other hand, industrial approaches on reliability prediction [31][32][33] is to bake the chip to 
high temperature and measure the average time to failure of the samples. By using Black’s model, they 
can estimate the potential lifetime reliability under normal temperature. 
3. Proposed method 
Figure 1 shows the proposed method for the thermal and reliability prediction of 3D-NoCs. We first 
built Verilog HDL of 3D-NoC. Then, synthesis and place and route are the following steps to obtain the 
layout, netlist file, wire length, and physical parameters.  
  Technical Report 
 
We then perform post-layout simulation and use Synopsys Primetime to extract the power 
consumption of the system. Based on the number of data-bit, we further extract the energy per data bit. 
Then, we now can estimate the power consumption of all benchmarks by multiplying the obtained value 
with the number of bit per router per time. The power consumption of each router is taken to the 
temperature estimator tool (Hotspot 6.0) to obtain the temperature map. At the end of this step, we obtain 
all temperature maps of all benchmarks. 
 
Figure 1: Thermal and reliability prediction method of 3D Networks-on-Chip. 
One notable thing in 3D-NoCs is the possibility to have redundant Through-Silicon-Vias (TSVs). 
TSVs are usually made out of Copper and have a larger size than normal wire which can dissipate heat 
faster than normal silicon. Monolithic 3D-ICs fails to have the same feature since the via is extremely 
small. Consequently, we take the redundancy mapping into the hotspot prediction. 
Once we can predict the temperature, we can obtain the reliability prediction using Black’s model 
in Equation 1. Note that the activation energy also varies among materials. The output of reliability can 
also affect redundancy mapping as a close loop. Consequently, designers can further optimize the system 
to have the most balancing point of temperature, reliability, and area overhead. In the following part, we 
explained in detail each part of the proposed method. 
We would like to note that our method reuses and follows the principle of existing works in academic 
and industrial approaches [8][9] [20][31][32][33]. 
3.1. Design of 3D Network-on-Chip 
Here, we adopted our previous work in [3] with some modifications where the TSVs of a router is 
divided into four groups and place in four directions (west, east, north, south) of the router to support 
sharing and fault tolerance. However, we here provide more flexibility in the design since fault tolerance 
is not our objective of this work. Figure 4 shows the architecture of our 3x3x3 Network on Chip. Each 
router can connect to at most six neighboring routers in six directions and one local connection to its 
attached processing element. The inter-layer connections are TSV and we support optional redundant 
TSV group (yellow TSVs) which can be used to repair a faulty group group in the router. Borrowing 
and sharing mechanisms are another features we support to have high reliability in our system. More 
details on fault tolerance method can be seen in our previous work [3].  
Each router receives a header flit of packet and support routing inside the network. Based on the 
destination, it forwards the header flit and the following flits (body and tail flits) to the desired port. 
Once the tail flit completes its transmission, the router starts to route the new packet. 
In the router layout of [3], the design is not well optimized since it leases space between routers in 
layout. Figure 2(a) shows the layout of [3]. In order to optimize it, we use two different floorplans in 
this work. We first place TSVs and router logic in separated regions as in Figure 2 (b). Then, we place 
P.T. Nam et al. / VNU Journal of Science: Comp. Science & Com. Eng., Vol. 31, No. 3 (2015) 1-9 13 
TSV surround the router logics as in Figure 2 (c). We can notice that we reduce the size of the router 
significantly by removing the empty space.  
 
 
Figure 2: Layout option for 3D-NoC router: (a) Previous work in [12]; (b) Separated TSV region; (c) Surround 
TSV region. 
 
Figure 3: 3D IC layer structure (heat sink on top) of Monolithic 3D IC vs TSV-based 3D IC [35]. 
Among the two new layouts, Figure 2(c) provides the best thermal balance because it isolates the 
logic of a router to the nearby module. Since routers are usually hotspots inside the system, placing them 
near a hot area can raise it temperature significantly. Here, by surrounding by TSVs, we create isolation 
for the router. Furthermore, Copper has low thermal resistivity which can dissipate the heat from the 
router to the upper layers. By doing so, we can transfer then heat to the top layer and the heatsink. In the 
evaluation section, we then discuss the efficiency and cost of inserting thermal via in our design. 
Figure 3 shows the different between Monolithic and TSV-based 3D-ICs. While TSV is made out 
of Copper that dissipate thermal faster than Silicon layers. However, there are bonding layers between 
stacking using TSV which creates an isolation of thermal disspation between them. 
Our 3D-NoC also support fault tolerance for not only TSVs but also for soft and hard fault [36-47]. 
For soft error in data, we used two SECDED(22,16) to detect and correct two and four faults [36][38], 
respectively. For pipeline stage, we support double execution for detection and correction using a third 
execution [36][45][46]. We also tolerate permanent faults in crossbar, buffer and link [36][38][48] 
  Technical Report 
 
 
Figure 4: Architecture of our 3D Network-on-Chip with the size of 3x3x3. 
3.2. EDA tools and power extraction 
The following part of the method is to use EDA tool to extract the power consumption. Apparently, 
we can use any supported EDA to obtain power consumption. For our experiment, we use Synopsys 
Design Compiler, ICC and Primetime to do the physical design and extract the power consumption.  
To extract the power, we perform a heuristic transmission benchmark of a single router. Here, we 
generate two packets of ten flits in all possible directions. Because our router supports returns the flit 
from it sending ports, we have 7x7=49 possible directions. By using PrimeTime, we can obtain the 
dynamic and static power. 
Here, we also classify the energy into static and dynamic. While static power consumption is stable, 
we keep the value as it is. For the dynamic power, we calculate the total energy and the energy per data 
bit. 
3.3. Power and Temperature Estimation 
Once we obtain the energy per data-bit, we can obtain the overall power consumption as follows: 
𝑃 = 𝑃𝑠𝑡𝑎𝑡𝑖𝑐 + 𝑃𝑑𝑦𝑛𝑎𝑚𝑖𝑐 = 𝑃𝑠𝑡𝑎𝑡𝑖𝑐 + 𝐸𝑑𝑦𝑛𝑎𝑚𝑖𝑐 ×
𝑁𝑏𝑖𝑡
𝑡𝑖𝑚𝑒
 (3) 
where Nbit is the number of a data bit in the benchmark. We can also scale the power with the dynamic 
frequency and voltage if needed. Here, we also support dynamic scaling for voltage and frequency by 
using Equation 2 where different voltage and frequency can be converted using the following equations: 
𝑃𝑠𝑡𝑎𝑡𝑖𝑐
𝑉1𝐹1
𝑃𝑠𝑡𝑎𝑡𝑖𝑐
𝑉2𝐹2
≈
𝑉1
𝑉2
 (4) 
𝑃𝑑𝑦𝑛𝑎𝑚𝑖𝑐
𝑉1𝐹1
𝑃𝑑𝑦𝑛𝑎𝑚𝑖𝑐
𝑉2𝐹2
≈
𝑓1𝑉1
2
𝑓2𝑉2
2
(5) 
 where V1,f1 and  V2,f2 are two pairs of supply voltage and frequency. 
P.T. Nam et al. / VNU Journal of Science: Comp. Science & Com. Eng., Vol. 31, No. 3 (2015) 1-9 13 
The power trace and floorplan are taken into Hotspot 6.0 to obtain the thermal map of the design. 
The results of Hotspot 6.0 is the steady temperature of each router and its TSVs. We can also support 
transient power and temperature. However, since we consider the reliability as the major target, the 
steady temperature is the most important value. 
3.4. Defect mapping 
After getting the thermal map, we can extract the reliability to obtain the defect map. Figure 6 shows 
the normalized thermal acceleration model in academics and industry. We illustrate the MIL-HDBK-
217F of the US Military [12], HRD4 from industry [13] and RAMP from academics [13]. Notably, we 
used Black’s model [7] in our work. However, we could also adopt the existing model if needed as in 
Figure 6.  One common between the model is the exponential curve of acceleration of the fault rate with 
the temperature. Note that HRD4 uses 70°C as the threshold of reliability concern. 
 
Figure 6: Normalized thermal acceleration of fault rate. 
Table 1: Normalize fault rate of Copper TSV mapping using Black’s model [7]. 
Temperature (K) Normalize fault rate to 70°C 
303.15 0.011537 
313.15 0.039174 
323.15 0.123317 
333.15 0.362371 
343.15 1 
353.15 2.605435 
363.15 6.439561 
373.15 13.94691 
Table 1 shows the fault rate mapping obtained by Black’s model [7]. At 30°C, the fault rate is less 
than 2% at 70°C. However, once the IC operates at 80°C, its fault rate is 2.6× at 70°C and 220× at 30°C. 
By mapping to fault rates, we can find the critical part of the 3D-NoCs in terms of reliability. 
4. Experimental results 
In this section, we evaluate the 3D Network on Chip [3] using the proposed platform. Furthermore, 
explore the idea of the different floorplan and cooling strategies. At first, we extract the power 
consumption from the synthetic benchmark of a router. Then, we estimate the power consumption of 
the 3D-NoC system under various benchmarks. Then, temperature and reliability prediction is 
illustrated. In the final part, we compare different strategies for layout and cooling. 
4.1. 3D-NoC router power estimation. 
  Technical Report 
 
We used the previous work [3] model of a router to estimate the power consumption and the energy. 
Note that we modified the router with some optimizations and further fault tolerances. We use 
NANGATE 45nm library [23] and NCSU FreePDK TSV [24]. The hardware complexity of the router 
is shown in Table 2. We perform a heuristic benchmark for this router by sending each port to all possible 
ports two packets of ten flits of 32 bits. The number of bits is 7 × 7 × 2 × 10 × 32 = 31360 bits. The 
desired injection rate is 1 flit/port/cycle. The final result for static power and energy per data bit are 
7.66e-4 W and 9.246e-13 J/bit, respectively. 
Table 2: Hardware complexity of our 3D-NoC router. 
Parameter Value 
Area cost 38,838 𝜇𝑚2 
Maximum Frequency 537.63 MHz 
Operating Frequency 500MHz 
Technology 45nm (NANGATE 45)  [23] 
Voltage  1.1 V 
Static Power (at 500MHz) 7.64e-4 Watt 
Dynamic Power (at 500MHz) 1.028e-2 Watt 
Simulation time 2.823200e-6 second 
Energy 2.9022496e-8 Joule 
Energy per data bit 9.2546e-13 Joule/bit 
4.2. 3D-NoC system power estimation. 
To estimate the power of 3D-NoC system, we use Equation 3 with the scaling Equation 4 and 5 for 
different voltage and frequency if needed. Apparently, we need to obtain the number of the bit through 
the routing during its operation. Here, we perform both synthetic benchmarks (Matrix, HotSpot, 
Uniform, and Transpose) from [3] and we design a 3D-NoC version of garnet 2.0 in gem5 [25] then 
perform the PARSEC benchmarks suite [26].  PARSEC is one of the most well-known benchmarks for 
multi-core computing systems. Here, we use 64 core x64 processors as the processing elements of the 
PARSEC benchmarks. Here, we only extract the number of flits that went through the routers to estimate 
the power consumption. The power consumption of the processing elements can be obtained by using 
McPAT tools by HP inc. [27]; however, it is out-of-scope of this work. 
 
 
Figure 7: Power consumption of our 3D-NoC under PARSEC benchmarks. 
Figure 7 shows the power consumption of our 3D-NoC under PARSEC benchmark.  Here, we scale 
the frequency to 2GHz to fit with the configuration of gem5 using Equation 4 and 5. Among these 
P.T. Nam et al. / VNU Journal of Science: Comp. Science & Com. Eng., Vol. 31, No. 3 (2015) 1-9 13 
benchmarks, we observe cannel has the highest power consumption and also the highest variation 
(between the minimum and maximum power of router). 
Figure 8 shows the power consumption of the 3D-NoC system under synthetic benchmarks. We 
keep the frequency as 500MHz and inject the flit with a maximum inject rate. Note that we perform two 
Hotspot benchmarks where two nodes are the destination of 5% and 10% of total flits. We can easily 
observe the significant drop when increasing the number of flits to the hotspot nodes. This can be 
explained by the congestion created by more flits coming to these nodes which extend the execution 
time of the system.  On the other hand, the matrix benchmark has the lowest router power consumption. 
We also notice that the synthetic benchmarks have much higher power consumption than the PARSEC 
benchmarks since no computation is taken in this benchmark. As a consequence, the execution time is 
shorter which makes the power consumption higher than PARSEC. 
 
Figure 8: Power consumption of our 3D-NoC under synthetic benchmarks. 
4.2. 3D-NoC thermal estimation. 
By using the power estimation of the previous section, we conduction the thermal estimation using 
Hotspot 6.0 [10]. Table 3 shows the configurations for thermal estimation using Hotspot 6.0. We modify 
the thermal resistivity corresponding to our designed TSV (Copper with the size of 4.06𝜇𝑚 × 4.06𝜇𝑚) 
using the following equation [28]: 
𝑅𝑗𝑜𝑖𝑛𝑡 =
𝐴𝑟𝑒𝑎
𝐴𝑟𝑒𝑎 − 𝐴𝑟𝑒𝑎𝑇𝑆𝑉
𝑅𝑇𝐼𝑀
−
𝐴𝑟𝑒𝑎𝑇𝑆𝑉
𝑅𝐶𝑜𝑝𝑝𝑒𝑟
 (6)
 
where TIM is the thermal interface material. The result of the thermal resistivity of the layout in Figure 
2(c) can be found in Table 3. The final TSV area thermal resistivity is 0.0226mK/W. 
Table 3: Configurations for thermal estimation. 
Parameter Value 
Router floor-plan 290 𝜇𝑚 ×290 𝜇𝑚 
Floorplan Figure 2(c) 
One TSV area 4.06μm×4.06μm 
Router logic area 220 𝜇𝑚 ×220 𝜇𝑚 
Router logic utilization 80% 
TSV area/utilization 35,700 𝜇𝑚2 / 10.16% 
Copper thermal resistivity  0.0025mK/W 
TIM thermal resistivity 0.25mK/W 
TSV area thermal resistivity 0.0226mK/W 
  Technical Report 
 
 
Figure 9: Temperature of our 3D-NoC under PARSEC benchmarks. 
To compare with Monolithic 3D-IC, we also adopt the method in [35] where we remove the 
bonding layers between silicon layers. We keep the thickness of the silicon layer as it is for a fair 
comparison. Obviously, if we thin the layer, the transfer of heat is much faster. 
 Figure 9 shows the router temperature under PARSEC benchmark. Here, we also compare with 
the monolithic technology where no TSV needed [33]. As we can observe in Figure 9, TSV-based 
system has lower operating temperature thanks to the ability to transfer the heat of Copper TSVs. The 
difference in temperature is around 1K at the bottom layer and even reach 3.5K in the cannel benchmark. 
Figure 10 shows the operating temperature under synthetic benchmarks of our 3D-NoC. We can 
easily notice that the operating temperature of Monolithic systems is much higher than TSV ones since 
we stress the system under it saturation points. The highest temperature of Monolithic 3D-NoC even 
reaches 351.64 K (78.49°C). The hottest layer of the TSV-based system has a similar temperature as the 
coolest layer of Monolithic 3D-NoC. 
 
Figure 10: Temperature of our 3D-NoC under synthetic benchmarks. 
4.2. 3D-NoC reliability estimation. 
In this section, we use Black’s model to evaluate the MTTF of 3D-NoC. Figure 11 and Figure 
12 show the normalized MTTF of each layer to 323.15K (50°C) under PARSEC and synthetic 
benchmarks. Here, we can observe TSV-based 3D-NoC dominates Monolithic in the PARSEC 
benchmark. With synthetic benchmarks, TSV-based 3D-NoC is slightly better than Monolithic ones. 
 
P.T. Nam et al. / VNU Journal of Science: Comp. Science & Com. Eng., Vol. 31, No. 3 (2015) 1-9 13 
 
Figure 11: Normalized MTTF of our 3D-NoC under PARSEC benchmarks. 
4.4. Exploring different layout and thermal dissipation method 
In this section, we explore different layout and thermal dissipation for our 3D-NoC. First, we 
perform thermal and reliability prediction for our layout in Figure 2(b). Then, we insert four thermal 
TSVs with the size 15 𝜇𝑚 ×15 𝜇𝑚 in four corners of the router floorplan in Figure 2(c). This size of 
TSV is still feasible in the existing manufacture process[34]. We also add 10 𝜇𝑚 Keep-out-Zone 
distance this thermal TSV  to avoid mechanical stress. The thermal TSV went through all layer of TSV 
but does not contact with the heatsink. The heatsink and thermal TSV are separated by a layer of thermal 
interface material.  
 
Figure 12: Normalized MTTF of our 3D-NoC under synthetic benchmarks. 
Figure 13 and Figure 14 show the thermal behaviors under PARSEC and synthetic benchmark for 
different layouts and cooling. We can notice that with the layout in Figure 2(b) has the worst thermal 
behavior among the TSV designs. On the other hand, adding thermal TSV can help reduce the operating 
temperature significantly. By adding four TSVs, we can even reduce the temperature by nearly 1K at 
the bottom layer in the uniform benchmark which is the most stressed benchmark. Another benchmarks’ 
results also show a slight improvement in thermal behaviors.  
One thing we can easily notice the top layer’s temperatures do not change. This is due to the fact it 
is already cool down by the heatsink and adding TSV cannot help it reduce the temperature. Also, the 
heatsink temperature is raised near the top layer temperature which reduces the ability to transfer heat. 
If the thermal TSV can contact the heatsink, it can significantly cool down the bottom layer. Also, liquid 
cooling could be extremely helpful in this situation. 
  Technical Report 
 
In comparison to the traditional 2D-ICs, we observe that the TSV-based ICs have higher operating 
temperatures. The 2D-based 3D-NoCs operate under 319K and 322K with PARSEC and synthetic 
benchmarks, respectively. On the other hand, TSV-based system increases at most 10K in maximum 
temperature with the layout in Figure 2(b). 
In summary, different layouts can make different thermal behaviors. The layout Figure 2(b) does 
not surround the router by TSV area, therefore, the router could heat up each other and reach a higher 
temperature. On the other hand, adding thermal TSV to cool down the bottom layer is helpful since it 
can reduce nearly 1 Kelvin in the worst case. By mapping to the reliability, we can easily obtain a 2×~3× 
improvement of MTTF.  
 
Figure 13: Thermal behavior of different layout and cooling methods under PARSEC 
benchmark. 
 
Figure 14: Thermal behavior of different layout and cooling methods under synthetic 
benchmarks. 
4.5 Execution time 
In this work, we evaluate the proposed method using a system with Xeon E5-2620 8 cores 
2.1GHz, 16GB RAM and Linux Subsystem and PowerShell under Windows 10. The platform is 
written under C++, Python, and Bash. The execution time is measured using command time under 
Linux and Measure-Command under Windows PowerShell. Here, the simulation time of PARSEC 
and synthetic benchmarks are not considered because they are separated from our own flow.  As 
shown in Table 4, all steps in our flow perform under two seconds. Our method easily outperforms 
P.T. Nam et al. / VNU Journal of Science: Comp. Science & Com. Eng., Vol. 31, No. 3 (2015) 1-9 13 
in terms of execution time the fabrication-based methods which usually take hours regardless of 
designing, fabrication and assembly time [31][32][33]. 
Table 4: Execution time of the proposed flow. 
Work Step Time 
Ours Power extraction (one benchmark) 1.22 s 
Floorplan generate 0.095 s 
Temperature estimation (one 
benchmark) 
81 s 
Reliability estimation (12 
benchmarks) 
1.12 s 
[31] Reliability test 96h 
[32] The longest step in reliability test 1000h 
[33] Lifetime acceleration test 100-5000h 
4.6. Discussion 
In this section, we would like to discuss some technical details of our methods. Advantages and 
drawbacks are also mentioned in this part. 
In our evaluation, we point out that Monolithic has a higher temperature than TSV-based 3D-NoC 
due to two major reasons: (1) TSVs act like thermal conduct devices and (2) Monolithic 3D-ICs has a 
higher density than TSV-based system. However, we would like to note that Monolithic 3D-ICs has 
lower area cost than TSV-based system. 
Fluid cooling [5] is one of the most advanced methods to reduce the operating temperature of the 
system. Although we have not explored the ability of this method, it has shown promising efficiency for 
3D-ICs[5]. With a fast velocity of the fluid, we expect the system can be cooled down significantly. 
However, we would like to note that fluid cooling has unknown reliability which needs to be carefully 
investigated for being widely used. 
5. Conclusion 
In this work, we proposed a platform to quickly estimate the power, thermal behavior, and reliability 
of 3D-NoC systems. The method has shown extremely quick execution time. We also analyze and 
simulate the reliability of TSV and Monolithic 3D-ICs. Furthermore, we explore and compare different 
layout strategies and cooling methods. 
In the future, advanced cooling such as liquid could be investigated. The impact of DVFS and fault 
tolerance on performance and thermal behavior also could be studied.  
Acknowledgment 
This research is funded by the Vietnam National Foundation for Science and Technology 
Development (NAFOSTED) under grant number 102.01-2018.312. 
References 
[1] Khanh N. Dang, Akram Ben Ahmed, Xuan Tu Tran, Yuichi Okuyama, Abderazek Ben Abdallah, “A 
Comprehensive Reliability Assessment of Fault-Resilient Network-on-Chip Using Analytical Model,” IEEE 
Transactions on Very Large Scale Integration (VLSI) Systems, vol. 25, no. 11, pp. 3099-3112, 2017. 
https://doi.org/10.1109/TVLSI.2017.2736004 
  Technical Report 
 
[2] K. Banerjee K. Banerjee, S. J. Souri, P. Kapur and K. C. Saraswat, “3-D ICs: A novel chip design for improving 
deep-submicrometer interconnect performance and systems-on-chip integration,” Proc. IEEE, vol. 89, no. 5, pp. 
602–633, 2001. https://doi.org/10.1109/5.929647 
[3] Khanh N. Dang , Akram Ben Ahmed, Yuichi Okuyama, Abderazek Ben Abdallah, “Scalable design methodology 
and online algorithm for TSV-cluster defects recovery in highly reliable 3D-NoC systems,” IEEE Transactions 
on Emerging Topics in Computing, p. 1-14, (in-press). https://doi.org/10.1109/TETC.2017.2762407 
[4] Y. J. Park et al., “Thermal Analysis for 3D Multi-core Processors with Dynamic Frequency Scaling,” in 
IEEE/ACIS 9th Int. Conf. on Computer and Information Science, Aug 2010, pp. 69–74. 
[5] D. Cuesta et al., “Thermal-aware floorplanner for 3D IC, including TSVs, liquid microchannels and  thermal 
domains optimization,” Applied Soft Computing, vol. 34, pp. 164–177, 2015. 
https://doi.org/10.1016/j.asoc.2015.04.052 
[6] Park, Changyok. "Dummy TSV to improve process uniformity and heat dissipation." U.S. Patent No. 10,181,454. 
15 Jan. 2019. https://patents.google.com/patent/US20110215457A1/en (access 16 March 2020). 
[7] J. R. Black, “Mass transport of aluminum by momentum exchange with conducting electrons,” in 6th 
Annual Reliability Physics Symposium (IEEE). IEEE, 1967, pp. 148–159.  
[8] Synopsys Inc., “RTL Design and Synthesis,” https://www.synopsys.com/implementation-and-signoff/rtl-
synthesis-test.html. (accessed 16 March 2020). 
[9] Synopsys Inc., “Physical Implementation,” https://www.synopsys.com/implementation-and-signoff/physical-
implementation.html  (accessed 16 March 2020). 
[10] Zhang, Runjie, Mircea R. Stan, and Kevin Skadron. “Hotspot 6.0: Validation, acceleration and 
extension.” University of Virginia, Tech. Rep (2015).  
[11] Khanh N. Dang Akram Ben Ahmed, Abderazek Ben Abdallah, Xuan-Tu Tran, “TSV-OCT: A Scalable Online 
Multiple-TSV Defects Localization for Real-Time 3-D-IC systems” IEEE Transactions on Very Large Scale 
Integration Systems, vol. 28, no. 3, pp. 672 - 685, 2020. https://doi.org/10.1109/TVLSI.2019.2948878 
[12] United States of America: Department of Defense, Military Handbook: Reliability Prediction of Electronic 
Equipment: MIL-HDBK-217F, 1991. 
[13] J. B. Bowles, “A survey of reliability-prediction procedures for microelectronic devices,” IEEE Trans. Rel., vol. 
41, no. 1, pp. 2–12, 1992. https://doi.org/10.1109/24.126662 
[14] J. Srinivasan et al., “Lifetime reliability: Toward an architectural solution,” IEEE Micro, vol. 25, no. 3, pp. 70–
80, 2005. https://doi.org/10.1109/MM.2005.54 
[15] Erdmann, Christophe, et al. "A heterogeneous 3D-IC consisting of two 28 nm FPGA die and 32 reconfigurable 
high-performance data converters." IEEE Journal of Solid-State Circuits 50.1 (2014): 258-269. 
https://doi.org/10.1109/JSSC.2014.2357432 
[16] Kahng, Andrew B., et al. "ORION 2.0: A fast and accurate NoC power and area model for early-stage design 
space exploration." 2009 Design, Automation & Test in Europe Conference & Exhibition. IEEE, 2009. 
[17] Lee, Seung Eun, and Nader Bagherzadeh. "A high level power model for Network-on-Chip (NoC) router." 
Computers & Electrical Engineering 35.6 (2009): 837-845. https://doi.org/10.1016/j.compeleceng.2008.11.023 
[18] Lee, Seung Eun, and Nader Bagherzadeh. "A variable frequency link for a power-aware network-on-chip (NoC)." 
Integration 42.4 (2009): 479-485. https://doi.org/10.1016/j.vlsi.2009.01.002 
[19] Lebreton, Hugo, and Pascal Vivet. "Power modeling in SystemC at transaction level, application to a DVFS 
architecture." 2008 IEEE Computer Society Annual Symposium on VLSI. IEEE, 2008. 
[20] Zhang, Runjie, Mircea R. Stan, and Kevin Skadron. "Hotspot 6.0: Validation, acceleration and extension." 
University of Virginia, Tech. Rep (2015). 
[21] Sridhar, Arvind, et al. "3D-ICE: Fast compact transient thermal modeling for 3D ICs with inter-tier liquid 
cooling." 2010 IEEE/ACM International Conference on Computer-Aided Design (ICCAD). IEEE, 2010. 
[22] Scott Ladenheim, Yi-Chung Chen, Milan Mihajlović, Vasilis F. Pavlidis, "The MTA: An Advanced and Versatile 
Thermal Simulator for Integrated Systems." IEEE Transactions on Computer-Aided Design of Integrated Circuits 
and Systems 37.12 (2018): 3123-3136. https://doi.org/10.1109/TCAD.2018.2789729 
[23] NanGate Inc., “Nangate Open Cell Library 45 nm,” http://www.nangate.com/ , (accessed 16 June 2016). 
[24] NCSU Electronic Design Automation, “FreePDK3D45 3D-IC process design kit,” 
http://www.eda.ncsu.edu/wiki/FreePDK3D45:Contents, (accessed 16 June 2016). 
[25] Binkert, Nathan, et al. "The gem5 simulator." ACM SIGARCH computer architecture news 39.2 (2011): 1-7. 
P.T. Nam et al. / VNU Journal of Science: Comp. Science & Com. Eng., Vol. 31, No. 3 (2015) 1-9 13 
[26] Bienia, Christian, et al. "The PARSEC benchmark suite: Characterization and architectural implications." 
Proceedings of the 17th international conference on Parallel architectures and compilation techniques. 2008. 
[27] Li, Sheng, et al. "McPAT: an integrated power, area, and timing modeling framework for multicore and manycore 
architectures." Proceedings of the 42nd Annual IEEE/ACM International Symposium on Microarchitecture. 
2009. 
[28] J. Meng, K. Kawakami, and A. K. Coskun, “Optimizing energy efficiency of 3-d multicore systems with stacked 
dram under power and thermal constraints,” in DAC Design Automation Conference 2012. IEEE, 2012, pp. 648–
655. 
[29] Khanh N. Dang, Akram Ben Ahmed, Abderazek Ben Abdallah, Michael Corad Meyer, Xuan-Tu Tran, “2D Parity 
Product Code for TSV online fault correction and detection”, REV Journal on Electronics and Communications 
(in-press). http://dx.doi.org/10.21553/rev-jec.242 
[30] Wong, Simon, et al. "Monolithic 3D integrated circuits." 2007 International Symposium on VLSI Technology, 
Systems and Applications (VLSI-TSA). IEEE, 2007. 
[31] Hamada, Dorothy June M., and William J. Roesch. "Evaluating device reliability using wafer-level 
methodology." CS Mantech Conference. 2008. 
[32] Renesas’s Semiconductor Reliability Handbook 
https://www.renesas.com/us/en/doc/products/others/r51zz0001ej0250.pdf, 2017 (access 17 March 2020). 
[33] Toshiba’s Reliability Handbook https://toshiba.semicon-storage.com/content/dam/toshiba-
ss/shared/docs/design-support/reliability/reliability-handbook-tdsc-en.pdf 2018 (access 17 March 2020). 
[34] Van der Plas, Geert, et al. "Design issues and considerations for low-cost 3-D TSV IC technology." IEEE Journal 
of Solid-State Circuits 46.1 (2010): 293-307. 
[35] Samal, Sandeep Kumar, et al. "Fast and accurate thermal modeling and optimization for monolithic 3D ICs." 
2014 51st ACM/EDAC/IEEE Design Automation Conference (DAC). IEEE, 2014. 
[36] Dang Nam Khanh “Development of On-Chip Communication Fault-Resilient Adaptive Architectures and 
Algorithms for 3D-IC Technologies”, Ph.D. thesis, The University of Aizu, 2017. 
http://doi.org/10.15016/00000140 
[37] Khanh N. Dang, Akram Ben Ahmed, Xuan-Tu Tran, Yuichi Okuyama, Abderazek Ben Abdallah, “A 
Comprehensive Reliability Assessment of Fault-Resilient Network-on-Chip Using Analytical Model”, IEEE 
Transactions on Very Large Scale Integration Systems (TVLSI). IEEE, Volume 25, Issue 11, pp 3099-3112, 
2017. https://doi.org/10.1109/TVLSI.2017.2736004 
[38] Khanh N. Dang, Michael Meyer, Yuichi Okuyama, Abderazek Ben Abdallah, “A Low-overhead Soft-Hard Fault 
Tolerant Architecture, Design and Management Scheme for Reliable High-performance Many-core 3D-NoC 
Systems”, Journal of Supercomputing (SUPE), Springer, Volume 73, Issue 6, pp 2705–2729, 2017. 
https://doi.org/10.1007/s11227-016-1951-0 
[39] Khanh N. Dang, Michael Meyer, Akram Ben Ahmed, Abderazek Ben Abdallah, and Xuan-Tu Tran, “2D-PPC: 
A single-correction multiple-detection method for Through-Silicon-Via Faults”, 2019 IEEE Asia Pacific 
Conference on Circuits and Systems (APCCAS), Nov. 11-14, 2019 
[40] Khanh N. Dang, Akram Ben Ahmed, and Xuan-Tu Tran, “ An on-communication multiple-TSV defects detection 
and localization for real-time 3D-ICs”, 2019 IEEE 13th International Symposium on Embedded 
Multicore/Many-core Systems-on-Chip (MCSoC), Oct. 1-4, 2019. 
[41] Khanh N. Dang, Akram Ben Ahmed, Ben Abdallah Abderrazak and Xuan-Tu Tran, “TSV-IaS: Analytic analysis 
and low-cost non-preemptive on-line detection and correction method for TSV defects”, The IEEE Symposium 
on VLSI (ISVLSI) 2019, Jul. 15-17, 2019. 
[42] Khanh N. Dang and Xuan-Tu Tran, “Parity-based ECC and Mechanism for Detecting and Correcting Soft Errors 
in On-Chip Communication”, 2018 IEEE 12th International Symposium on Embedded Multicore/Many-core 
Systems-on-Chip (MCSoC), Sep. 12-14, 2018. 
[43] Abderazek Ben Abdallah, Khanh N. Dang, Yuichi Okuyama, “A Low-overhead Fault tolerant Technique for 
TSV-based Interconnects in 3D-IC Systems”, The 18th International conference on Sciences and Techniques of 
Automatic control and computer engineering (STA), Dec. 21-23, 2017. 
[44] Khanh N. Dang, Michael Meyer, Yuichi Okuyama, Abderazek Ben Abdallah, “Reliability Assessment and 
Quantitative Evaluation of Soft-Error Resilient 3D NoC System”, 25th IEEE Asian Test Symposium (ATS), 
Nov. 21-24, 2016. 
  Technical Report 
 
[45] Khanh N. Dang, Yuichi Okuyama, Abderazek Ben Abdallah, “Soft-Error Resilient Network-on-Chip for Safety-
Critical Applications”, 2016 IEEE International Conference on Integrated Circuit Design and Technology 
(ICICDT), Jun. 27 – 29, 2016. 
[46] Khanh N. Dang, Michael Meyer, Yuichi Okuyama, Abderazek Ben Abdallah, Xuan-Tu Tran, “Soft-Error 
Resilient 3D Network-on-Chip Router”, IEEE 7th International Conference on Awareness Science and 
Technology (iCAST), Sep. 22-24, 2015. 
[47] Nam Khanh Dang, Thanh Vu Le Van, Xuan Tu Tran, “FPGA implementation of a low latency and high 
throughput network-on-chip router architecture”, 2011 International Conference on Integrated Circuits and 
Devices in Vietnam (ICDV), 2011. 
[48] Akram Ben Ahmed, “High-throughput Architecture and Routing Algorithms Towards the Design of Reliable 
Mesh-based Many-Core Network-on-Chip Systems”, Ph.D. thesis, University of Aizu, 2014. 
 
