As thermal problems become more evident, new physical design paradigms and tools are needed to alleviate them. Incorporating thermal vias into integrated circuits (ICs) is a promising way of mitigating thermal issues by lowering the thermal resistance of the chip itself. However, thermal vias take up valuable routing space, and therefore, algorithms are needed to minimize their usage while placing them in areas where they would make the greatest impact. With the developing technology of three-dimensional integrated circuits (3D ICs), thermal problems are expected to be more prominent, and thermal vias can have a larger impact on them than in traditional 2D ICs. In this paper, thermal vias are assigned to specific areas of a 3D IC and used to adjust their effective thermal conductivities. The thermal via placement method makes iterative adjustments to these thermal conductivities in order to achieve a desired maximum temperature objective. Finite element analysis (FEA) is used in formulating the method and in calculating temperatures quickly during each iteration. As a result, the method efficiently achieves its thermal objective while minimizing the thermal via utilization.
INTRODUCTION
As the technology node progresses, chip areas and wire lengths continue to increase, causing such problems as increased interconnect delays, power consumption, and temperature, all of which can have serious implications on reliability, performance, and design effort. Three dimensional technology attempts to overcome some of these limitations by stacking multiple active layers into a monolithic structure, using special processing technologies such as silicon-on-insulator (SOI) or wafer bonding [1] . By expanding vertically rather than spreading out over a larger area, the chip space is better utilized, interconnects are decreased, and transistor packing densities are increased, leading to better performance and power efficiency [2] . Despite the advantages that 3D ICs have over 2D ICs, thermal effects are expected to be more pronounced because of higher power densities and greater thermal resistance along heat dissipation paths. With the advent of better processing technologies for 3D ICs, design tools are needed to realize their full potential and overcome thermal and efficiency issues. Current design tools used for 2D ICs can not be easily extended to 3D ICs [2] especially when taking into account thermal effects.
The idea of using thermal vias to alleviate thermal problems was first utilized in the design of packaging and printed circuit boards (PCBs). Lee et al. studied arrangements of thermal vias in the packaging of multichip modules (MCMs) and found that as the size of thermal via islands increased, more heat removal was achieved but less space was available for routing [3] . Li studied the relationships between design parameters and the thermal resistance of thermal via clusters in PCBs and packaging [4] . These relationships were determined by simplifying the via cluster into parallel networks using the observation that heat transfer is much more efficient vertically through the thickness than laterally from heat spreading. Pinjala et al. performed further thermal characterizations of thermal vias in packaging [5] .
Chiang et al. first suggested that "dummy thermal vias" can be added to the chip substrate as additional electrically isolated vias to reduce effective thermal resistances and potential thermal problems [6] . A number of papers have addressed the potential of integrating thermal vias directly inside chips to reduce thermal problems internally [6, 7, 8, 9] . Because of the many dielectric layers, thermal problems are greater and thermal vias can have a larger impact in 3D ICs than 2D ICs. In addition, interconnect structures can create efficient thermal conduits and greatly reduce chip temperatures.
It has become of particular interest to design efficient heat conduction paths right into a chip to eliminate localized hot spots directly. Despite all the work that has been done in evaluating This work was supported in part by an SRC fellowship and by DARPA under grant N66001-04-1-8909.
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. thermal vias, thermal via placement algorithms are lacking even with 2D ICs. As circuits and temperature profiles increase in complexity, efficient algorithms are needed to accurately determine the location and number of internal thermal vias. The thermal via placement method presented in this paper uses designated thermal via regions to place thermal vias and efficiently adjusts the density of thermal vias in each of these regions with minimal perturbations on routing. In the design process, thermal via placement can be applied after placement and before routing.
THERMAL VIA REGIONS Figure 1. Uniform density of thermal vias within a region.
In order to make the placement of thermal vias more manageable, certain areas of the chip are reserved for placing thermal vias. In thermal via regions as shown in Figure 1 , a uniform density of thermal vias is used, and the thermal via placement algorithm determines the density in each of these regions. In a 3D IC as shown in Figure 2 , these regions are evenly placed between the rows in a standard cell design. The thermal via regions are composed of electrically isolated vias and are oriented vertically between the rows. The density of the thermal vias determines the thermal conductivity of the region which in turn determines the thermal properties of the entire chip. They are generally obstacles to routing except for regions which require only a low density of thermal vias. Placing thermal vias in specific regions allows for predictable obstacles to routing. The density of these routing obstacles is limited in any particular area so that the design does not become unroutable. It also allows for regularity and uniformity in the entire design process.
The value of the thermal conductivity, K, in any particular direction corresponds to the density of thermal vias that are arranged in that direction. Increasing the number of thermal vias in one direction does increase the thermal conductivity in the other directions but at an order of magnitude less. For simplicity, the interdependence can be considered to be negligible, and the K's in the x, y, and z directions can be considered to be independent to a certain extent.
Current integration technologies for producing 3D ICs result in the layers being closely stacked together and the design space being tightly compressed in the z direction. In addition, the location of the heat sink in relation to the heat sources produces a heat flux that is primarily downward in direction with very minor lateral components. Furthermore, with the thermal via regions being oriented vertically, lateral thermal vias would have little effect. As a result, lateral thermal conductivities (in the x and y directions) are generally unchanged by this method because the thermal gradients in the vertical (z) direction are almost two orders of magnitude larger than in the lateral directions. In this paper, the method will be developed for all three directions, but for the reasons outlined above, only vertically oriented thermal vias will be considered in the implementation and results.
TEMPERATURE CALCULATION
At steady state, heat conduction within the chip substrate can be described by the following differential equation:
where T is the temperature, K x , K y , and K z are the thermal conductivities, and Q is the heat generated per unit volume. A unique solution exists when convective, isothermal, and/or insulating boundary conditions are appropriately applied. The nature of the packaging and heat sink determines the boundary conditions. The FEA method from [10] was used for temperature calculations in these experiments. An overview of this method, as applied to 3D ICs, is presented in the remainder of this section.
FEA Background
In finite element analysis, the design space is first discretized or meshed into elements. An example of an 8-node hexahedral element is shown in Figure 3 . The temperatures are calculated at discrete points (the nodes of the element), and the temperatures elsewhere within the element are interpolated using a weighted average of the temperatures at the nodes. For an 8-node hexahedral (rectangular prism) element, the following interpolation function is used: From the shape functions, the thermal gradient, {g}, can be found as follows:
where
Similar to circuit simulation using the modified nodal formulation, stamps are made for each element and added to the global system of equations. In FEA, these stamps are called element stiffness matrices, [k c ], and can be derived as follows using the variational method for an arbitrary element type [11] : 
Application of FEA
For a right prism with a width of w, a height of h, and a depth of d as shown in Figure 3 , the element stiffness matrix is given in Equation (6) as an 8x8 symmetrical matrix with rows and columns corresponding to the nodes 1 through 8 [10] .
[ ] 
For the entire mesh, the elements are aligned in a grid pattern with nodes being shared among at most 8 different elements. The element stiffness matrices are combined into a global stiffness matrix, [K global ], by adding the components of the element matrices corresponding to the same node together. The global heat vector, {P}, contains power dissipated or heat generation as represented at the nodes. This is produced by distributing the heat generated by the standard cells among its closest nodes. A linear system of equations is produced, [K global ]{T} = {P} with {T} being a vector of all the nodal temperatures.
Isothermic Boundary Conditions
Isothermic boundary conditions are applied to the global matrix using the following procedure [11] , and this results in a reduced, nonsingular system of equations. 
A 11 is a nonsingular matrix, T 1 contain the unknowns, and the right-hand side is vector of constants so this linear system of equations can now be solved.
Meshing the 3D IC
In this method, 3D ICs are meshed into regions (elements) as shown in Figure 2 . Vertically (in the z direction), the chip is separated into bulk substrate, layer, and inter-layer elements. The bulk substrate is located at the bottom attached to the heat sink. Above the bulk substrate elements are the layer and inter-layer elements. In a 3D IC, the inter-layers are composed of inter-layer vias and bonding materials that connect the layers together, and the layers contain the device and metal levels. Transistors, the primary sources of heat, are located at the bottom of each layer in the row regions. In standard cell designs, cells are placed into rows with inter-row spaces between them. These inter-row regions are necessary to accommodate interconnects between different layers. Some of the inter-row regions can serve as thermal via regions. They can be represented with special elements in the FEA mesh with variable thermal conductivities.
ITERATIVE THERMAL VIA METHOD
For a given placed 3D circuit, an iterative method was developed in which, during each iteration, the thermal conductivities of certain FEA elements are modified so that thermal problems are reduced or eliminated. Thermal vias are generically added to elements to achieve the desired thermal conductivities. The goal of this method is to satisfy given thermal requirements using as few thermal vias as possible, i.e., keeping the thermal conductivities as low as possible.
Updating the Thermal Conductivities
During each iteration, the thermal conductivities of thermal via regions are modified. These thermal conductivities reflect the density of thermal vias needed to be utilized within the element. The new thermal conductivities are derived from the element FEA equations. Using Equation (6) 
If we multiply Equation (10) by 
where g ideal is a nonnegative value and α is a user defined parameter between 0 and 1. If the magnitude of the old thermal gradient is below g ideal , the value is increased toward g ideal for the new thermal gradient. If the magnitude of the old thermal gradient is above g ideal , the value is decreased toward g ideal . If the magnitude of the old thermal gradient equals g ideal , then the value isn't changed. This causes the K's to be decreased when the thermal gradient is below g ideal and increased when the thermal gradient is above g ideal . This process works to eliminate major sources of thermal impedance in areas of greatest heat transfer, but when the element is not on a critical heat sinking path, the K's are decreased to eliminate unnecessary thermal vias.
Combining Equations (20)-(25) yields the formulas used for updating the thermal conductivities during each iteration:
The ideal thermal gradients, g ideal , must be chosen and adjusted specifically to satisfy some desired thermal objective. Initially it is set to the magnitude of the average thermal gradient, and in order to achieve a desired maximum temperature, it is updated during each iteration according to the following equation: T is the desired maximum temperature and T max is the maximum temperature from the last iteration. If the maximum temperature was too low in the previous iteration, g ideal is increased. Likewise, if the previous maximum temperature exceeds the desired maximum temperature, g ideal is decreased to lower the thermal effects.
Thermal Via Density
As stated earlier, the thermal conductivity of a thermal via region is determined by its density of thermal vias. After thermal via placement determines the thermal conductivities, the thermal via densities can be derived. Individual thermal vias are assumed to be much smaller than the thermal via region, are arranged uniformly within the thermal via region, and change the effective thermal conductivity of the thermal via region. The percentage of thermal vias, m, (also called thermal via density) in a thermal via region is given by the following equation: 
In this implementation, only the vertical thermal conductivities will be optimized using Equation (28). Equations (26) and (27) will not be used for updating the lateral thermal conductivities because the thermal vias are only oriented vertically in these experiments. During each iteration, the new vertical thermal conductivity is used to calculate the thermal via density, m, and the lateral thermal conductivities for each thermal via region. The effective lateral thermal conductivity can be found using the percentage of thermal vias: Figure 4 and Table 1 show the relationship between the thermal vias density and the thermal conductivities in the vertical and lateral directions for thermal via regions having vertically oriented thermal vias. In Figure 4 , Equations (31) and (33) were used to plot the relationship between the percentage of thermal vias in a thermal via region and its effective thermal conductivities. As you can see, the vertically oriented vias (as shown in Figure 1 ) produce a much greater effect in the vertical thermal conductivity. In Table 1 , the minimum, midrange, and maximum thermal conductivity values are given. In the midrange case, the average of the minimum and the maximum thermal via density values was used.
The Relationship between Thermal Via Density and Thermal Conductivity 
IMPLEMENTATION
In the thermal via placement method shown in Figure 5 , the thermal gradients are used to update the thermal conductivities of the thermal via regions. The thermal gradients are compared to an ideal thermal gradient value which is modified during each iteration so that the maximum temperature converges to a given ideal maximum temperature, ideal max T . Figure 5 . Thermal via optimization to a max. temperature. The thermal via placement algorithm is initialized by setting all K's to their minimum values and by calculating the temperature profile. The temperature profile is calculated using FEA, described earlier in Section 3. This gives the temperatures of the nodes and thermal gradients of the elements. In each iteration of the main loop, the thermal conductivities of the thermal via regions are modified, and the temperature profile of the chip is recalculated. For each thermal via region, the vertical thermal gradient, g, at the center of the element is calculated. Using the magnitude of the thermal gradient, a new vertical thermal conductivity is calculated using Equation (28) with α set to 0.5.
MAX TEMP OPTIMIZATION(
If the new thermal conductivity exceeds the minimum or maximum value for that element, it is modified to the value of the bound that it exceeds. Using the new vertical thermal conductivity, the thermal via density and lateral thermal conductivities are also updated using Equations (32) and (33).
The K's are adjusted so that the maximum temperature approaches a specified value by modifying the ideal thermal gradient during each iteration based on the current maximum temperature. The algorithm terminates after the 1-norm of the change in K's is less than some small ε > 0. The value of ideal max T should be less than the maximum temperature when all the K's are minimized and greater than the maximum temperature when all the K's are maximized.
In these experiments, the ideal max T values were obtained by using the maximum temperatures from the case where all the thermal via regions were given the midrange values as shown in Table 1 . This gives a maximum temperature value that is greatly reduced from the case where no thermal vias were used and provides a comparison to the case where all the thermal via regions are given the same via density. In practice, it would be useful to use the maximum allowable operating temperature for ideal max T .
RESULTS
The program was written in C++ and run on an Intel Pentium 4 2.8 GHz machine with Linux. The conjugate gradient solver with ILU factorization preconditioning from the LASPack package [12] was used in our program to solve the FEA systems of equations. The thermal via placement method was tested using benchmark circuits from the MCNC suite [13] and the IBM-PLACE benchmarks [14] .
The bulk substrate thickness was set to 500µm, the layer thicknesses were set to 5.7µm, and the interlayer thicknesses were set to 0.7µm. Four layers were used, and the chip size was fixed at 2cm x 2cm with the cell sizes adjusted accordingly. Thermal via regions were even distributed between the rows and given 10% of the total chip area. The thermal conductivity of the silicon in the bulk substrate was set to 150W/mC, and the thermal vias were assumed to be copper with a thermal conductivity of 398W/mC. The thermal conductivities used in the layer and interlayer elements are shown in Table 1 . Thermal via regions had variable thermal conductivities ranging from the minimum to maximum values given in Table 1 . All other elements used the thermal conductivities corresponding to the lateral midrange values.
A random power distribution was used with 90% of the cells having power densities ranging from 0 to 2 x 10 6 W/m 2 and 10% of the cells having power densities ranging from 2 x 10 6 to 4 x 10 6 W/m 2 . The bottom of the chip was made isothermic with the ambient temperature to represent the heat sink, and the top and sides of the chip were made insulated in order to simulate the low heat sinking properties of the packaging.
The ambient temperature was set to 0 o C for convenience, but the temperatures can be translated by the amount of any other ambient as desired.
In the first set of experiments, thermal via densities were increased from their minimum to maximum values, and the change in the temperatures and thermal gradients was observed as shown in Table 2 . In this table, T ave is the average temperature, T max is the maximum temperature, and g ave is the average thermal gradient. The thermal via regions were given the same minimum, midrange, and maximum thermal conductivity values from Table  1 . The minimum values correspond to the case where no thermal vias are used. The maximum values correspond to maximum thermal via usage. In the midrange case, thermal via regions were given thermal via densities that were the average of the minimum and maximum values. In Table 2 , we can seen that as the thermal via densities of the thermal via regions were increased, the temperatures and thermal gradients decreased. The temperatures and thermal gradients in the minimum and maximum cases define the bounds on the thermal values that can be obtained from adjusting the thermal via densities. At the midrange with an average thermal via density of 23.9% in the thermal via regions, the maximum temperatures were 47.1% lower and the average temperatures were 28.3% lower than in the case where no thermal vias were used.
In Table 3 , thermal via placements were obtained using the maximum temperatures from the midrange case (in Table 2 ) as the objective. In this table, K ave is the average vertical thermal conductivity of the thermal via regions, and m is the average density of thermal vias in the thermal via regions. The thermal via placement method was very accurate in achieved its maximum temperature objective.
Maximum temperatures, T max , were obtained within an average of 0.05% of their desired ideal maximum temperatures. On average, thermal via placement used only a thermal via density of 11.9% in the thermal via regions in order to obtain these maximum temperatures. This means that 50.3% fewer thermal vias were needed with thermal via placement than in the midrange case to obtain the same maximum temperatures. These maximum temperatures were 47.1% lower than in the case where no thermal vias were used. In these experiments, thermal via regions were assigned to only 10% of the chip area, and thermal vias required only 11.9% of this area. In all, thermal vias occupy only 1.19% of the total chip area, and it is expected that routability would be minimally affected from this small amount of blockages. 
Benchmark Circuits
Temperature profiles before and after thermal via placement are shown in Figures 6 and 7 for the struct benchmark. In Figure 6 , the heat sink is located at the bottom of the chip, and temperature contours are superimposed on the standard cells. Red indicates areas of high temperature, and blue represents areas of low temperature. As you can see, the temperatures increase as you go away from the heat sink toward the upper layers and toward the middle of the chip. After thermal via placement as shown in Figure 7 , the temperatures are greatly reduced.
In Figure 7 , the thermal via regions are superimposed on the resulting temperature profile after thermal via placement is performed. Thermal via regions are represented as small squares arranged in a grid pattern. The percentage of thermal vias in the thermal via regions is represented by its color. A red square represents a thermal via region with maximum thermal vias utilized. A blue square represents a thermal via region with minimum thermal vias utilized. The colored contours around the thermal via regions represent the temperatures. The heat sink is located at the bottom, and the thermal via regions were of greatest strength at the bottom of the chip where the thermal gradients are the highest and the most impact can be made in reducing thermal problems. However, at the top of the chip where the temperatures are the highest, the thermal vias are minimally used. In these areas, the thermal gradients are quite low and little impact can be made there.
CONCLUSION
An efficient thermal via placement method was presented that attempts to overcome the thermal issues produced in the design of 3D ICs. The resulting placements have lower temperatures and thermal gradients with the use of thermal vias kept to a minimum. In these experiments, thermal via placement uses 50.3% fewer thermal vias to obtain the same 47.1% reduction in the maximum temperatures that were obtained using the simpler method where all regions were blindly given the same midrange via densities. The iterative method modifies the thermal conductivities of thermal via regions in order to satisfy a maximum temperature requirement.
Each thermal conductivity corresponds to a particular percentage of thermal vias in the thermal via region. There is a tradeoff between reducing thermal effects and the percentage of thermal vias used as seen in Table 2 . There is also a tradeoff between routing space and thermal via space which results in a tradeoff between thermal problems and routability. An important observation is that placing thermal vias in areas of high temperature such as on the upper most layer has little impact in reducing thermal problems. This algorithm places thermal vias where they will have the most impact using the thermal gradient as a guide. High temperatures can only be reduced by alleviating the high thermal gradients leading up to them. The thermal resistance of these heat conduction paths is reducing by lowering the thermal conductivities of elements along it.
