ABSTRACT
I. INTRODUCTION
Over the years, the state-of-the-art technologies have continued to push the ULSI chips to higher clock speed and packing density. The speed requirement causes large power consumption and high packing density results in large power density (power per unit area). One direct impact of the increasing power density is the dramatic on-chip temperature rise. For a chip under normal operating condition, the temperature rise can be as much as a few tens of degrees above the ambience. Without good thermal engineering, significantly non-uniform temperature distribution can lead to considerable on-chip temperature gradient. The temperature rise and temperature gradient has strong effects on both chip performance and reliability. Although many research efforts have been focusing on the development of low power and new package design for better IC reliability, thermal problems continue to grow and demand more attention.
Some research work has been done earlier to address the electrothermal problems in the device level and the small-scale integrated (SSI) level by developing the computer-aided design (CAD) tools [1] [2] . The attempt at providing the electrothermal simulation capability at the VLSI level was introduced in ILLIADS-T [3] . Most of the existing electrothermal simulators focus on accurately calculating the temperatures at the expense of simulation time. In this paper, we present a new fast thermal analysis method, which has been incorporated into the thermal simulation framework in ILLIADS-T. It focuses on the on-chip hot-spot identification in an extremely efficient way. The hot spot information can be used to pinpoint the locations with high temperatures for reliability concerns. It can also be used to facilitate the adaptive mesh generation process for the numerical 3D thermal simulator in ILLIADS-T.
II. THERMAL ANALYSIS FRAMEWORK

A. Background
The general heat diffusion equation for temperature calculation is [4] ρc p ∂T x¡ y¡ z¡ t
In (1) and (2), T is temperature, g is power density of the heat source(s), k is thermal conductivity, ρ is density of material, c p is specific heat, h is heat transfer coefficient between the object under simulation and the ambience, f i x y z is an arbitrary function, and n i is the outward direction normal to the surface i. The details of the boundary condition specification can be found in [3] .
B. Fast thermal analysis (FTA)
For a VLSI chip containing a large number of heat sources, the exact numerical method can be very expensive. In the early chip design phase, especially when no specific package information is given or the thermal boundary conditions are not fully characterized, a fast thermal analysis method emphasizing on the hot-spot identification is, therefore, much needed. We have developed a new fast thermal analysis (FTA) tool to meet such needs.
The FTA approach is based on the fact that, the dimensions of the gate-level or subcircuit-level heat sources in a VLSI chip are small compared with the chip size. Thus all heat sources can be viewed as located in a relatively infinite body. Consider a point source in a chip as shown in Fig. 1(a) . Since ICs have a passivation layer, the top of the chip is insulated. We therefore have a boundary value problem with infinite dimension in the x-y plane while with semi-infinite dimension in the z direction. Moreover, the boundary condition (BC) at z 0 is ∂T r t ∂z 0. To find the temperature subject to this specific geometry and BCs, we use the method of images analogous to the electromagnetics problems [5] . We add an identical heat source symmetric 0-7803-5832-X /99/$10.00 ©1999 IEEE. with respect to z 0 and remove the insulating boundary. Now the problem in Fig. 1(a) is transformed to that in Fig. 1(b) .
The Green's function solution G r t r τ to the heat diffusion equation for the point source in Fig. 1 can be derived as G
where α is the thermal diffusivity. We formulate the resulting temperature rise above the ambience at observation point r due to parallelepiped heat source with dimensions a
τ¢ dv% dτ¡ (4) where the coordinate origin has been chosen to be at the center of the source and P 0 is the source power. To proceed, we integrate (4) by using error functions:
where the observation point is set to be on the chip surface (z 0), and 
In order to perform the integration in (7) analytically, we piecewisely linearize the error functions by [6] erf x¢
By defining t a1 (7), we need to reduce the number of possible permutations (120) of t a1 , t a2 , t b1 , t b2 and t c . To achieve this, we assert the following six constraints:
Constraints (2) and (3) are straightforward algebraically. Constraints (4) and (5) are valid because the thickness of heat sources is in the order of 0f 1µm which is much smaller than the physical dimensions (i.e., a ( b) of a logic gate on chip. Constraint (1) is equivalent to transforming all the observation points to the first quadrant by using the symmetric property as graphically shown in Fig. 2 . To satisfy Constraint (6), we use the coordinate transformation as shown in Fig. 3 . With the above specified constraints, (7) now becomes tractable. In other words, the precedence of t a1 , t a2 , t b1 , t b2 , and t c must belong to one of the eight cases shown in Fig. 4 . We have derived the analytical solutions for all cases. Proper solutions will be used during simulation, depending on the geometry and the size of the heat source, as well as the relative locations between the heat source and the observation point.
For a VLSI chip with n heat sources, the temperature rise at the center of source i is obtained by considering the heat diffusion from source i, plus that from other n @ 1 sources using superposition: (7) with one of the eight cases in Fig. 4 . Take Fig. 5 as an example, where one source with power P I is located at (0,0) while the other one with power P II is at (3, 2) . To find the temperature rise at the center of source II due to source I (i.e., 
Finally, the steady-state temperature rise at the center of source II can be obtained, according to (9), as
Mathematical formulation of the FTA method is based on the closed-form Green's function with the assumption of semiinfinite boundary condition. In practice, this assumption is not totally valid due to the existence of package and heat sink in a chip. Therefore, the temperature rise in (9) best represents the 500 um 500 um 500 um Heat Source Figure 6 : Chip structure and heat source locations.
relative value instead of the absolute temperature rise. However, the FTA method provides a quick estimate of the temperature distribution. This is particularly useful when the number of heat sources is large or when a large number of repeated thermal simulations are needed. It is also useful when the chip package specification is not known in the early design phase.
III. EXPERIMENTAL RESULTS
Among different thermal simulation methods, the FTA method is primarily used for fast hot-spot identification. In order to observe how accurately the FTA method can identify the hot spots, we performed the following experiment. Consider a chip containing 10 heat sources, all with dimensions of 50µm ( 50µm. The sources are confined within the area (source area) with dimensions of 500µm ( 500µm, and the distance between the boundary of the source area and the chip's bonding pad is 500µm as shown in Fig. 6 . The heat sources were randomly placed and the power values from 10 mW to 100 mW were randomly assigned to the sources, and this process was repeated for 50 times (i.e., 50 tests). We claim that a violation happens in a test if the hot spot identified by the FTA method is different from the one identified by our numerical method developed earlier. The number of violation and the violation rate among 50 tests using the FTA method are shown in the second and third rows of Table 2 for different h (heat transfer coefficient in (2)) values.
s T hot spot represents the average difference of the actual temperatures of the two distinct hot spots identified by the FTA and numerical methods for all violated tests. The data shown in Table 2 reveal that a violation occurs only when the two hot spots have very close temperature values, which is insignificant for reliability concerns.
To demonstrate the advantage of FTA when the problem size increases, both FTA and numerical method are used to identify the on-chip hot spots among all heat sources. The speedup factor of FTA over the numerical method with increasing number of heat sources is plotted in Fig. 7 . In the numerical simulation, one grid line is assigned for each heat source in both x and y directions. This is to assign a minimum number of grids required to calculate the temperature of heat sources.
Another experiment is performed on a prototype chip with the unit-level layout shown in Fig. 8 . Each unit contains several functional unit blocks (FUBs), and the power values were given for all FUBs. There are totally 510 FUBs in this chip. The symbols H and C in Fig. 8 represent the most significant hot and cold spots identified by FTA, while H and C represent the less significant hot and cold spots. The hot spots are caused primarily by placing two or more high power-density FUBs in close proximity. The result agrees very well with the detailed numerical simulation with package modeling [7] , which is computationally expensive. By using FTA in the early chip design phase, it is possible to interchange the locations of hot and cold FUBs in order to ensure more uniform on-chip temperature profile and thus better reliability.
As a final remark, the efficiency of the iterative mesh generation technique employed by our numerical method depends on the initial guess of the number and locations of the mesh grids [3] . Chip area with larger temperature gradient often requires more grids for better accuracy. Currently the FTA routine has been incorporated into our numerical thermal simulator as a preprocessor to estimate the temperature gradient, which helps to determine a better initial guess of the grids and therefore facilitate a more accurate and efficient thermal simulation. 
IV. CONCLUSION
In this paper, we presented a novel thermal analysis method -FTA. This method was developed to quickly find the on-chip hot spots for potential reliability concerns. FTA is based on the Green's function and error function approximation. With proper mathematical manipulation and constraint assertion, concise analytical formulae were derived for fast hot-spot identification. The FTA method is able to accurately locate the hot spots with comparatively little time complexity.
