This work presents a unique time-efficient and reliable floorplan algorithm DOTFloor (Diffusion Oriented Time-improved Floorplanner), built around a SA (Simulated Annealing) engine and targeted to optimize the peak on-chip temperature along with the traditional design metrics like chip area and wire length. This paper also proposes a novel heat-diffusion based stochastic thermal model called the FATT (Fast Assumption Technique for Temperature) which provides a fast assumption of the degree of hotness during the optimization process. The incorporation of FATT in DOTFloor results in a significant improvement in the run time of the optimization process. Upon experimentation on MCNC (Microelectronics Center of North Carolina) benchmark circuits with the proposed floorplanner, a good optimization in area, wire length metric and peak on-chip temperature with a significant reduction in execution time have been achieved over the existing floorplanning tool, the HotFloorplan.
I. INTRODUCTION
In the realm of thermal management in processors, floorplanning plays a key role at the physical design level of VLSI technology. Thermal-aware floorplanning facilitates the mitigation of on-chip hotspots and temperature gradients along with the optimization of other design metrics like area, wire length, routability, delay, etc. Besides, the reliability and time efficiency of such algorithms are very critical in the arena of CAD tool development. The floorplanner for its operation requires Models of Computations (MOCs) for the estimation of its metrics like temperature, area, interconnect delay, etc. Improvement in reliability and time efficiency of the floorplanners can be achieved at the algorithmic stage and/or at the level of MOCs. For example at the MOCs level, a major chunk of time is generally consumed for the temperature estimation and accurate techniques like finite element method (FEM) [1] , finite difference method (FDM) [2] , Green's function method [3] and compact thermal model [4] , [6] require considerable time for the computation of temperature. Several efforts have been made to improve the performance of floorplanners and some of them have been discussed below.
A Genetic Algorithm (GA) based thermal-aware floorplanner has been presented in [7] , which optimizes the chip The associate editor coordinating the review of this manuscript and approving it for publication was Fuhui Zhou . area and on-chip temperature of slicing floorplan structure. A thermal-aware SA based floorplanning tool called Hot-Floorplan has been introduced by [8] , which optimizes the peak on-chip temperature and performance indicated by the wire length delay for slicing floorplan. Authors in [9] present a hybrid PSO-GA algorithm for minimizing chip area, wire length, and temperature of non-slicing floorplan arrangement. Authors in [10] present a thermal-aware floorplanner for slicing floorplan by modeling the temperature-dependent wire delay, routing congestion and reliability factors and including the same in addition to the chip area and temperature metrics in the cost function of the HotFloorplan tool. The work shown in [11] presents an Application Specific NoC synthesis approach involving floorplan which targets to optimize the chip area, peak on-chip temperature and communication cost. The algorithm has been developed in Mixed Integer Linear Programming and Simulated Annealing. A Simulated Annealing based pre-RTL tool framework for floor-planning has been developed in [12] using the python language to optimize chip area and the peak temperature of SoC and chip multiprocessors. All of the floorplanners presented in [7] - [12] invoke the HotSpot tool [4] , [5] for the temperature estimation. HotSpot tool is accurate but it takes a considerable time for solving the temperature. Moreover, a thermal-aware placement algorithm has been presented in [13] to optimize the peak on-chip temperature, temperature gradient, and the wire length. Here temperature has been estimated from a thermo-resistive network of the chip and by using the modified nodal analysis on it, which again incurs a considerable computation overhead. Authors in [14] also present a fast but accurate temperature estimation technique by adopting an iterative conjugate gradient method as an alternative to the LU decomposition for solving the HotSpot tool thermal model. The technique presented in [14] still consumes considerable time when included for the complete floorplan optimization process. Authors in [15] have presented a floorplan strategy built around Satisfiability Modulo Theory to handle hard as well as flexible functional blocks. In this work, optimization has been done for the chip area alone. The work in [16] shows a Genetic Algorithm (GA) based floorplanner for optimizing the chip area and the wire length of free as well as fixed outline floorplans. Here the performance of the GA has been improved by including an entropy function that takes care of enhancing the diversity of solutions for the sake of obtaining the global optimum instead of settling at a local minimum. Paper [17] presents a novel floorplan algorithm based on Particle Swarm Optimization. The proposed floorplanner targets to achieve optimization in the chip area and the wire length. Work presented in [18] reveals a fast fixed outline multilevel floorplanner based on a thermal-aware non-linear model to simultaneously optimize the chip temperature and wire length. The floorplanner uses a power blurring analytical method to estimate the temperature and gives improved floorplan as well as runtime over the Corblivar floorplanning tool. The time overhead of exact temperature computation has been avoided in [19] and authors have proposed a thermal-aware placement algorithm for standard cells based on game theory to optimize the chip temperature. To achieve an improvement in time, authors in [20] reduced the complexity involved in temperature computation by developing a stochastic heat-diffusion model for the temperature estimation. Further a Simulated Annealing based temperature-aware floorplaner has been constructed in [20] by incorporating chip-temperature to the objective function of the Parquet floorplanning tool presented in [21] . Here optimization has been obtained for chip area, wire length, and on-chip temperature. The author in [22] presents two fast thermal-aware algorithms, one greedy and the other SA-Greedy hybrid algorithm for optimizing area and temperature of nonslicing floorplan. Like [20] , the complexity of temperature computation was also avoided in [22] . Instead of any temperature quantification in the objective function, hotspot minimization has been mechanized in [22] by forming groups containing one hot and three non-hot modules in each and further placing them in floorplan, such that a hot module remains surrounded by the three non-hot modules of its group. Authors in [23] have considered the area and peak power density (of a thermal zone) as metrics of optimization in floorplanning, but peak power density alone cannot account for the degree of hotness in a chip. The techniques presented in [20] , [22] , [23] are fast but do not consider the influence of the adiabatic die-boundary on the thermal characterization of a chip.
Our work exhibits a thermal-aware floorplanner and a fast novel heat diffusion based stochastic thermal model which avoids the complexity overhead of accurate temperature computation (incurred by some of the previous works) during the optimization process and also includes the adiabatic dieboundary influence (ignored by earlier works). The proposed thermal-aware floorplanner achieves a significant runtime improvement while maintaining equivalent qualitative outputs compared to the floorplanning tool, the HotFloorplan.
Our paper renders the following salient contributions -(i) It presents a unique thermal-aware floorplanner called the DOTFloor (Diffusion Oriented Time-improved Floorplanner), developed around the classical Simulated Annealing (SA) algorithm [24] - [26] to handle slicing floorplan structures composed of fixed as well as rotatable and hard functional-blocks. Utilizing a lateral heat spread management by the redistribution of functional block locations on the chip floor; the floorplanner gives successful optimization of the peak on-chip temperature as well as the chip area and the wire length metric with a considerable relative improvement in the execution time.
(ii) It introduces a novel thermal model called FATT (Fast Assumption Technique for Temperature) for obtaining an assumption of the degree of hotness by modifying the heat diffusion function presented in [20] and [27] , instead of exactly computing the temperature thereby reducing the computational complexity in thermal characterization. FATT approximates the local and peripheral (die boundary) heatdiffusion effects to give a fast and good assumption of the hotness of the chip. The inclusion of FATT in the DOTFloor is responsible for the relative improvement in time efficiency for the floorplanner.
(iii) It defines a weighted cost function that evaluates the fitness of a floorplan solution and also facilitates the adjustment of tradeoffs between the cost metrics for the attainment of the final optimized solution.
II. MOTIVATION OF THE WORK
The thermal model presented by Han and Korean in [20] and [27] facilitates a fast thermal characterization of floorplan structures. According to Han and Korean, in a floorplan, the heat diffusion Q i of a block Blk i with its neighboring blocks Blk j, having power densities pdi and pdj respectively and mutually sharing a boundary of shared length SL ij is given by,
Similarly, the total heat diffusion Q for the set of all functional-blocks {Blk i} in the floorplan is given by,
This thermal model assumes that maximizing Q leads to an increase in heat diffusion and it subsequently results in a minimization of the chip-temperature. Now, as an example, let us consider two gate-array floorplans of 36 functional-blocks each, arranged in a 6x6 matrix arrangement as shown in Fig. 1a and Fig. 1b , where every rectangle denotes a functional-block with a block number label in it. Let the height and width of each block be 1652 µm and 1218 µm respectively. Power of blocks Blk15, Blk21 and Blk22 are considered to be 4829126.4 µW each and zero for the rest of the 33 blocks. By fitting the considered parameters in (1a) and (1b), Table 1 has been constructed. The temperature profiles in Fig. 1 show that Floorplan-2 has a higher peak temperature (PT). [20] and [27] .
In Table 1 the terms Q 15 , Q 21 , and Q 22 denote the heat diffusion values of blocks Blk15, Blk21, and Blk22 respectively according to 1(a). Term Q denotes the total heat diffusion according to (1b). Table 1 shows that Floorplan-2 has higher heat diffusion value Q, and according to the assumption of the thermal model of Han and Korean, it is supposed to have lesser peak temperature (PT) than Floorplan-1. But from Table 1 and Fig. 1 it is observed that Floorplan-2 has higher peak temperature than Floorplan-1 (obtained from simulation results) and it contradicts the assumption of the thermal model of Han & Korean. Moreover, the hottest cell Blk21 has equal Q i values in both the floorplans but still has a higher temperature in Floorplan-2 which is unexplainable by (1a). The discrepancy arises because the die-boundary is actually adiabatic in nature and the thermal model does not include the adiabatic die-boundary influence on the temperature. It can be observed from the compact thermal model of Fig. 2 in [6] , that every heat-generating node (excluding those on the periphery) of the die has four lateral and one vertical conductive heat flow path. All the heat flow paths at a node help in draining out the heat from it. As a heat source location approaches closer to the periphery, the number of lateral and the vertical heat flow paths surrounding the source goes on decreasing. Finally, at the periphery location, one lateral conductive heat flow path is blocked and at the corner, two heat flow paths are blocked resulting in a reduced lateral heat spread or increased heat accumulation like an adiabatic system. As a result, the heat sources closer to the die periphery witness higher temperatures. Blk21 is hotter in Floorplan-2 because it is relatively nearer to the die-boundary than in Floorplan-1. Therefore, we modify the diffusion thermal model presented in [20] and [27] by including the adiabatic behavior of the chip-boundary, for the improved fast thermal characterization of floorplan structures. 
III. MODELS OF COMPUTATIONS (MOCs)

3) NEIGHBOR DIFFUSION
FATT considers the lateral diffusion of heat between a critical cell and its neighbor cells according to the Fourier's heat equation. The rate of heat flow q between two points of a conductor, x distance apart (along the x-axis), perpendicular to cross-sectional area A x , having a temperature gradient of T / x and thermal conductivity K , according to Fourier is given by, q = KA x ( T / x). Since temperature is directly proportional to the power density it follows; T ∝ pd where T and pd are respectively the temperature and power density differences between the two heat exchanging points. So it follows that,
Now we consider two heat exchanging cells (i and j) as shown in Fig. 3a , facing each other to have (a) power densities pdi and pdj respectively so that pd = pdi − pdj, (b) common superposable edge length Lij, so that the common superposable cross-sectional area for heat flow is A x = Lij×t, (c) perpendicular distance dij between the facing edges (of cell i and j) so that heat diffusion occurs through a channel of length x = dij + δ and cross-section A x . Here t is the thickness of the die and δ is a very small length element included to avoid singularity of (2a) in cases when the two cells come in contact resulting in dij = 0. So the rate of heat diffusion qij between two cells i and j according to (2a) is given by,
Considering heat diffusion among the neighbor cells (i.e. cells in contact with dij = 0) only and since parameters K and t are constants, (2b) becomes, qij = constant × Lij {(pdi − pdj) /δ}. Hence the total rate of heat diffusion Diff i from a critical cell-i to all its j-neighbors is given by,
FATT further assumes that a component of heat generated from the critical functional block travels laterally (along x and y directions) up to the adiabatic die boundary walls and gets reflected back to the critical block at the same rate. This leads to a reflective heat accumulation in the critical block resulting in a rise of temperature. The heat reflection is accounted from the two adjacent die walls which are nearer to the critical cell. The process has been illustrated in Fig. 3b . The rate of heat reflection or rate of heat accumulation (Ri) in cell-i is given similar to (2b) by, Ri = constant × KDi × t {(pdi − pdj) / (Si + δ)}. As shown in Fig. 3b , Di is the length of the edge (i.e. Height Hi or width Wi ) of the critical cell-i facing the reflector walls, pdi is the power density of the critical cell-i facing the reflecting die boundary wall and Si the perpendicular distance between the corresponding cell edge and the die wall (along x or y directions). In this case heat diffusion occurs through a conducting channel of length Si +δ and cross-section Di × t. The power density of reflecting point on the die wall is pdj = 0 and K , t being constants,
As the critical block approaches closer to the die walls the reflective heat accumulation increases, leading to a rise in temperature and the same can be observed from (3a). In (3a) it is observed that with an increase in the closeness of a critical block with the die wall the parameter Si decreases, resulting in an increase in the heat accumulation component Ri. The total reflected or accumulated heat component for the critical cell-i is given by,
Maximum neighbor diffusion (max_diff) occurs for the Max-cell when all its neighbor cells are dummy and in such a case form (2c) it is obtained that, max_diff = constant × 2 (H + W ) P/δ. Maximum heat accumulation (max_acc) for Max-cell occurs when it is at a corner of the die-boundary (such that Si = 0 along x and y directions) and from (3a), (3b) it is observed that, max_acc = constant × (H + W )P/δ. FATT formulates a net heat-diffusion term i for critical cell-i, consisting of the corresponding normalized diffusion and accumulation terms. The terms have been normalized so that they may not overpower each other's values.
Since the net diffusion i , increases with the increase in neighbor diffusion and decreases with the increase in heat accumulation, the first term is positive and the second term is negative in the RHS of (4a). For the entire floorplan, the thermal metric (TM) for all i-hot critical cells, is given by,
Finally, FATT assumes that maximizing the thermal metric TM will increase the net normalized heat-diffusion and consequently will reduce the on-chip temperature.
B. WIRE LENGTH MODEL
The total wire length metric (WLM) for N functional blocks or cells in a floorplan is computed according to the wire length model given in [8] as,
Here Aij is the Manhattan distance between centers (xi, yi) and (xj, yj) of blocks Blk i and Blk j respectively and ρij is the density of connections between them. This model collects the cell coordinates and dimensions from the Area Model, finds ρij, as well as computes Aij as, Aij = |xi − xj| + |yi − yj|. Also, for an optimally segmented wire, the total wire delay can be obtained from the product of WLM with the delay per unit length ξ , according to [8] and [28] . From [8] and [28] it can be derived that ξ = l crit rc 2a + √ 2ab , where l crit is the optimal length of a wire section, r and c are the resistance and capacitance per unit length of wire. The parameters l crit , r and c depend on the technology node and the metal layer. As per [28] the parameters a = 0.4 and b = 0.7 for 50% swing.
C. AREA MODEL
In this work, an area model has been developed to handle fixed as well as rotatable and hard block floorplans. The model has been built according to the methodology proposed in [29] and shape function arithmetic as shown in section 8.2 of [30] . A slicing floorplan can be represented by a skewed slicing tree with the functional blocks present as leaves at the lowest level, and the complete floorplan being represented by the daughter composite block to be formed at the topmost level or root node of the tree. Corresponding to every slicing tree the optimal orientation of the functional blocks is determined in two steps, viz. Merger of corner points and Backtrack.
Step1 Merger of corner points: Every rectangular functional block or cell has an associated shape function or graph composed of corner points whose coordinates represent the possible height and width of the block. At every blockmerging step, shape graphs of two parent blocks merge on the basis of the cutline operator to form the shape graph of the daughter composite block. Different combinations of corner point pairs with one from each parent block are considered to merge and form the corner points of the daughter shape graph. Redundant points from the list of corner points of the daughter block so formed are removed as per the method shown in section 8.2 of [30] . The process of stepwise shape graph merging continues until the shape graph of the final composite daughter block representing the entire floorplan is obtained. As an example, the stepwise shape graph formation for the normalized polish expression (NPE) of Fig. 4 , has been illustrated in Fig. 5 where Fig. 5a to Fig. 5i represent the Fig. 4 , upon merger of parent blocks defined by the cutline operator. Fig. 5a to Fig. 5i represents the gradual formation of shape graph of the final composite block of the floorplan. different stages of block merger. It starts with Fig. 5a where shape graphs of parent blocks Blk3 and Blk10 merge in V-cut to form the graph of composite daughter block CBlk1 and finally in Fig. 5i the shape graph of the final composite block CBlk9 or the root node is obtained by the shape graph merger of the parent blocks Blk4 and CBlk8 in H-cut.
Step2 Backtrack: In order to find the minimum floorplan area, the corner point giving the least area of the root node is identified from its shape graph. In Fig. 5i the third corner point from the top in the shape graph of CBlk9 is the suitable corner point giving a minimum area of the root node CBlk9. The problem now is to backtrack and find the chain of corner points of the parent blocks at every stage and finally end up with obtaining the particular corner points of the functional blocks which are associated with the formation of the suitable corner point of the root node CBlk9. In the given example the backtrack-process scans backward from Fig. 5i to Fig. 5a . The optimal height and width of the functional blocks or the leaf cells are obtained from their respective finalized corner points after the backtrack-process.
For fixed block floorplan, the functional block dimensions stay unaltered as the original dimensions. Hence, in this case, every block has only one corner point and thus in the shape graph implementation, only the first step i.e. Merger of corner points is performed to obtain the floor area. The backtrackoperation is not required. But in case of floorplan with rotatable blocks, both steps of Corner point merger and Backtrack are performed. Table 2 shows the results of shape graph implementation for the normalized polish expression (NPE) of Fig. 4 considering both fixed and rotatable block floorplans. In Table 2 the parameters H and W denote the functional block heights and widths respectively. It becomes clear from Table 2 that incase of floorplan with free blocks, the shape graph arithmetic efficiently incorporates the block rotation to achieve the optimal block dimensions and subsequently ensures the minimum area for every NPE. Shape graph implementation for fixed blocks in Table 2 manifests that fixed block floorplan has higher chip area. In both cases of fixed and rotatable block floorplans, the process initiates with the original block dimensions. The coordinates of the functional blocks in the chip floor are computed only after ensuring the final block dimensions for both cases of rotatable and fixed block floorplans. The block dimensions and the coordinates so obtained by the area model are also used by the wire length and the thermal models. • Power density = pdi and Power pi = pdi × Wi × Hi, the problem is to obtain an optimal floorplan by arranging the functional blocks on the chip floor such that the overall area, wire length metric (WLM) and the peak on-chip temperature of the floorplan are optimized. The solution is to be obtained once by considering the blocks {Blk i} to be of fixed orientation (Aspect ratio γ i = Hi/Wi) and again by considering the blocks {Blk i} to be of free orientation i.e. rotatable by 90 o (Aspect ratio γ i = Hi/Wi or γ i = Wi/Hi).
IV. PROPOSED FLOORPLANNING ALGORITHM DOTFLOOR (DIFFUSION ORIENTED TIME-IMPROVED FLOORPLANNER)
The proposed floorplan algorithm DOTFloor has been built around a classical Simulated Annealing engine (SA) given in [24] , [25] and sections 2.4.4 and 3.3.2 of [26] and the salient steps adopted in the algorithm have been listed below.
B. FLOORPLAN ENCODING
The proposed algorithm DOTFloor encodes a slicing floorplan by the normalized polish expression (NPE) [31] . 
Parameters Initial_Area, Initial_WLM and Initial_TM are the corresponding three cost metrics obtained for a randomly generated initial solution. Parameters W 1 , W 2 , W 3 are the weights for refining the area, wire length metric (WLM) and thermal metric (TM) respectively. Each weight parameter W k varies between 0 and 1 such that k W k = 1. Fitness of a solution decreases with the increase in its cost. Since the floorplan degrades with the increase in Area and wire length metric (WLM), they occur in numerators and as the floorplan improves with the increase in thermal metric (TM), it occurs in the denominator of the cost function in (6) . Since the cost metrics are dimensionally different and value of one metric may overpower other metrics, they have been normalized.
D. PERTURBATION
The stimulus to create a new floorplan solution is achieved by the DOTFloor, via any of the three perturbation mechanisms on a normalized polish expression (NPE): (a) Swap of randomly chosen adjacent operands or functional blocks, (b) Swap of randomly selected operand with its adjacent cut line operator, (c) Inverting cutline operators in a randomly chosen cutline or operator chain. Block rotation is not applied as a perturbation as it is taken care of during the shape curve optimization discussed in section III(C).
A new solution is created by the perturbation of the present floorplan solution. The cost difference between the new solution New_sol and present solution Present_sol, is given by h = Cost (New_sol) − Cost(Present_sol). The new solution is better if h< 0 and is inferior if h> 0.
E. METROPOLIS ACCEPTANCE CRITERIA
The following probability criterion has been adopted for accepting a new solution generated upon perturbation.
1) The probability P for accepting a superior solution ( h < 0) is given by P = 1.
2) The probability P of accepting an inferior solution ( h > 0) at annealing temperature T is given by,
An inferior solution is accepted if the probability P defined in (7) is greater than a randomly generated number varying between 0 and 1.
F. STOPPING CRITERIA
DOTFloor adopts the following conditions for stopping the optimization process.
1) LOCAL STOPPING CRITERIA
It is the criteria of annealing cycle due to which it stops when any of the conditions are met: (a) Number of iterations in the annealing cycle exceeds the iteration limit local_iter = 2Nk. (b) Number of probabilistically accepted inferior solutions exceeds the limit σ = Nk. Here N is the number of functional blocks and k is an integral value to be called as the local iteration multiplier. An increase in N ensures the increase in iterations in an annealing cycle necessary for optimization.
2) GLOBAL STOPPING CRITERIA
It is the criteria of the overall optimization process due to which it terminates when any of the conditions are met: (a) Total number of annealing cycles exceed the maximum limit global_iter. (b) The ratio of the number of rejected solutions (rejection count) to the total solutions (accepted and rejected), exceeds the critical limit . (c) Annealing temperature falls below the final value T f .
G. SIMULATION PARAMETERS
The parameters involved in realizing the algorithm have been defined in Table 3 . Maintaining h_avg = 0.001, global_iter = 1000 and = 0.99, the parameters P i , P f , α, k have been varied according to the different combination orders PCj(P i , P f , α, k) as follows:
PC1 9) . As a test case the parameter combinations have been applied for the optimization of ami33 benchmark circuit with weight combination W 1 = 1, W 2 = 0, W 3 = 0 associated with the cost function as defined in (6) . It can be observed from the graph in Fig. 6 that the Cost and the area metric show negligible variance with the final acceptance probability P f (combinations PC1 to PC3 or PC4 to PC6) since the solution converges much before the reaching the final annealing temperature T f characterized by the final acceptance probability P f . The cost and the area metric show a downward trend with the increase in the value of initial acceptance probability P i (combinations PC2, PC5, PC7, and PC8), yet involving a minor increase in the run time. The Cost and area metric also show a downward trend with the increase in α (combinations PC8, PC13, PC14, PC15). However, there is an increase in the runtime with an increase in cooling rate α.
For lower values of cooling rate α, the cost and the area metric shows noticeable improvement with increasing values of the local iteration multiplier k (combinations PC8 to PC12) and becomes constant after a certain value of k (combination PC10). But for higher values of the cooling rate α, there is a minor improvement in cost and area metric with an increase in local iteration multiplier k (combinations PC15 to PC19). Observation shows that the runtime increases with an increase in k. It is clear from the graph in Fig. 6 that combinations PC17 to PC19 give the minimum optimized cost. That is these combinations PC17 to PC19 provide the global minima. In this work, we have chosen the simulation parameter combination PC17 where P i = 0.99, P f = 0.01, α = 0.99, k = 7. The choice of parameters in PC17 ensures a very high initial acceptance probability P i (99%) and a very low final acceptance probability P f (1%) as is required for the operation of the classical Simulated Annealing. Moreover, the combination PC17 provides the global minimum cost at the expense of lesser runtime. The same trend has been observed with other cost metrics and other benchmark circuits as well. Step2: Randomly generate the encoded initial floorplan solution. The initial solution defines the first present solution termed as Present_sol.
Step3: Operate the present solution by the function Cost(), to determine the cost of the present solution given by Present_cost = Cost(Present_sol).
Step4: Initialize the best solution Best_sol as, Best_sol = Present_sol and the best solution cost Best_cost as, Best_cost = Present_cost.
Step5: Execute the n th Annealing cycle at a constant annealing temperature T n . (For the 1 st annealing cycle n = 1 and T n = T i ). Step5 consists of the following sub-steps. If the condition New_cost < Best_cost is satisfied, then update the best solution and best cost as Best_sol = New_sol and Best_cost = New_cost respectively. Execute Step5 until the local stopping criteria is attained.
Step6: For the next (n+1) th annealing cycle update the new annealing temperature T n+1 according to a geometric cooling schedule as T n+1 = α n T n .
Step7: Repeat Step5 for the next Annealing cycle until the global stopping criteria is met.
The output of the algorithm is the optimal floorplan solution denoted by the Best_sol.
V. RESULTS AND ANALYSIS
The proposed algorithm DOTFloor (Diffusion Oriented Time-improved Floorplanner) has been experimented on MCNC (Microelectronics Center of North Carolina) benchmark circuits [32] by considering a random power density distribution for the functional blocks ranging between 0.022 × 10 6 W/m 2 and 2.4 × 10 6 W/m 2 . The average power densities considered are 0.593831 × 10 6 W/m 2 for apte, 0.729076 × 10 6 W/m 2 for xerox, 0.72433×10 6 W/m 2 for hp, 0.729749 × 10 6 W/m 2 for ami33, and 0.728304 × 10 6 W/m 2 for ami49.
A. WEIGHT PARAMETERS IN COST FUNCTION OF DOTFloor AND OPTIMIZED RESULTS
The weight parameters W k in (6) are varied between 0 and 1 during optimization to track the optimal and the other best solutions. The different weight combinations CO i (W 1 , W 2 , W 3 ) in (6) considered by DOTFloor for optimization are CO 1 (1,0,0), CO 2 (0,1,0), CO 3 (0,0,1), CO 4 (0.5,0.25,0.25), CO 5 (0.25,0.5,0.25), and CO 6 (0.25,0.25,0.5). The trade-offs between the cost metrics with the relative variation in their corresponding weight parameter W k can be observed in Table 4 . Moreover, the cost metric shows an improving trend with the increase in the value of its associated weight parameter W k . In Table 4 area shows the maximum improvement when the corresponding weight parameter W 1 is given 100% optimization weightage (i.e.W 1 = 1) in combination CO 1 . Combination CO 1 gives only area-aware optimization. Similarly, CO 2 and CO 3 provide only (wire length metric) WLM-aware and only temperature-aware optimizations respectively. Table 4 shows the best solution attributes obtained by DOTFloor corresponding to the different weight combinations CO i . DOTFloor finds the optimal solution according to the method discussed in section V(C). For the purpose of comparison and validation, the HotSpot tool has been used at the end to quantify the peak temperature (PT) of the optimized solutions of DOTFloor. 
B. WEIGHT PARAMETERS IN COST FUNCTION OF HotFloorplan AND OPTIMIZED RESULTS
For validating the performance of the proposed algorithm, the HotFloorplan tool has been used from the HotSpot tool package available at [33] . For experimentation purposes, the HotFloorplan tool has been configured to handle fixed as well as free and hard macrocells. The cost function of the HotFloorplan tool is
Here in (8) Area denotes the chip area, Tmax represents the peak temperature and WL indicates the wire length metric of the floorplan. The wire length WL in (8) is computed similar to the wire length metric given in (5) . Parameters λ A , λ T , and λ W are the weights of the cost metrics. The default values given in the HotFloorplan tool [33] for the optimal solution are λ A = 5 × 10 6 , λ T = 1, λ W = 350. The weight combinations considered in this work for only area optimization is (λ A = 5 × 10 6 , λ T = 0, λ W = 0), for only temperature optimization is (λ A = 0, λ T = 1, λ W = 0), and for only wire length metric (WLM) optimization is (λ A = 0, λ T = 0, λ W = 350). It can be observed from Table 5 that cost metrics improve with the increase in the corresponding weight parameter value. Table 5 depicts the attributes of the area-aware, WLM-aware, temperature-aware and optimal solutions as obtained from the HotFloorplan tool by the adjustment of the weight parameters in its cost function.
C. OPTIMAL COMBINATION AND OPTIMAL SOLUTION IN DOTFLOOR (DIFFUSION ORIENTED TIME-IMPROVED FLOORPLANNER)
DOTFloor defines a unique method to determine the optimal floorplan solution. It characterizes a 3D Cartesian coordinate frame representing the solution space, to have the following salient features.
(i) The frame has three orthogonal axes viz. x, y and z, each representing the normalized area, wire length metric, and peak temperature respectively. The three axes being normalized get calibrated in the range of zero to one.
(ii) Every point in the 3D-frame represents a floorplan solution of a circuit for a particular weight combination CO i given in Table 4 . The x, y, and z coordinate of the point represent the normalized values of its area, wire length metric, and peak temperature.
(iii) The solution set of a circuit (representing the solution points) are plotted in the frame by normalizing the cost metrics with respect to the corresponding highest metric values of the circuit. For example, considering the solution set of Xerox given in Table 4 , the highest area metric value is 72.7709×10 −6 m 2 . The area metric of solutions (corresponding to the different combinations CO i ) of Xerox is normalized by dividing with the highest area metric value. The similar process follows for obtaining the normalized values of other metrics as well.
(iv) The solution point of a particular circuit closest to the origin of the frame provides its optimal solution. Table 6 lists the Euclidian distances of the solution points of different circuits corresponding to the different CO i weight combination values. Parameter D_Avg represents the average Euclidian distance (taking an average of the five different circuits) for each CO i weight combination. Steps (i), (ii), (iii) and (iv) have been repeated, firstly for fixed block and secondly for free block floorplans. It is clear from Table 6 that the minimum average Euclidian distance is obtained with the CO 5 combination for both cases of floorplans with fixed and free oriented blocks. Table 7 and Fig. 7 show the percentage improvement, obtained by DOTFloor over the HotFloorplan, upon single objective optimization of the targeted cost metric. Table 7 has been prepared by comparing the area metric of the area-aware solutions, wire length metric(WLM) of the WLM-aware solutions and peak temperature (PT) of the temperatureaware solutions of DOTFloor and HotFloorplan tool given in Table 4 and Table 5 respectively. The temperature improvement in Table 7 has been computed in units of degree Celsius. The following observations can be made from Table 7 and Fig. 7: (a) DOTFloor achieves an average improvement of 3.57% area in the area-aware solutions, 5.11% in the wire length metric (WLM) of WLM-aware solutions and, 1.32% peak temperature in the temperature-aware solutions respectively over the HotFloorplan tool, considering the case of fixed macrocell floorplans.
D. COMPARATIVE STIDY OF EXPERIMENTAL RESULTS
1) SINGLE METRIC OPTIMIZATION
(b) DOTFloor achieves an average improvement of 4.04% area in the area-aware solutions, 0.38% wire length metric (WLM) in the WLM-aware solutions, 0.16% peak temperature in the temperature-aware solutions respectively over the HotFloorplan tool, considering the case of rotatable macrocell floorplans. Table 8 and Fig. 8 show a comparative study of the cost metrics of the optimal solutions of DOTFloor and HotFloorplan. The temperature improvement in Table 8 has been computed in units of degree Celsius. The following observations can be made from Table 8 and Fig. 8: (a) DOTFloor has achieved an average improvement of 0.66% area, 2.86% wire length metric (WLM) and 1.41% peak temperature in the optimal floorplan solution composed of fixed macrocells with respect to the HotFloorplan.
2) OPTIMAL SOLUTION
(b) DOTFloor has achieved an average improvement of 9.13% area, -2.74% wire length metric (WLM) and 0.38% peak temperature in the optimal floorplan solution composed of rotatable macrocells with respect to the HotFloorplan.
The average percentage improvement for a cost metric has been computed from the mean of the improvement percent of the metric obtained in the different test circuits (given in Table 7 and Table 8 ). Hence it is noticeably monitored that the DOTFloor provides almost the equivalent quality of floorplan solutions as the HotFloorplan tool and this proves that the developed floorplanner DOTFloor is quite reliable. Fig. 9 and Fig. 10 represent the temperature profile and the floorplans of area-aware, temperature-aware and optimal solutions of DOTFloor for the ami49 benchmark-circuit considering the case of fixed blocks (in Fig. 9 ) and rotatable blocks (in Fig. 10 ). It is clear from Fig. 9, Fig. 10 and Table 4 that the temperature-aware solution has the least peak on-chip temperature but the highest area, the area-aware solution has the least area but the highest peak temperature, the optimal solution has a better area than the temperature-aware solution as well as better peak temperature than the areaaware solution. Table 9 depicts the comparison between the execution times of the DOTFloor and HotFloorplan tool under the different optimization conditions. Fig. 11 clearly shows that DOTFloor provides a very high run-time saving with respect to the HotFloorplan tool for floorplans composed of fixed as well as rotatable and hard macrocells. The DOTFloor gives an average 96.67% improvement in run time over the HotFloorplan tool while maintaining almost the equal quality of floorplan solutions. For a particular solution (Area-aware, WLM-aware, Temperature-aware or optimal) the average run time improvement percent has been computed from the mean of the percentage improvement in run time obtained for the different test circuits (given in Table 9 ).
From Table 10 , it has been observed that the proposed floorplanner DOTFloor achieves improved results in terms of area utilization with respect to the previous works. Since the power considerations in [7] and the present work differ, instead of a direct temperature comparison, a thermal assessment has been done between [7] and the proposed floorplanner by considering the percentage improvement in temperature of the temperature-aware solution over the areaaware solution (which is actually temperature un-aware) of corresponding papers. Table 11 shows that the proposed algorithm achieves better average improvement in peak temperature over [7] .
VI. CONCLUSION AND FUTURE SCOPE
This paper presents a thermal-aware floorplanning algorithm that is suitable for handling fixed as well as rotatable and hard functional blocks or macrocells. The proposed floorplanner DOTFloor (Diffusion Oriented Time-improved Floorplanner) is based on SA and has been developed using the C language and experiments have been conducted in a Linux platform operating on a 3.0 GHz Core-i5 Intel processor. The algorithm adopts a heat-diffusion based thermal model FATT (Fast Assumption Technique for Temperature) which helps in reducing the execution time by providing an assumption of the hotness instead of accurate temperature estimation during the run time of the optimization process. The results of this work show very significant achievement in the runtime efficiency of DOTFloor while simultaneously maintaining an equivalent qualitative performance compared to the Hotfloorplan. Moreover, the proposed floorplanner also shows improved performance over previous works as well. The proposed florrplanner successfully gives a good optimization of area, wire length metric and peak temperature metrics. As a future scope, the work has a good potential of being extended to the design of non-slicing floorplan and 3D IC floorplan as well.
