Abstract-Energy efficiency has become a very important issue to be addressed in today's system-on-a-chip (SoC) designs. One way to lower power consumption is to reduce the supply voltage. Multisupply voltage (MSV) is thus introduced to provide flexibility in controlling the power and performance tradeoff. In region-based MSV, circuits are partitioned into "voltage islands" where each island occupies a contiguous physical space and operates at one voltage level. These tasks of island partitioning and voltage level assignment should be done simultaneously in the floorplanning process in order to take those important physical information into consideration. In this paper, we consider this core-based voltage island driven floorplanning problem including islands with power down mode, and propose a method to solve it. Given a candidate floorplan solution represented by a normalized Polish expression, we are able to obtain optimal voltage assignment and island partitioning (including islands with power down mode) simultaneously to minimize the total power consumption. Simulated annealing is used as the basic searching engine. By using this approach, we can achieve significant power saving (up to 50%) for all datasets, without any significant increase in area and wire length. We compared our approach with the most updated previous work on the same problem, and results show that our approach is much more efficient and is able to save more power in most cases. We have also studied two other approaches to solve the same problem, a simple dynamic programming approach and a lowest possible power consumption approach. Experimental results show that ours can perform the best among these three approaches. Our floorplanner can also be extended to minimize the number of level shifters, to address a minVdd version of the problem and to simplify the power routing step by placing islands close to their corresponding power pins.
I. Introduction
T HERE ARE two kinds of power consumption: dynamic and leakage. Dynamic power is caused by charging and discharging of the load capacitance during switching, while leakage power is due to subthreshold currents when a device is turned off. Energy efficiency is now an important issue in today's system-on-a-chip (SoC) designs because of the increasing power density and the popularity of portable systems. There are many techniques to reduce power consumption. One of the most effective ways is by lowering the voltage supply. Multivoltage design is thus introduced to provide "just enough" power to support different functional operations. Both dynamic and leakage power consumption can be reduced in multivoltage designs. For dynamic power, a minor adjustment to the voltage level can result in a significant reduction in power consumption, which is proportional to the square of the voltage. For leakage power, the consumption can be reduced by powering down some components when they are inactive.
Multivoltage designs involve partitioning of a chip into areas called "voltage islands" that can be operated at different voltage levels, or be turned off when idling. With the use of voltage islands, the chip design process is becoming more complicated. We need to solve the problems of island partitioning, voltage assignment, and floorplanning simultaneously under area, power, timing, and other physical constraints. These problems must be solved at the same time since their results will mutually affect each other. Also, there are other issues to be considered. For example, voltage islands should be placed close to their corresponding power pins in order to make power routing easier and to reduce IR drop. Besides, each island requires level shifters to communicate with others and overhead in area and delay will be resulted. These additional issues have created many new challenges in generating floorplans for multivoltage designs. An example is shown in Fig. 1 . In this example, the possible voltage levels of each core and groupings of similar inactive periods (to generate islands with power down mode) are shown on the right hand side. Assuming that the number of islands is three, one possible partitioning is to group cores A, B, and C as one island operating at voltage 1.0-V, core D on its own as one island at voltage 1.5-V and cores I, K, L, and M as one island at 1.2-V. Notice that other cores will be operated at the chip-level voltage. The island containing I, K, L, and M can be powered down during sleep, as well as the island containing the single core D. A candidate floorplan solution for such a partitioning and voltage assignment is shown on the left.
There are several previous papers addressing this voltage island-driven floorplanning problem. One recent work is by Lee et al. [5] . Given a netlist without reconvergent fanouts, a voltage assignment (with two voltage levels of VDDL and VDDH) is first performed on the netlist according to the timing requirement before the floorplanning step. Level shifters are then inserted into the nets according to the voltage assignment 0278-0070/$26.00 c 2010 IEEE result when a VDDL block drives a VDDH block. At the end, a power-network aware floorplanner is invoked to pack the blocks such that the power-network resource, estimated as the sum of the perimeters of the voltage islands, will be minimized. As a result, blocks in the same voltage island will be placed close to each other. In their approach, the voltage assignment step and the floorplanning step are done separately. Hu et al. [3] have also considered this simultaneous island partitioning, voltage assignment, and floorplanning problem in SoC designs. Simulated annealing is used as the basic searching engine. Given a candidate solution, perturbations are performed to split an island, change the voltage of an island or change all the islands of one voltage to another voltage. Chiplevel floorplanning is then performed to find a floorplan in which compatible islands (islands with the same voltage) are likely to be adjacent. An island merging process is then applied to reduce the number of islands. At the end, island-level floorplanning is done to each newly formed island to shrink its area. The whole process is repeated until a satisfactory solution is obtained. Their approach does not consider islands with power down mode and the search space is large. Mak and Chen [7] have also addressed this problem on SoC designs. Given a floorplanning input, the voltage assignment and island partitioning problem is formulated as a 0-1 integer linear program. In their approach, a few candidate floorplan solutions are generated based on metrics like area and interconnect cost, then voltage assignment and partitioning are performed on these candidate floorplans using the integer linear programming approach to identify the best candidate solution. A fragmentation cost (number of adjacent cores operating at different voltages) is used to model the power network complexity but this cost is not related to the number of islands directly. There are some other previous works which address issues like reliability [12] and temperature reduction [4] in SoC voltage island partitioning and floorplanning. For island partitioning, Wu et al. [11] minimized the number of voltage islands after placement. Guo et al. [2] addressed the voltage assignment and voltage island generation problem in placement to minimize the number of level shifters. Cai et al. [1] proposed a voltage island generation flow in standard cell-based designs to reduce power consumption under performance constraint and to reduce layout overheads caused by cell clustering to form islands.
In this paper, we present a floorplanning method for SoC designs that is tightly integrated with island partitioning and voltage assignment. A preliminary version of it has appeared in [6] . Simulated annealing is used with normalized Polish expression (NPE) [10] as the floorplan representation. 1 Our cost function now considers area, wire length, power, and level shifter usage.
2 NPE is used because the slicing tree structure is a suitable data structure on which island partitioning and voltage assignment can be done optimally and efficiently. In each step of the annealing process, a candidate floorplan solution is generated on which optimal island partitioning and voltage assignment will be performed simultaneously to compute the smallest possible power consumption for that candidate floorplan solution. This is done by dynamic programming with an efficient cost table updating technique. In this way, we can integrate the three steps closely, and reduce the searching space (instead of doing voltage assignment by the "move" operations of the annealing process as in [3] ). In this floorplanning framework, we can also generate islands with power down mode to optimize the total power consumption further. We compared our approach with the most updated previous work [7] on the same problem, and results show that our approach is much more efficient and is able to save more power in most cases, while with less area overhead. Similar to [7] , timing is considered according to the assumption that the given voltage levels of each core are all feasible for achieving timing closure. To further study this problem, we have developed two other methods to solve this voltage islanddriven floorplanning problem. One of them is a simplified version of the dynamic programming approach, and the other one assumes a lowest possible power consumption for each core. We compare these two methods with our approach and experimental results show that our approach can perform the best. The simplified dynamic programming approach performs a little bit inferior to our approach in terms of solution quality, with a 5.37% reduction in running time on average. The lowest possible power approach consumes less execution time and achieves maximum power saving, but the number of voltage islands generated is relatively large, while our approach can generate a solution of which the power consumption is fairly close to the lowest possible value, while the number of voltage islands is much smaller. In addition, we have also extended our floorplanner to consider the area and power overhead of level shifters, to solve a minVdd version of this voltage island driven floorplanning problem (in which each core, instead of having a set of discrete feasible voltage levels, has a minimum voltage level above which the core can be operated properly and achieve timing closure) and to consider the ease of power network routing (proximity to power pins and shapes of voltage islands). By using our approach, we can achieve significant power savings (up to 50%) for all datasets, without any significant increase in area and wire length.
We will define the problem in Section II, we will then discuss the methodology used in Section III. The experimental results will be reported in Section VI before the conclusion and discussion in the last section.
II. Problem Formulation
In this problem, we are given a set of n cores with areas A 1 , A 2 , . . . , A n and aspect ratio bounds [l i , u i ] for i = 1, . . . , n. Each core i is associated with a power table T i that specifies the legal voltage levels for the core and the corresponding average power consumption values such that the cycle time can be achieved. These tables can be obtained by simulations that try applying different supply voltages to the core. The power consumption corresponding to each legal voltage can then be estimated. 3 We are also given a set of m nets {N 1 , N 2 , . . . , N m } and a set of groupings {G 1 , G 2 , . . . , G p } between the cores such that the cores in each group G i have similar inactive periods and will have a s i % saving in power consumption if they are grouped together as an island with power down mode. 4 Given a constant K and a chip-level voltage V c , our goal is to generate a floorplan F with K rectangular voltage islands so that the total power consumption is minimized. 5 Each island will be supplied with the lowest possible voltage level common to all the cores in that island while the remaining cores not assigned to any island will be operated at the chip-level voltage. Islands containing blocks all belonging to the same group G i can have a further reduction in power consumption by s i % by shutting it down during sleep. 6 An alternative scenario of this problem is that each core i, instead of having a few discrete legal voltage levels, has a minimum voltage level minV i such that i can operate properly and achieve timing closure at a voltage level not less than minV i . We call this the minVdd version of this voltage islanddriven floorplanning problem, and will extend our discussion on this in a later part of this paper.
A. Groupings of Cores With Similar Inactive Periods
The grouping information can be generated according to the dynamic behaviors of the cores obtained from a power state machine (PSM) [8] . A PSM represents all the states a chip can operate in and the possible transitions between them. Each state in a PSM describes the operating mode of the cores. For example, a core can be active, idle or sleep in any particular state. Each state also has a weight that represents the relative amount of time the system will remain in that particular state. The larger the weight, the higher is the chance that the system will be in that state within a certain time window. The power to an island with power down mode must remain active if any of the cores in it is active. The relative amount of time a group of cores on the same island will remain inactive can be estimated from the PSM, and the percentage saving in power can then be estimated from it. We can exhaust different ways of groupings in a preprocessing step to find out which grouping is possible, given an estimate of the overhead cost in energy consumption and area of the power gating circuitry. Those potentially beneficial groups will form the G i s in the above formulation.
III. Methodology
Our floorplanner is based on simulated annealing using NPE as the representation. For each candidate floorplan solution represented by an NPE, we will perform an optimal island partitioning and voltage assignment to maximize the total power saving. The cost function of the annealing process is to minimize a weighted sum of the area, wire length, and power. We can also extend our floorplanner to consider level shifters and proximity to power pins. Details will be given in the following sections.
A. Optimal Island Partitioning and Voltage Assignment
Given a candidate floorplan solution represented by a NPE, we can construct the corresponding slicing tree and perform optimal island partitioning and voltage assignment on it. This can be done efficiently by dynamic programming. The pseudocode is shown below. 
Return(min cost).
At the beginning, TreePart(root, k) is called to obtain an optimal island partitioning and voltage assignment of the whole floorplan, where root is the root of the slicing tree corresponding to the NPE under consideration and k is the number of voltage islands we want to produce. When k is zero (line 2), no voltage island is formed in the subtree of u, so the power consumption power(u) will be computed as (V c ) 2 A {u} , where V c is the chip-level voltage and A {u} is the total area of the cores in the subtree of u. For nonzero k, we will first check whether this optimal cost has been computed before and is available in the table for immediate return (line 3). If this value is not available, we will consider the situations when k is one and when k is larger than one separately. When k is one (line 4), there are four cases.
1) Case 1 (line 5), we continue to search for a voltage island in the left subtree of u and let the right subtree operates at the chip-level voltage V c . 2) Case 2 (line 6), similarly, we look for a voltage island in the right subtree of u and let all the cores in the left subtree work at V c . 3) Case 3 (line 7), use the function nonSubtree( ) to group the cores across a number of subtrees along the left tree branches of u into a voltage island (details will be given in the next section). 4) Case 4 (line 8), the entire subtree u is regarded as one voltage island and the power consumption cost({u}) will be computed as
where v {u} is the smallest common voltage among all the cores in the subtree rooted at u and A {u} is the total area of the cores in the subtree rooted at u.
We will compute the costs of the four cases, respectively, and the smallest one will be returned and recorded in the table for future use. When k is more than one (line 12), we will recursively call the procedure TreePart( ) to exhaust all the possible partitionings of the subtree of u, including both intersubtree partitionings (line 13) and intra-subtree ones (lines 14-16). The minimum one will be returned and recorded in the table.
1) Voltage Islands in nonSubtrees:
Notice that a voltage island (a rectangular region) may be formed by a set of contiguous right subtrees linked by internal nodes of the same type. An example is shown in Fig. 2 . Therefore, we need the procedure nonSubtree( ) to enumerate these cases. The pseudocode of nonSubtree( ) is shown in the following. In this procedure, we exhaust all the cases of forming one island with two or more contiguous right subtrees and the one with the smallest power consumption will be returned. On line 7, we compute the cost of grouping the right subtrees in S as one island and having the remaining k − 1 islands in the last left subtree (subtree D in the example of 
min cost
= ∞ 2. S = rchild(u) 3. op = operator(u) 4. While operator(lchild(u)) is op 5. u = lchild(u) 6. S = S ∪ rchild(u) 7. C = TreePart(lchild(u), k − 1) + cost(S) 8. If min cost > C, min cost = C 9. Return(min cost).
2) Proof of Optimality:
The procedure TreePart( ) will give the optimal partitioning to minimize the total power consumption. Given a candidate floorplan solution represented by a normalized slicing tree rooted at u and the number of voltage islands required k, TreePart( ) will exhaust all the possible cases recursively and return the best solution. When k is zero, there is only one case that all the cores in the tree rooted at u (called T u ) are operated at the chip voltage. When k is one, there is only one voltage island among all the cores in T u . The first three cases are obvious: 1) the island is in the left subtree of u; 2) the island is in the right subtree; and 3) all the cores in T u form one island. There is still a case that the island is formed between the left and right subtrees of u. This may happen only when two consecutive internal nodes are of the same type (both "+" or both "*"). In a normalized slicing tree, an internal node will not be of the same type as its right child, so this will happen only along the left branch. nonSubtree( ) will exhaust this last case of forming one island by a set of contiguous right subtrees along the left branch rooted at u. When k is larger than one, the cases are similar to those when k is one and TreePart( ) will exhaust all different ways of distributing the k islands between the left and right subtrees of u and the case of having an island lying between the two subtrees. Since TreePart( ) has exhausted all different cases of forming k islands in a given candidate floorplan, the solution returned by TreePart( ) is optimal.
3) Handling Island With Power Down Mode: Islands with power down mode can be easily handled in our framework. When computing the power consumption of an island formed with the cores in the set of subtrees rooted at a node in set X by calling cost(X) in the procedure TreePart( ) and nonSubtree( ), we only need to check if all the cores belong to one group G i for some i = 1, . . . , p. If this is true, the island formed can be shut down during sleep and have an additional power saving of s i %. 7 Notice that an island containing just one single core can also be shut down during sleep mode if the core belongs to some idle state group. In this way, our floorplanner can give optimal island partitioning and voltage assignment taking into account islands with power down mode given any candidate floorplan solution.
4) Speedup in Implementation and Complexity: A table cost table [v] [j] for j = 1, . . . , K is kept at each internal node v of the slicing tree to record the optimal power consumption of partitioning the cores in the subtree rooted at v into j islands. This data structure can help to minimize the number of recursive calls and to avoid repetitive computations. It can be seen from the procedure TreePart( ) that whenever we want to find the optimal power saving at a node u with k voltage islands, we will first check whether this is computed before and the required information is available from cost table [u] 
If it is available, the optimal value is returned immediately (line 3). Otherwise, it is computed recursively and the computed value will be saved in the cost table to be used in some later steps (lines 10 and 17). After a move in the annealing process, only a small part of the whole slicing tree will be changed and we only need to update the tables of the affected nodes once. The affected nodes will be those lying on the paths from the modified parts of the tree to the root. For those affected nodes, the corresponding cost tables will be flagged as "not updated" and will be updated during the recursive calls.
The size of the cost table is O(nK) where n is the number of cores and K is the number of voltage islands, because each internal node v has K entries in the table, cost table [v] [j] for j = 1, . . . , K, and there are n internal nodes in total. For each affected node v, we need to update all the K entries of its table once. Since each entry is just updated once, the time complexity will be the same as that of updating all the affected nodes in a bottom-up fashion from the leaves to the root. If the nodes were updated from the leaves to the root, the time taken to update a table at a node v was O(K 2 ), because there were K entries and each entry took O(K) time (the tables of v's children have already been updated). Therefore the total time to perform all the updates in each iteration is (number of affected nodes )
. This is O(K 2 n) in the worst case, and is (K 2 log n) on average.
B. Annealing Schedule
In our annealing engine, the temperature is set to 10 5 at the beginning and will drop at a rate of 0.95. At each temperature, 30 * n random moves are performed, where n is the number of blocks in the dataset. The annealing process stops either when less than 0.5 percent random moves are accepted at a certain temperature, or when the temperature falls below 10 −5 .
7 Even if some cores in the group are not in the island, we assume that there is still s i % saving in power since this percentage saving is computed based on the amount of time each core can be shut down in comparison with the total computational time. However, the way to calculate this saving in power can be modified for different problem formulations.
C. Moves
There are three kinds of moves to change the NPE of a candidate floorplan solution during annealing. This set of moves has been proven to be complete to change any arbitrary solution to any other arbitrary solution. 
D. Cost Function
We use the cost function ψ = A + λ w W + λ p P to evaluate a floorplan where A is the area of the floorplan, W is the total wire length estimated by the half perimeter bounding box and P is the total power consumption. The parameters λ w and λ p are the weights which will be set at the beginning of the annealing process by random walks to make the three terms similar in weighting. Note that this cost function can be modified to consider routability (by having an additional term on congestion estimation) or the fixed-outline constraint (by replacing the area term A by λ ∞ (max{0, w−W }+max{0, h− H }) where λ ∞ is a large positive constant, w and h are the width and height of the candidate floorplan solution, and W and H are the fixed width and height required for the final floorplan design).
IV. Extension to Basic Methodology
Based on the framework discussed in the above Section III, the methodology can be modified to handle the following varied versions of the problem, minVdd optimization and adjustable background voltage.
A. MinVdd Optimization
As discussed in the problem formulation section, an alternative scenario of this problem is that each core i, instead of having a few discrete legal voltage levels, has a minimum voltage minV i such that i can operate properly and achieve timing closure at a voltage level not less than minV i . Our general framework can be readily modified to address this minVdd version of the problem. The major modification made to the dynamic programming is that when we get down to the base case that the whole subtree forms only one island, instead of finding the lowest voltage level legal to all cores in the island, we should compute the highest minV i among all the cores i in the island and take that as the voltage level being used in that island. We can achieve this by computing the function cost({u}) of a subtree u as follows:
Notice that since we can control the number of voltage islands by adjusting the parameter k in the procedure TreePart(root, k), the floorplanning process will not end up with getting more voltage levels than desired. That means, the algorithm has already taken into account the maximum number of voltage levels allowed to find an optimal way of partitioning and voltage assignment such that the total power consumption is minimized. 
B. Adjustable Background Voltage
In the problem formulation, it is assumed that a background chip-level voltage V c is given. Our framework can also be used to address an extended version of the problem in which the background voltage is not fixed yet. 8 The final chip-level voltage is determined by the minimum (or maximum) feasible voltage level among all the cores which are not grouped into any voltage island. To achieve this, we need to expand the table at each node L times where L is the total number of possible voltage levels among all the cores. In this case, each entry cost table [v] [j][V l ] will be the optimal power consumption of partitioning the cores in the subtree rooted at v into j islands when the background chip-level voltage is V l where 1 ≤ l ≤ L. Notice that some V l where 1 ≤ l ≤ L may be impossible to be used as the background voltage in some cases and we can use a very large cost value to denote these cases. Besides the changes made to cost table, the procedures power( ), TreePart( ), and nonSubtree( ) will also be modified to return an array of L values corresponding to the L cases of using a different background voltage. Steps 9-10 and steps 15-17 in the pseudocode TreePart(u, k) will all become array operations treating the L different choices of background voltages separately and correspondingly.
All entries in the tables corresponding to different background voltages can be updated simultaneously during the recursive calls and the run time is thus only linearly scaled up by L. This is very affordable in practice since the number of possible voltage levels L is usually small. The memory size of such expanded cost table will be O(nKL).
V. Alternatives to Solve the Problem
To further study the problem, we have developed two other methods to solve the same voltage island-driven floorplanning problem. One of them is a simplified version of the dynamic programming approach described in Section III-A, and the other one assumes a lowest possible power consumption for each module.
A. Simplified Dynamic Programming Approach
In this simplified dynamic programming approach, the islands formed across the left and right subtrees are not taken into account, i.e., step 7 and step 13 in the pseudocode of the procedure TreePart(u, k) in Section III-A are removed. This approach is clearly suboptimal for a candidate floorplan, 9 since the solutions that contain islands formed across subtrees are neglected. However, the solution quality may not be significantly degraded by this simplification, since the potentially optimal grouping of some cores across subtrees in one candidate floorplan is very likely to be generated in another one in which these cores are lying in the same subtree, due to the stochastic property of simulated annealing.
B. Lowest Possible Power Approach
In this lowest possible power approach, each core is set to operate at its lowest legal working voltage level, so that the largest possible power saving (without power down mode) can be achieved. The dynamic programming procedure in this approach then simply need to count the number of voltage islands required for each candidate floorplan, and this number will replace the power consumption term in the cost function of the annealing process. This counting can be done recursively and the pseudocode is shown below. The variables Llevel and Rlevel are used for possibly merging of two neighboring islands of the same voltage level, which are separated into two subtrees of one parent node in the slicing tree. We can easily see that the run time complexity is O(n) where n is the number of cores, and this complexity is the smallest possible. 
If u contains a single core i
If operator(lchild(u)) equals op 13. 
VI. Experimental Results
We have done experiments on the GSRC floorplanning benchmarks. Since no voltage information is provided in those benchmarks, we have randomly generated the voltage levels for each block from the set {1.0-V, 1.1-V, 1.2-V, 1.3-V, 1.5-V} and 1.5-V is assumed to be the chip-level voltage. In each dataset, groups of blocks with similar inactive periods are also randomly generated. Table I shows the details of each dataset, the fourth column indicates the grouping information of similar inactive periods, e.g., for n30, there are two groups: one contains five cores with 30% power saving if being grouped together, and the other one contains five cores with 20% power saving if being grouped together. This grouping information is now generated randomly. 10 Our algorithm is implemented in C language and all the experiments are performed on a Sun Blade 2500 with a 1.6 GHz CPU and 2 GB RAM. The results are shown in Table III . For each dataset, we performed voltage island driven floorplanning with the number of voltage islands generated ranges from zero to six. 11 We could see from the results that up to 50% power saving can be achieved without any significant degradation in area and wire length. In addition, the speed is very acceptable and promising. Some resultant floorplans are shown in Figs. 3 and 4 . Fig. 3 is a resultant packing of dataset n100, with four voltage islands generated. In Fig. 4 , we aim at testing a particular situation in which some cores can be operated at very low voltages. We use playout of the MCNC benchmark as the test case, and assign 0.6-V to cores {2-9}, and 0.8-V to cores {12-19}, respectively, as their minimum working voltage. Floorplanning 
A. Comparison With Previous Work
In order to compare with the results of the previous related work [7] on exactly the same problem, 12 we have done another set of experiments with the benchmarks provided by the authors of [7] without idle islands. In their datasets, the available voltage levels for each cell are chosen from the set {1.1−V, 1.3−V, 1.5−V, 1.8−V }, the area of each level shifter is 10 × 10 and the power cost is one. The comparisons are displayed in Table II . Results show that our approach is much more efficient and is able to save more power in most cases, while with less area overhead. Note that we set K = 4 in the experiments, i.e., four islands are generated for each test case.
B. Comparison With the Simplified Dynamic Programming Approach
In this simplified dynamic programming approach, the islands formed across the left and right subtrees are not taken into account, This approach is suboptimal in terms of power saving with respect to a particular candidate floorplan, but due to the stochastic property of the annealing process, the solution quality will not be significantly degraded. This observation is supported by the experiments. Table IV displays the experimental results of this simplified dynamic programming approach, while Table V compares this approach with our approach, from which we can see that this simplified dynamic programming approach performs a little bit inferior to our method in terms of solution quality, with a 5.37% reduction in running time on average.
C. Comparison With the Lowest Possible Power Approach
In this lowest possible power approach, each core is set to operate at its lowest legal working voltage level, so that the largest possible power saving can be achieved. This approach provides an upper bound on power reduction for us 12 In [7] , it is also assumed that the legal voltage levels of a core are the feasible operating voltages at which the core will satisfy its performance requirement. to compare with. From Table III , we can see that the power consumption of our approach is very close to this lowest possible value, but the number of voltage islands is much smaller. Note that the efficient implementation of the island counting procedure [pseudocode IslandCount( )] contributes to the relatively shorter execution time of this lowest possible power approach.
D. Extension to Consider Area and Power Overhead Due to Level Shifters
Level shifters (LS) are needed for connections between two blocks in different power domains. These level shifters will lead to area and power overhead and should be minimized. We can extend our floorplanner to minimize the usage of level shifters by having additional terms in the cost function that represents the total area and power consumption due to level shifters. We assume that a level shifter is needed whenever a wire is connecting two blocks operating at different voltages. For example, if a net connects a source in voltage island A to three sinks, two in island B and one in island C, two level shifters will be inserted, one between A and B and one between A and C. These level shifters will add to the total area and power consumption of the design. This extra area and power usage can be estimated according to the size of a level shifter which can then be included into the area and power term of the cost function, ψ LS = (A+A LS )+λ w W +λ p (P +P LS ), where A LS and P LS are the total area and power consumption due to the level shifters, respectively. 13 The result is shown 13 The extra area due to the level shifters are considered and reflected in the cost function as described above. In terms of the final layout, we can proportionally increase the area of each core according to the number of level shifters inserted due to signals coming out from it. In this way, the final floorplan obtained from the annealing process can be slightly changed to take into account this extra area due to the level shifters.
in Table VII . In these experiments, the number of voltage islands K is set to four. Results show that our method can reduce the number of level shifters by 67.1% on average and can reduce total power consumption as well as total area, with some penalty in run time.
E. Extension to Consider Power Network Routing
The voltage islands should be placed in proximity to the power pins to simplify the power routing step and to minimize the IR drop. The power network resources can be modeled by the sum of the half perimeters of the islands [5] . We can also extend our floorplanner to consider these power network issues by having additional terms in the cost function (with weights determined by random walk) to represent: 1) the total distance of the voltage islands from their respective power pins; and 2) the sum of the half perimeters of the islands. In our experiments, we assume that the positions of the K power pins are given. In each iteration of the annealing process, each island is matched to a power pin such that the total distance between them is the smallest possible. This total distance and the sum of the islands' half perimeters will be minimized during the annealing process.
Two resultant packings of the n300 dataset consisting of 300 blocks are shown in Fig. 5 , and they are produced with the fixed-outline constraint. The packing in Fig. 5(a) is obtained by the original floorplaner, without taking into consideration these power network issues, while the one in Fig. 5(b) is obtained by this extended version. There are four power pins located at the center of each boundary in this example. We can see from the figures that the four islands are shifted to the sides of the chip containing the pins in order to be located closer to their respective power pins, and the islands are closer to square in shape that will favor IR drop reduction and power network routing.
F. Experiments With minVdd
In this version of the problem, instead of having a few discrete legal voltage levels, each core has a minimum voltage minV i such that i can achieve timing closure at a voltage level not less than minV i . Our framework can be easily modified to address this minVdd version of the problem. The results of applying our dynamic programming approach to this problem is shown in Table VIII . We can see that the running time is much shorter in comparison with Table III because it is easier to compute the maximum minV i among a group of cores than finding the smallest voltage level feasible to all of them.
VII. Conclusion
In this paper, we have proposed a simulated annealing-based approach for the floorplanning problem with simultaneous island partitioning and voltage assignment. The three factors area, wire length and power consumption of the resultant floorplan are concurrently taken into consideration, and the experiment results have shown that we are able to achieve a significant power saving of up to 50% for the testing datasets.
In addition, when extended to minimize the number of level shifters between different voltage domains, our method can reduce 67.1% of the LS used on average, with some penalty in power saving; and it also functions well to generate voltage islands in proximity to their corresponding power pins, by having an additional term in the cost function to minimize the total distance between voltage islands and power pins.
