Floorplanning is a crucial phase in VLSI Physical Design. The subsequent placement and routing of the cells/modules are coupled very closely with the quality of the floorplan. A widely used technique for floorplanning is Simulated Annealing. It gives very good floorplanning results but has major limitation in terms of running time. For more than tens of modules Simulated Annealing is not practical. Floorplanning forms the core of many synthesis applications. Designers need faster prediction of system metrics to quickly evaluate the effects of design changes. Early prediction of metrics is imperative for estimating timing and routability. In this work we propose a constructive technique for predicting floorplan metrics. We show how to modify the existing top-down partitioning based floorplanning to obtain a fast and accurate floorplan prediction. The prediction gets better as number of modules and flexibility in their shapes increases. We also explore the applicability of traditional Sizing Theorem when combining two modules based on their sizes and interconnecting wirelength. Experimental results show that our prediction algorithm can predict the area/length cost function normally within 5-10% of the results obtained by Simulated Annealing and is, on the average, thousand times faster.
INTRODUCTION
Given a circuit represented by a hypergraph, the floorplanning problem is to determine the approximate location of each module in a rectangular chip area. An important step in floorplanning is to decide the relative position of each module. A good floorplanning algorithm should minimize the total chip area, make the subsequent routing phase easy, and improve performance, by, for example, reducing signal delays. To facilitate subsequent routing phases, modules with relatively high connections should be placed close to one another. The set of nets thus defines the closeness of the modules. Placing highly connected modules close to each other reduces routing space/congestion. Wirelength minimization aims to do exactly that.
Several strategies have been adopted for floorplanning. The rectangular dual graph [22] approach to floorplanning is based on the proximity relation of a floorplan [14] . The algorithm for constructing a rectangular dual floorplan, if one exists, has been reported to run in linear time [1] . Other works on floorplanning by graph-dualization have been reported by Yeap et. al. [37] [38] . The hierarchical (bottom-up/top-down) approach to floorplanning is based on a divide and conquer paradigm, where at each level of hierarchy, only a small number of rectangles are considered [4] . A hierarchical approach to floorplanning has been reported in [17] , and research on this class of floorplans has some encouraging results [31] [35] [18] [27] . The partitioning based hierarchical floorplanning has been studied in [4] [15] [16] . Simulated Annealing is a technique used to solve general optimization problems, floorplanning problems being one of them [32] . Simulated Annealing examines the configurations of the problem in sequence. Each configuration is a feasible solution of the optimization problem. Simulated Annealing is limited to very small number of modules (40 -50) as reported in [33] [34] . Simulated Annealing has been the best known floorplanner so far because others (like partitioning based hierarchical) have difficulty matching the shape. However, because of its semi-exhaustive nature Simulated Annealing takes substantial computation resources and is extremely slow.
Here, we address fast prediction of floorplan metrics. Floorplanning needs to be used in the heart of many synthesis applications (High Level Synthesis, Logic Synthesis for example). Designers want to evaluate the effects of early designing decisions on the final layout hence there is a real need for fast floorplan predictors. With large systems (hundreds of modules) Simulated Annealing fails. What is needed is a fairly accurate and very fast floorplanning technique to be able to convincingly predict the metrics of the layout i.e., area, wirelength, relative positions of the modules. The reduction in device sizes has lead to the integration of microprocessor and chipset. The concept of integrating whole system on a chip is fast gaining grounds. Quick floorplan predictors are needed in this domain as many different combinations exist for chipsets. It is predicted that the performance improvement in deep sub-micron, for the existing fabrication technologies, will come mainly from architecture and circuit [26] . Floorplan predictors that can assess viability of a chip at the architectural level are needed. Fast floorplan predictors will help the architects to decide the features and their impact on layout.
Prediction methods can broadly be classified into statistical and constructive. Major work on prediction has been towards statistical prediction. Wirelength estimation has been studied in [5] [19] . Kahng et. al. [2] discuss three basic types of wirelength estimations : a priori estimation is predicting wirelength of layout in advance, a posteriori estimation is used when for a fixed placement post-routing wirelength is predicted and online estimation attempts to predict wirelength as the hierarchical layout progresses. Rent's rule has been extensively studied and used for statistical estimation [24] [9] [28] . Other works related to wirelength estimation have been e.g., MST-based [12] , Bounding Box based methods [2] . Kahng et. al. [2] propose new bounding box estimators for on-line wirelength estimation and wirelength estimators using pure analytic techniques. The problem with statistical prediction is that circuits are not homogeneous. Circuits have been designed hierarchically and have high connectivity at low level. These features are difficult to model statistically.
Constructive prediction (algorithmic heuristics) has been given very little attention. General guideline on constructive prediction was outlined in SLIP'99 positional statement [21] . In this work we present constructive prediction of floorplan metrics (area/wirelength cost metric). Our claim is that fast algorithms can exploit the features of the system to act as better predictors and eventually as constructors. The difficult thing in floorplanning is shape-management (which Simulated Annealing does perfectly). Flexibility in representation of modules reduces shape management and lets one concentrate on other objectives (wirelength etc.). We propose a fast algorithm as a predictor based on modification of ex-isting top-down hierarchical partitioning approach to floorplanning. We also show that the predictor can be extended to act as constructor when modules have high shape flexibility. Having constructivepredictor as the constructor enables one to control and guide the solution quality more effectively, a privilege that statistical predictors lack.
The rest of the paper is organized as follows. We present several relevant definitions in Section 2. Section 3 presents several models of flexibility. Various heuristics associated with our approach have been discussed in Section 4. Section 5 gives an outline of our algorithm. In Section 6 we discuss the time complexity of the algorithm. Section 7 discusses our experimental setup and results for prediction and construction. We present concluding remarks and direction for future work in Section 8.
DEFINITIONS
In this section we present definitions related to our work.
Flexibility
Flexibility, f, as the term indicates is a measure of how flexible a module is, in terms of shape and number of representations. The more flexible a module better are the chances of obtaining a compact floorplan or in other words more saving in dead-space. Intuitively we can say that if we have modules of infinite flexibility then we can obtain floorplans with zero dead-space. The shape curve for a flexible module [3] [36] is shown in Figure 1 . We did some experiments with TimberWolf to analyze the shape curve of some modules. We had TimberWolf place a cluster of cells in rectangular areas with varying aspect ratios and observed the width/height relation of the final placement. From our experiments and some real data on flexible modules [20] , we observed that the shape curve is almost constant, i.e.,
, where 4 generally varies between 5 -10 %.
Slicing Tree
A slicing tree is the binary tree representation of a sliceable floorplan. The leaves of the tree correspond to modules and an internal node defines how its child floorplans are combined to form a partial floorplan. Based on the type of internal node we define three types of slicing trees.
Empty slicing tree
As the name indicates, an empty slicing tree has vacant internal nodes, i.e., there is no cut or orientation (relative position of the children) associated with the internal nodes.
The floorplan for an empty slicing tree has just the positional information of the modules. No decision is yet made on how these modules will combine (vertical/horizontal) to produce (sub)floorplan. An example is shown in Figure 2. 
Cut-based slicing tree
Each internal node of a cut-based slicing tree denotes the cut of the children nodes (vertical 8 or horizontal -). There is a complete freedom on the ordering of the child nodes.
The corresponding floorplan has only the cut information. The order in which modules will be placed (left/right for vertical sizing,
Corresponding floorplan A Cut-based Slicing Tree is not yet fixed. Figure 3 shows such a slicing tree.
Oriented slicing tree
The oriented slicing tree is an extension of cut-based slicing tree where in addition to the cut the relative position (left/right, top/bottom) of the children nodes for an internal node is also fixed. F  F I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P 
MODELING FLEXIBILITY
Enlisted below are some models to calculate flexibility of a module. As we shall see in the next section, an accurate assessment of flexibility of a module is important for our work.
Based on maximum dimensions of the module
We assume that we have an ordered listing of representations of modules, i.e.,
where N is the number of distinct representations of the module.
One can say that flexibility in width is
and flexibility in height is
And total flexibility is
There are several shortcomings of this model. One can easily come up with examples where a module with only few distinct representations is estimated as more flexible than one with more number of representations.
Based on incremental sum
This model calculates the ratio of width and height of successive representations and sums them all to obtain flexibility
Though this model fares better than the previous one in terms of giving a fairly accurate picture of flexibility but fails to take into account the number of distinct representations, N, of the modules.
Flexibility as a function of incremental sum and number of distinct implementations
What is needed is some kind of flexibility function F which depends on
We also need to define a transitive relation on this function to be able to say that one module is more flexible than the other, i.e.,
Also the property of sum '+' needs to be defined on this function to be able to sum the flexibilities of the modules, i.e.,
In general it is difficult to come up with a unique flexibility function which will satisfy the above two properties for all the combinations of modules and tell us with certainty about the flexibility of any module. The problem lies more with trying to define flexibility as a number than with the function itself. For our work we have chosen the flexibility function to be
This function can assess the flexibility of a module with fair amount of certainty. We will show the application of these models on three 2x6 area modules where module1 has a rigid (rotatable) 2x6, 6x2 representations; module2 has 2x6, 4x3, 6x2 representations and module3 has 2x6, 3x4, 4x3, 6x2 representations. does not truly reflect the flexibility of the module. A module with many representations is estimated to be having the same flexibility as the rigid one. .N, can tell with fair amount of certainty about the flexibility of various modules. The slight change in flexibility is reflected as a large change in the number corresponding to that. Addition of even one extra representation contributes large to the flexibility number. We have experimented with these three flexibility models and indeed the last one,
.N, produces the best results.
PREDICTION HEURISTICS
In the previous section we discussed how to obtain a good measure of flexibility of a module. In this section we discuss related heuristics which make prediction more effective. These heuristics are added to an existing top-down hierarchical partitioning approach, discussed in Section 5.
Balance with respect to area and flexibility
After obtaining a measure of the flexibility of a module it is imperative to balance the two partitions with respect to area and flexibility. Balance on area helps in saving dead-space and balance on flexibility helps shape-management by pairing up rigid modules with the flexible ones.
The observation is that combining a flexible module with a rigid one is better than combining a rigid module with another rigid module.
Figure 5. Partitioning based on area balancing
This argument is best explained by Figure 5 . We partition the four modules into two regions, one region has all the flexible modules and the other one has all the rigid modules. Though these two partitions are balanced on area, we will get a lot of dead-space even if we take a lot of care in balancing area on further partitions. As is evident from the figure, the right half will result in a lot of deadspace. Though the modules in the left half will combine to produce a compact partition, the overall effect will be a bad floorplan because of the right half. Now consider the case where the two regions were balanced on flexibility and area as shown in Figure 6 . Here half of the flexible modules are on one side and half on the other. As is evident from the figure, the rigidity of the rigid module is annulled (may not be fully) by the flexibility of the flexible module. The overall effect is a much better floorplan than if we had balanced with respect to area only.
Sizing Theorem and Wirelength
Although area management is important, we also need to consider wirelength, for wirelength closely models routability [30] . We first present our lemma on sizing which forms an extension of the famous sizing theorem [18] Hence we need to retain only non-redundant area implementations as given by the sizing theorem. Such a list of non-redundant implementations contains all the minimum wirelengths for the combination. The proof can easily be extended to Bounding Box based wire models.
Note that this proof is valid only when we are combining two modules. That is, this wirelength minimization scheme by retaining non-redundant implementations is at best a local one. For global wirelength minimization we may still need to retain all the 0 x implementations of the modules s and t . However, it is easy to see that such a choice will result in exponential growth in the number of implementations of the final floorplan. For our work we are retaining the non-redundant implementations at all levels of sizing. That is, even when combining two sub-floorplans (each of these may itself be a combination of several modules) we retain only the non-redundant implementations.
Determining the Cut-Orientation
The major bottle-neck of any hierarchical floorplanning approach is how to decide on the cut (horizontal or vertical). There are several possible ways of combining modules in the system and till now there is no universally agreed upon technique for that. A most common practice is to pre-decide on the cut-orientation. However, this technique overlooks the importance of exploring the various cut possibilities. The commonly used techniques "of retaining the cut which produces least area for the combination" can not be used with flexible modules. We propose following heuristic for deciding the cut-orientation of a combination of flexible modules.
Choose the cut-orientation which makes aspect-ratio of the combination closer to 1
The equivalent aspect-ratio of a flexible module is the product of the aspect-ratios of its individual implementations. It is not hard to see that the equivalent aspect-ratio of a highly flexible module will be almost 1.
The motivation for desiring the aspect ratio to be closer to 1 is to reduce the perimeter of the final floorplan. A square shaped module will obviously have lesser perimeter than a rectangular module of same area. We evaluate the equivalent aspect-ratio of the combined module for both horizontal and vertical cut and retain that cut which gives an aspect-ratio closer to 1.
Length Minimization by Swapping Sub-floorplans
Our model of length estimation is based on the center-to-center Euclidean distance between the modules. Same model is being used in the Simulated Annealing [Wong-Liu] [33] floorplanner that we use for comparing our results.
The bi-partitioning approach may itself at times aid in increasing the wirelength. This kind of situation arises because after initial partitioning the sub-partitions behave as independent partitions and have no constraint on where the connected modules in the two subpartitions be placed to minimize the wirelength. The situation is best depicted in Figure 8 where this independent partitioning has put connected modules in diagonally opposite blocks. One can loose highly on wirelength if this happens for several sub-partitions.
Figure 8. Bipartitioning may push some connected modules further apart
We propose following Partition-Swapping heuristic which can be utilized as a post-processing step to correct (may be partially) such undesirable situation.
Recursively swap two sub-floorplans (a top-down approach)
connected modules placed far apart swapping two sub-floorplans
Figure 9. Swapping two sub-floorplans
The procedure is shown in Figure 9 . The idea is to swap the sub-partitions (starting from top), retain the configuration that gives wirelength improvement and proceed to lower sub-floorplans. This may lead to substantial wirelength improvements (at times 30-40% improvement was observed). Our algorithm is based on hierarchically partitioning the circuit in a top-down fashion. In the process various sizing and wirelength improvement heuristics (discussed in previous section) are applied to obtain a reasonably accurate floorplan in terms of area and wirelength cost. The algorithm has three main phases. The three phases demonstrate the building up of the floorplan through slicing trees, from empty to cut-based to oriented.
A PREDICTION ALGORITHM

Hierarchical Partitioning Phase
The circuit is recursively bi-partitioned using a variation of Fiduccia-Mattheyses min-cut heuristic [7] called hMetis Hypergraph Partitioner [13] . The basic idea behind using min-cut based partitioning is to minimize the connectivity between the two partitions (a restriction imposed by the fact that wires take a substantial portion of the chip area). While partitioning we need to ensure that the two halves are balanced with respect to area and flexibility. In Section 4 we made several observations which justify the need for distributing flexible modules to ensure that rigid modules get paired with flexible ones.
Using hMetis as the partitioner
hMetis is a software package for partitioning large hypergraphs. The algorithms in hMetis (based on multilevel hypergraph partitioning) reduce the size of the graph (or the hypergraph) by collapsing vertices and edges, partition the smaller graph and then construct a partition for the original graph. These highly tuned algorithms allow hMetis to quickly produce high-quality partitions.
Since we are using the library functions provided by the hMetis package it is difficult to have both area and flexibility balance in the same hMetis run. Instead we run area-balanced hMetis several times and choose the most flexibility balanced run from these. In true sense the two phases, partitioning and cut-deciding, are coupled. We are building the empty slicing tree and the cut-based slicing tree at the same time. The top-down partitioning basically builds the empty slicing tree and the bottom-up sizing transforms it to a cut-based one. The cut-based slicing tree thus obtained has sizing information but the relative ordering of the modules/subfloorplans is still undecided.
Wirelength Improvement Phase
The cut-based slicing tree has a default orientation (arbitrary). The final floorplan may itself have multiple non-redundant implementations. For any such implementation the sizes of the subsequent sub-floorplans/modules are unique. This phase does the job of assigning orientation to the internal nodes of the slicing tree so that the wirelength of the system gets minimized (for a particular floorplan size).
The slicing tree obtained after the partitioning and sizing of the nodes is a cut-based one. Till this step we do not consider the orientation of the internal nodes. The length minimization heuristic is basically trying the various orientations of the internal nodes in a greedy fashion (we always retain the current best and move to subsequent sub-partitions) to improve on the length. A typical situation that may arise because of the independent partitioning is shown in Figure 11 . Here the two connected modules X and Y have been inadvertently put at a larger distance because of the independent partitioning discussed earlier. By swapping proper internal nodes we can get rid of such situations. Of course any such sequence is retained only if it improves the overall length. The resulting oriented slicing tree is shown in Figure 12 .
TIME COMPLEXITY OF THE ALGORITHM
The algorithm has following major steps,
Populating the data structures
With an efficient use of data structures the population time is guided by the number of (hyper)edges (hyperedge is defined as the edge with two or more modules connected to it) in the system ( v . Considering the fact that number of modules in each hyperedge is generally much smaller than the number of hyperedges itself, the time required for populating our graph data structure is ( v e where is a factor dependent on number of modules connected to each hyperedge ( is normally between and ¡ ).
Recursive bi-partitioning using hMetis and sizing
FM has been shown to be having a linear time complexity e [7] where is the number of terminals in the system. Since hMetis also employs FM as its partitioner, the analysis of time complexity remains the same (other parts of hMetis partitioner such as clustering have linear time complexity 
Length minimization
We traverse down the slicing tree and try the swapping at each internal node. After any such swap we need to fix up the shapes of the nodes affected by the swap. This involves sizing of the affected nodes too. The complexity of such an operation is
EXPERIMENTAL SETUP AND TEST CIRCUITS
For our work we have chosen nine circuits (ind1, ind2, hway2, fract, prim1, prim2, prim1 small, prim2 small1, prim2 small2 and prim2 small3). The details about the number of modules and edges in the circuits are given in Table 2 . The modules in our floorplan circuits are the cluster of cells from the circuits used in placement. We used TimberWolf1.4.0 to place these cluster of cells with varying aspect-ratios to generate shapes for the modules. The data for the shape and size of the flexible modules were generated based on the shape-curve given in Figure 1 . The flexible modules in our floorplan circuits have aspect-ratios ranging from
. Each floorplan circuit has four different percentages of rigid modules in the system (From
. ind1 (10) implies that circuit ind1 has 10% rigid modules in it). Simulated Annealing floorplanner [Wong-Liu] [33] has been used for comparing the quality of our algorithm. We also compare the quality of our Wong-Liu floorplanner with TimberWolf1.4.0 floorplanner. Since the wirelength models of the two are different, we compare area and run-times. It turns out that our Wong-Liu Simulated Annealing floorplanner is as fast as TimberWolf Simulated Annealing floorplanner and the area results (TimberWolf produces a larger area because it leaves some space between modules for routing channels or pad spacing) are comparable, as given in Table 3 . The main motivation for comparison with TimberWolf has been to make sure that our implementation of Wong-Liu floorplanner is fast and of good quality when compared to other Simulated Annealing implementations (we are reporting results for only two of the test circuits). All the experiments were carried out on Ultra Sparc 5 machine.
In general obtaining minimal values for all the objectives (area and wirelength etc.) of a multi-objective optimization problem is difficult. For a given floorplanning problem there may exist several optimal floorplans with different area and wirelength combinations. A commonly used metric for assessing floorplan quality is , and found out that our prediction was consistent with that of Wong-Liu floorplanner. We are reporting results with § having a constant value of n t ¨ so that we can compare the two approaches in a straight forward manner.
The results of Simulated Annealing [Wong-Liu] floorplanner, our predictor and constructor are given in Table 4 (we also present the results in form of bar-charts, Figure 13 ). The first column of Table 4 gives the name of the circuit and the number in the parentheses tells the percentage of rigid modules in that circuit. The floorplan metric (
) for the Wong-Liu floorplanner, predictor and constructor is given under the column "cost". "% diff" column gives the percentage by which our predictor and constructor differ from Wong-Liu cost. "Speedup" column gives the factor by which our predictor and constructor are faster than Wong-Liu floorplanner.
In the following subsections we discuss the results associated with our prediction and construction.
The Algorithmic Predictor
In the previous sections we have discussed our algorithm and heuristics to make Partitioning based Floorplanning more effective, the objective being area and wirelength minimization. Our claim is (and the results show) that Partitioning based Floorplanning is a very good and fast predictor of system metrics (area-wirelength) and gets better as the shape management gets easier by ª increased number of soft modules/IPs. ª number of modules increases. So that area management of one module becomes less relevant. The results are given in Table 4 . As is clear from the results the quality of our predictor gets better with an increase in the number and flexibility of modules. It can be observed, in general for both Wong-Liu and Predictor, that the floorplan metric starts worsening with increase in the percentage of rigid modules in the system. However, even with varying percentage of rigid modules in the system, the floorplan metric as predicted by our modified partitioning based approach is almost always within to that by Wong-Liu Simulated Annealing floorplanner. The real gain is evident when we compare the speed. Our predictor is on the average 1000 times faster than Wong-Liu floorplanner.
Extending our Predictor to Constructor
Till now we have concentrated primarily on the prediction of system metrics by fast floorplanning. What is important in floorplanning with eased shape management ? We need partitioning early on and then need to switch to length-area optimization at some point (use annealing for that). We modify our predictor slightly to obtain a floorplan constructor even better than Wong-Liu Simulated Annealing floorplanner. We do Low Temperature Simulated Annealing as the postprocessing on our predictor to obtain proper shape management. The partitioner has already done a good job of reducing wirelength. With large flexibility in the shapes of modules Low Temperature annealing in the end will further improve the floorplan by making it more compact. The results are given in Table 4 . In almost all the cases we achieve a floorplan quality as good as or 5 -10% better than the Wong-Liu floorplanner and at the same time achieving a speedup of 20 on the average.
CONCLUSION AND FUTURE WORK
As the design sizes grow by leaps and bounds, faster prediction of system metrics gains more and more importance. Floorplanning forms the early phase of Physical Design. System metrics at the floorplanning level are a good indicator of the quality of design. Unfortunately the best known floorplanner, Simulated Annealing, is extremely slow. In our work we have proposed an algorithmic predictor of floorplan metrics. Our work falls in the category of constructive prediction where we actually construct fast floorplans to estimate the quality of final floorplan. We have shown that the existing top-down hierarchical floorplanning approach can be modified suitably (through well defined heuristics) to obtain fast floorplans. Our floorplan predictor is almost thousand times faster than the traditional Simulated Annealing floorplanner and the prediction error is within 5 -10 %. The quality of prediction gets better with the increase in number of modules and the flexibility in the representation of shapes of modules.
In this work our primary focus has been the prediction of area/wirelength floorplan metric. Further work in this direction will be to integrate prediction of other metrics like congestion and timing. We also intend to handle prediction under constrained floorplanning when certain module/pin locations are fixed.
