Traditional placement problems are studied under a fully specified cell library and a complete netlist. However, in the first, e.g., 2 years of a 2-3 year microprocessor design cycle, the detailed netlist is unavailable. For area and performance estimation, layout must nevertheless be done with incomplete information. Another source of incompleteness comes from reuse of instances from earlier design generations; these instances and their parameters will change as the project evolves. The problem of placement with incomplete data (PID) can be abstracted as having to place a circuit when pc% of the cells and p,% of the nets are missing. The key challenge in PID is how to add missing cells and nets.
having to place a circuit when pc% of the cells and p,% of the nets are missing. The key challenge in PID is how to add missing cells and nets.
In this paper, two "patching-methods" for adding missing nets and cells are proposed. The methods are called abstraction and fusion.
Experimental results are very interesting and illurstrative. First, they show that PID is a difficult problem and an arbitrary (and perhaps intuitively sound) method may not produce high-quality results. Experiments verify that the abstraction method is a very good predictor and that fusion is not because circuits produced by abstraction attain much of the properties of the origianl circuits. Summary Table 3 in Section 4 shows that when a circuit has 10% incompleteness, abstraction can predict the final total wirelength with an error of 5.8%, while fusion has a 67.8% error in predicting the wirelength in the same circuit.
Introduction
The placement problem has been an interesting and challenging problem for many years. Recently, performancedriven, issues related to deep submicron, reliability and many more factors make the placement problem even more challenging.
Permission to make digital/hard copy of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage, the copyright notice, the title of the publication and its date appear, and notice is given that copying is by permission of ACM, Inc. To copy otherwise, to republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. A constructive placement method that employs resistive networks as a working domain was proposed in 141. In [9] , there is a comparison between linear and quadratic cost functions. The linear cost function used in the GordianL placement tool achieves results with up to 20% less area than the quadratic cost function of the original Gordian procedure. Therefore, in this paper, we will use the halfperimeter cost function for area minimization.
Here we study the placement problem with incomplete data (PID). This problem is of fundamental importance since VLSI design is getting more and more complicated. A typical VLSI design will take 2 to 3 years to complete. In the first 2 years, a detailed netlist is not availahlp. However, for area estimation a,nd performance estimation, several placement runs are do:oe with such incomplete information. Traditional placement problems are based on a fully specified cell library and a complete netlist. Obviously, this cannot handle this problem. Another source of incompleteness will come from reuse of instances from earlier generations. The hardware instances are modified to some extent and more information may be added as the project moves forward. So even if we do have a complete cell library arid netlist, this cell library and netlist will not be the same library and netlist as in the final version of the design. The designer will surely want to know how much he or she can trust the placement result. PID is the problem of trying to place the final design while only a part of the netlist and a fraction of the cell library is known.
In our experiments, we u:ie MCNC benchmarks. We randomly delete p,% of the nets and p,% of the cells. We then try to re-construct the netlist using different "patching-methods'' . We compare the wire-length of the original circuit against the wire-length of the "patched" circuit as a basis to judge the quality of the proposed patching-methods. Experiments on different values of pc and pn and different benchmarks have been performed. This paper is organized as follows: In Section 2 we formally described the placement with incompleteness problem followed by a detailed summary of approaches and implementations in Section 3. In Section 4, experimental results will be shown and discussed: followed hv conchisions in Section 5.
Problem Formulation
The best way is to study PID problem is starting from a complete circuit. This circuit, including the complete netlist and the cell library, will be no doubt the only fair metric to measure how good any kind of PID algorithm is. A good algorithm will give a good prediction on area or performance using only part of the original circuit information.
Starting from a complete circuit C, we can build an incomplete circuit C, simply by deleting or modifying cells and nets from the original circuit C. There are two numbers to quantitively measure the incompleteness, p , and p,. We denote by pc the percentage of cells we deleted or modified from C to get C,. Similarly, pn is the percentage of nets we deleted or modified from C to get C,. Now we are ready to describe the placement wzth zncompleteness problem formally:
is formed by deleting a number of cells and nets from C ( M , N ) . Two real numbers pc and pn satisfy the following conditions:
We are going to make predictions on issues like final chip area, power consumption, etc., of the circuit
C ( M , N ) based only on the circuit C,(M,,N,) and p* rameters p c , pn. These predictions will be compared to the actual value of the circuit C ( M , N ) .
This above description concludes the formulation of the placement problem with incomplete data. There will be numerous ways to obtain such predictions for circuit C. In this paper, we are actually trying to "reconstruct" the circuit C from the incomplete circuit C,. Specifically, we are going to find a set of modules M a and a se6 of nets Na. We use Ma and Nal together with Ma and N,, Figure 2 shows the result of C, . In this case, pc = 25% and pn = 20% since only 2 nets are gone. We can add two new dummy cells, N 1 and N 2 and some dummy nets to get circuit C'. Figure 3 shows a possible way to get C'. Of course, most probably, C' will not be exactly the same as the original circuit C. Our hope is that by using a "c1ever"way to get C', the placement result of C' will be very close to the placement result of C.
MS M7
~ Figure 1 : Original circuit C.
In the next section, we will show our approaches to find Ma and Na.
Note that in this paper, we only try to make a prediction on the total net lengths, which is consistant with the final chip area. "Half perimeter bounding box" is what we will use to measure the length of each net. 
Our Approach and Implementation
Following the discussion in the previous section, sets M a and Na should be "similx" to sets M , and N, in circuit C, . The question is, what does "similar" mean here.
Our interpretation is that they have similar statistics. For example, two module sets should have similar size distribution curves (to be discussed later).
There are a lot of possible statistics which we can use. Among these, we chose three of them to be the statistics for the module set. They are:
1. The size distribution of cells. There is no question that the size of cells is the most important information in area estimation. Besides this, the number of terminals on each cell is also important since it determines the connectivity of each cell. The larger this number is, the larger the total final area will be. An appropriate 1/0 type distribution of tciiniiials will assure a netlist similar to the original circuit. Without this 1/0 type distribution, any pair of terminals can be hooked up, leaving too much freedom t o deal with The above is a brief description of these statistics and reasons why they are important to us. Based on these three statistics, we can write the psudo-code for finding missing cells, set Ma as following:
1. AddCell(circuit Ci, p,) {
2.
4.
5.
6.
7.
8. Determine each terminal's type.
In
Step 6, 7 and 8, we make use of those three statistics created in step 2.
After this procedure is done. we have added a Set of cells M a to the circuit C'. These newly added cells are isolated since no nets are connected to them. The next step is to add some nets to the incomplete circuit C,. We will refer to all the existing nets already in Ci as "old nets" and all the nets we will add later as "new nets". Similarly, all the cells previously existing in C, are called "old cells" and all the cells added by Addcell() are called "new cells".
Get these three statistics discussed above.
for i = 1 to n,
3.

TZC = p c * lMil
Create a cell data structure. Determine the size of this cell. Determine number of terminals on this cell
Things are not so easy when finding the set of new nets, Na. The reason for this is because the set of nets is more like a graph, Similarity for graphs is very hard to define.
In the actual implementation, we use only one statistic for the set of nets, the number of terminals in each net. It is not clear what other statistics are also good, so we stick to this one for now. Even after deciding to use only one statistic, it is still difficult to get the whole missing netlist, the set Na.
The first thing we need to know is the number of nets to be added. This can be found by using the input parameter pn.
After that, a natural scheme will be similar to what we did on adding new cells: Look at all the cells in the circuit, including those "old" cells in Ci and those "new" cells added by Addcell(), and create a list C of all the terminals unattached to any nets. Initially, this list C will include all terminals from all new cells and those unattached terminals in the old cells.
As the example shown in Figure 3 , this list C will contain two pins in N1, two pins in N2 and the input pin in M8.
After the list C is created. for each t o he added new net, first determine the number of terminals in it using the distribution function. Then assign terminals from the list C to this new net and delete those assigned terminals from the list C. Repeat this procedure on the next to be added new net.
One of these following three cases will happen during this procedure.
1. List C is empty after all nets have been added.
Then we have finished our reconstruction procedure on the circuit C'.
2.
There are no terminals left in list C when adding a new net. There is nothing we can do in this situation unless we add some new cells. In this case, we will just stop adding new nets. This fact suggests that some terminals in newly added cells should be assigned to an old net as well. That is why the number of terminals in list C is almost always larger than the total number of terminals in all new nets.
Based on this analysis, if case 3 happens, for each terminal left in the list C, we will randomly pick either an old net or a new net and assign this terminal to it. We will refer to this patch method as fusion. However, fusion is not the only way to accomplish such a task. We have an alternative patching method, abstraction, described as the following: if case 3 happens, instead of assigning the remaining terminals to an existing net as in fusion, we will keep adding new nets and assign these terminals to these new nets until the list C becomes empty.
Since most terminals initially in list C are just terminals in new cells, in abstractron, all new cells are interconnected and rarely is there a connection between new cells and old cells.
We briefly summarize the difference between fusion and abstraction as in the following: In fusion, new cells are attached to both new nets and old nets, so they are connected to both old cells and new cells; while in abstraction, new cells are only attached to new nets, new cells form kind of a cluster since most of their nets are internal nets.
Another difference is that the circuit C' created by abstraction will have more nets than its counterpart created by fusion.
Intuitively, fusion makes much more sense than abstraction, since it appears that fusion will generate a circuit more like the original one, circuit C, while abstraction will almost never look like C.
Contrary to people's intuition, experimental results show that abstraction offers imuch more accurate prediction than fusion does. In the next section, we will present this interesting experimental result, followed by an explanation.
Experimental Results and Explanation
Fusion and abstraction were implemented using C++ All experiments were run on s Sun Sparc-20 workstation. Three MCNC benchmark standard cell circuits were used for experiments. Table 1 shciws some basic statistics for these three benchmark circuits. We delete all the 1/0 pads from all three of these circuits, allowing us to concentrate on the effects of incomplete data on core cells. As shown in Table 1 For each benchmark circuit C(M,12/), for a specified degree of incompleteness p , We first delete p , !.Ut cells and the nets attached to them to form C, . Then we run our two reconstruction procedures, fusion and abstraction, to get a "reconstructed" circuit C' (fusion and abstraction were discussed in detail in Section 3). After that, we use the NRG placement tool to get the total wirelength of C' and compare this value with the value of the original circuit C. Table 2 shows the total wirelength results achieved from the reconstructed circuit Nprimaryl'. For each degree of incompleteness, column "WL" shows the average total wirelength over all trials; column "dev." shows the standard deviation of all trials, and "%err" shows the percent error compare to the real value from the original complete circuit Nprimaryl.
It's clear that fusion has a much bigger error than abstraction for any percentage of incompleteness.
Since this is a non-deterministic procedure, standard deviation is another important factor to look at. However, there is not a clear pattern in this figure. All the data points are spread in the range between 0.7% and 1.7%. Standard deviation does not increase as the percentage of incompleteness increases. This is determined by the nature of fusion and abstraction. Since fusion and abstraction rely heavily on the statistics from the incomplete circuit Ci, the increase of incompleteness really does not affect the degree of randomness in reconstructing circuit C'. That is why the standard deviation does not increase along with the percentage of incompleteness.
Similar results for Nprimary2 and Nstruct were obtained from the experiments. Table 3 is a summary table of these three circuits. All the percent errors in Table 3 are the average percent errors among these three circuits.
From experimental results shown above, we can conclude that abstraction's prediction is much more accurate than fuszon's. That is very surprising and counterintuitive.
The reason why abstraction is better than fusion is because fusion mantains the original properties of the circuit.
A big conclusion from this is that the netlist topology is very important. It will have a huge effect on the total wirelength. In this particular placement with incomplete data problem, we should not try to change the circuit topology of existing netlist. We formally proposed a new problem in placement area, placement with incomplete data. This problem arises along with the increasing complexity of current VLSI designs. It will become more and more important in the future. We tried two approaches: fuszon and abstractaon as described in detail in Section 2 . \iVe wdnt to see how good they can predict the final wirelength/ chip area using only an incomplete netlist and library.
Experimental results in Section 4 show that abstraction is a much better patching method than fusion. With 10% incompleteness in the circuit, abstraction can still predict the wirelength with an error of 5.8%; while fusion can have an error more than 50% even with a 5% incompleteness in the circuit. the netlist topology is essential to the final wirelength. In order to get a good prediction, we should try to preserve the original topology as much as possible.
Comparisons between fusion and abstraction shows that
