ENISLE: an intuitive heuristic nearly optimal solution for mincut and ratio mincut partitioning by Cheng, Shun-wen & [[alternative]]鄭舜文
ENISLE: An Intuitive Heuristic Nearly Optimal Solution for 
Mincut and Ratio Mincut Partitioning 
Shun- Wen Cheng and Kuo-Hsing Cherzg 
Department of Electrical Engineering, Tamkang University 
Tamsui, Taipei Hsien 25 137, TAIWAN 
Abstract Mincut partitioning is to minimize the total cuts of the 
edges by the partitioning of nodes into two sets. The proposed 
method, Edge-Node Interleaved Sort for Leaching and Envelop 
(ENISLE) algorithm, is not only using node information but also 
using edge information. It is simple, but works effectively, and 
never appears in any earlier literature. It can soon get an intuitive 
heuristic nearly optimal solution for mincut and ratio mincut 
partitioning at the same time, is very suitable for EDA usage. 
Index Term - min-cut partitioning. ratio min-cut partitioning. 
clustering effect, iterative-improvement, radix sort. IC CAD. 
EDA, System-on-a-Chip (SoC), VLSI circuit. 
I. INTRODUCTION 
Circuit partitioning plays a key role in the design auto- 
mation of VLSI chips, multichip systems, and system-on-a-chip 
(SoC). It is used to reduce VLSI chip area, reduce the component 
count and the number of interconnects in multiple FPGA 
implementations of large circuits or systems. Circuit partitioning 
facilitates efficient parallel simulation of circuits, facilitates 
design of tests for digital circuits and reduces timing delays, and 
facilitates the various combinations of sub-system layouts [ 181. 
Iterative-improvement two-way min-cut partitioning is an 
important stage in most circuit placement tools. Recursive two- 
way partitioning is an efficient and popular approach to obtaining 
k-way partitions for k > 2 [5]. [ I  I], [20]. So many papers are 
concern here with the two-way mincut partitioning problem. 
Since two-way mincut partitioning is NP-complete problem [IO], 
a number of approximate schemes have been proposed. These 
include iterative improvement methods [7]. [SI, 1131. [151. [161. 
simulated annealing [ 171 and clustering-based techniques [ 3 ] ,  
[ 1 I]. [20]. Iterative improvement methods are sometimes also 
used as a preprocessing phase for clustering as in [ 191. [2O]. 
Most partitioning techniques for circuit netlists like the 
Fiduccia-Mattheyses (FM) method, computes the gains of nodes 
using local netlist information that is only concerned with the 
immediate improvement in the cutset. This can lead to 
misleading gain information. This matter let us turn back the 
origin of the mincut partitioning problems and solving them by 
global viewpoints. The proposed new method in the paper is a 
totally different method. 
Let a circuit be represented by a hyper-graph or netlist C = 
(V, E)  where V is the set of nodes that represent components of 
the circuit and E the set of hyperedges that represent the nets of 
the circuit. Each hyperedge or net connects two or more nodes 
together; generally the output of a node is connected to the inputs 
of several other nodes by a net. 
Figure 1. The min-cut partitioning. 
Figure 2. The ratio min-cut partitioning. 
The goals of min-cut and ratio min-cut partitioning display 
on the V-E plain are shown in Fig. 1 and Fig. 2. Our goal is 
partitioning, not only clustering [I], [9]. Notice when (V, E)  
pairs approach to the uniformly distribution on the V-E plain. 
and if we can find the min-cut. the “outline areas” of the (V. E)  
pairs will from the product VE approach to VE /2, like vapor 
compression behavior. 
Figure 3. Some special (V. E) distribution cases 
Fig. 3 shows some special (V, E) distribution cases. It may 
often be seen when describe the row-based placements in 
sequence. The “outline areas” of (V, E )  pairs occupied on the 
V-E plain are far less than the VE product. Under this condition. 
the probability that we can get the min-cut solution is relatively 
small. This is unlike vapor compression behavior, is like 
“melting” the material. So first we need “heating” it, adding 
“entropy” to it. And notice these cases ease to be randomized. If 
it exists local cluster blocks, we only need to resolve the blocks. 
And some small blocks will be automatically resolved in the 
proposed method. 
This paper is organized as follows. In Section 2. describe 
the proposed method and demonstrate by examples. In Section 3 
we then reveal the importance of the distributed conditions. 
Finally conclude the major findings and outline the future work. 
V- 167 
0-7803-6685-9/01/$10.0O02001 IEEE 
Authorized licensed use limited to: Tamkang University. Downloaded on March 23,2010 at 23:14:31 EDT from IEEE Xplore.  Restrictions apply. 
0 " 8  
bit32 :l; 
bit31 :I; 
bit30 :I; 
bit2 : I ;  
bit1 :l; 
. . . . . . . 
) radix-sortunit; 
\ x x  
\ x x  
\ Y  x 
\ \  x x 
\ \  -, \ 
Figure 4. The first four steps (phase one) of the ENISLE 
algorithm are shown by this example in detail. 
ail32 
.- 
' C  . 
.- 
. 
Figure 5. Using bit field structure to reduce the memory 
requirement, and accelerate the sort process. 
aph; 
232.311.213.231. I l l ,  LIZ. 131.123,221 
IS 321.231, 111.221 
2s ,231.112.132 
3s  -- 213,123 
311.231.111.221.232. 111. 131.213.123 
IOS -* 111,112. 213 
20s -+ 321.221. 123 
MS , 231. 232. 132 
111.112.213.321,221.123.231.23?. 132 
LOOS 111. 112.123. 132 
200s -I 113.221.231.231 
3OOS 321 
o/plll.  111. 123. 131.113.221.231, '1' ?'I 
Figure 6. Using radi 
handle the sorting job 
ix-sorting technique can effectively 
' of mass numbers. 
11. THE EDGE-NODE INTERLEAVED SORT 
FOR LEACHING AND ENVELOP (ENISLE) 
ALGORITHM 
The proposed method in this work is called Edge-Node 
Interleaved Sort for Leaching and Envelop (ENISLE) algorithm. 
It is briefly shown as the following: 
Algorithm ENISLE 
(* Initialize the V-E plain & randomize W, E) pairs *) 
if ( CLUSTERED-DISTRIBUTION( ) ) then 
endif 
SORT-EDGE( ); (* From Bottom-side *); 
~ SORT-NODE(); (* From Right-side *); 
SORT-EDGE( ); (* From Top-side *); 
SORT-NODEO; (" From Left-side *); 
~ INIT-RECORD-NODE-SETO; 
repeat 
1 
RANDOMIZE(); 
(* Phase One: Basic four steps, Edge interleave I *I 
(a; Phase Two Begins: Additional steps. Edge interleave 2 * 
SORT-NODE(); (* From Right-side *); 
SORT-EDGE( ); (* From Top / Bottom side *); 
break; 
endif 
SORT-NODE( ); (:I. from Left-side *); 
until ( NODE-SET-CHANGEDO ); 
end. 
Notice it can intuitively determine distributed uniformly or 
not by the diagram clearly, not need additional computing aboul 
correlation coefficients or co-variances. If (V, E) pairs are no1 
uniformly distributed on V-E plain, and if we do not randomize il, 
then directly issue the converge procedures, may get a worse cu1 
solution and leave the loop. No non-determined / infinite loops 
occur. In Fig. 4, first four steps of the ENISLE algorithm are 
demonstrated by an example in detail. 
V- 168 
Authorized licensed use limited to: Tamkang University. Downloaded on March 23,2010 at 23:14:31 EDT from IEEE Xplore.  Restrictions apply. 
s x  ;I x x x  
t 
E 
F 
G 
H 
5K 
;+ x x  x x  x x x  
X x x  
x s x  
x x x  x ................................................................................ 
x x x x  s 
x Y 
x x x x  
x x x  
L x  s s  
u x x  x s  
N X  X X  
x x  x x  
s x x  
Initialize the V-E Plain. 
x x  
x x  x x  ................................................................. :I x x  x s x  
H x  Y 
Step 3.4, cut numbers: 8. 
x x x  
x x  x 
x x x  
x x x  x 
x x  
\ \  
\ x  
Step 1, cut numbers: 14. 
v v 
Step 5 ,  cut numbers: 2. 
v 
................................ 
N I \  \ \ 
Step 6, cut numbers: 2. 
Figure 7. It shows the proposed ENISLE algorithm effectively solves the min-cut and the ratio min-cut partitioning at the same time. 
In the ENISLE algorithm, carefully arrange the memory 
requirement is necessary. As shown in Fig. 5, using bit field 
structures reduce to one-eighth-memory space. If it has IOOK 
nodes and 500K edges, the program will need about 6.4GB 
virtual memory space. A powerful sorting engine decides the 
performance of this method -- we need sort very mass numbers! 
Using radix sort [I41 can handle this problem effectively, as 
shown in Fig. 6. If the circuits are more enormous. multilevel 
methods [2] can be considered. 
On a 1280 x 1024 pixels x 24 bits true color display monitor, 
assume 1280 x 16 bits edges / 1024 x8 bits nodes = 20480 edges 
I 8192 nodes per screen, or 1024x24 bits edges I 1280 bits nodes 
= 24576 edges I 1280 nodes per screen. We can scroll the screen, 
as scroll the spreadsheets. It can directly observe every iterative 
improvement, get useful information, or decide to manually halt 
the procedures or not, if necessary. This is very suitable for IC 
EDA industrial certain cut constraints under non-uniformly 
distributed cases. 
In Fig. 7, this example demonstrates the cut interchange 
conditions in every step. The ENISLE algorithm effectively 
solves the min-cut partitioning and the ratio min-cut partitioning 
at the same time. Fig. 8. shows another successful example. 
V- 169 
Authorized licensed use limited to: Tamkang University. Downloaded on March 23,2010 at 23:14:31 EDT from IEEE Xplore.  Restrictions apply. 
Figure 8. Another successful example. 
Figure 9. Non-uniformly distributed condition. 
HI. THE QUALITATIVE ANALYSIS OF (V, E) PAIRS 
DISTFUBUTED CONDITIONS 
In Fig. 9, (V, E) pairs are not uniformly distributed on V-E 
plain. And if we do not randomize it, then directly issue the 
convergent procedure. It shows that we cannot get the optimal 
solution (two cuts), but get a nearly optimal solution (three cuts). 
It can intuitively determine distributed uniformly or not by this 
diagram clearly. 
As mentioned in Fig. 3, it shows (V, E) pairs non-uniformly 
distribution lead the probability that can get the min-cut solution 
is relatively small. The work finds the relationship between cut 
numbers and initial (V, E) pairs distributed conditiodentropy. 
The relationship is a very important issue. It is shown as Fig. 10. 
Distribution 
j - c u t  b -cu t  M i n - c u t  
Figure 10. The relationship between cut numbers and 
initial (V, E) pairs distributed condition/ entropy. Notice 
the cut number j > k > min-cut, k is second optimal cut, 
and j is third optimal cut. The higher initial potential, the 
more probability aims the min-cut. 
Iv.  CONCLUSION AND FURTHER ESEARCH 
Due to the proposed new method ENISLE is different with 
any other min-cut partitioning methods, not improves or modifies 
other min-cut partitioning methods. So this paper does not 
concentrate on the comparisons with them. mainly focus on the 
demonstration of the proposed new method. We will completely 
finish further into the work. 
The paper indicates that the proposed ENISLE method can 
effectively solve the mincut partitioning and the ratio mincut 
partitioning at the same time by global viewpoints. It is not only 
using node information but also edge information. Hundreds of 
netlists experiments have ever been processed and found if we 
can let (V, E) pairs approach to uniformly distribution on the V-E 
plain, it can soon get the optimal solution, no more NPC problem. 
If we can't, or just require certain cut constraint, not min-cut, the 
proposed method can provide an intuitive heuristic nearly 
optimal solution. It is very suitable for EDA industrial usage. 
V. REFERENCES 
S. B. Akers, "Clustering Techniques for VLSI." in Proc. LE€€ Irit. 
Syrnp. oil Circuits arid Svsrcrirs. 1982. pp. 472476. 
C. J. Alpert, J.-H. Huang. and A. B.  Kahng. "Multilevel circuit 
partitioning." in Proc. Desigrr Arrtorrirrtiort Cor$. 1997. pp. S30- 
533. 
C. J. Alpert and S.-Z. Yao. "Spectral partitioning: The niore eigen- 
vectors the better." in Proc. IEEE/ACM Dcsigrr Airtormtiori Corlf:. 
J. Cong et al.. "Large scale circuit partitioning with loose/stable net 
removal and signal flow based clustering." in Pine. IEEE/ACM I r i r .  
Cant Coinputer-Aided Desigrr. Nov. 1997. pp. 441-4466. 
J. Cong and S. K. Lini. "Multiway Partitioning with Painvise 
Movement." in Proc. IEEE/ACM Irrr. Cor$ Corrrprrrci.-Aiiic~rl 
Dcsigri. 1998. pp. 512-516. 
S. Dutt and W. Deng. "A probability-based approach to VLSI 
circuit partitioning." in Pruc. I€€€/ACM Drsigii Airtorrrtrfiori Cor$. 
June 1996. Best-Paper Award. pp. 100-10.5. 
S. Dutt. "New faster Kemighan-Lin-type graph-partitioning algo- 
rithms." in Prm.  I€EE/ACM Irit. Cor!$ Curii~~rtrcr-Aitlc~rcl Desigri. 
Nov. 1993. 
C. M. Fiduccia and R. M. Mattheyses. "A linear-time heuristic for 
improving network partitions." in Proc. IEEE/ACkf l%/i Dcsigrf 
Airroirrotiorr Cor$. 1981. pp. 17S-ISl. 
J. Garbers. H. J. Proniel. and A. Steger. "Finding clusters in VLSI 
circuits." in P roc. IE€E/ACM Iii t .  CurIf Coriiplrtcr.-Airl~.II Dc~sigrr. 
1990. pp. 5 2 0 - 5 3 .  
199.5, pp. I9S-?OO. 
.. 
[ I O ]  M. R. Garey and D. S .  Johnson. C ~ l l l / J l l t C ~ . S  m i l  Iii tr~icml~ili t~.  San
Francisco. CA: W. H. Freeman. pp. 209-210. 
[ I  I ]  L. Hagen and A. Kahng. "Fast spectral methods for ratio cut 
partitioning and clustering." in P r w .  I€EE/ACM brr. CorIf 
C~~r i i~~ ic t i~ r~ -~ i t l r~c /  D L ' s ~ I I .  1991. pp. 10-13. 
[ I ? ]  M. A. B. Jackson. A. Srinivasan. and E. S. Kuh. "A fast algorithni 
for perfomiance driven placenient." in Pine. IEE€/ACM Irir. Coiif: 
Corri~~irtc~r~-Aitk~cl Dcsigrr. 1990. pp. 328-33 1 .  
[I11 B.  W. Kernighan and S. Lin. "An efficient heuristic procedure for 
partitioning graphs." Bell Swori i  Tcclr. Jorri~iiiil, vol. 49. pp. 29 1- 
307. Feb. 1970. 
[ 141 D. E. Knuth. Sorriirs trrrtl Si2trrc/riri~. Addison-Wesley. 1973. 
[ 151 B. Krishnamurthy. "An iniproved nin-cut algorithni for partitioning 
VLSI networks." I€€€ Tmris. Cor,iprtc~r.s. vol. C-33. pp. 438446. 
May 1984, 
[ 161 Y. G. Saab. "A fast and robust network bisection algorithni." IEEE 
Tr.rrri.s. Corirp~r/cr.c.. pp. 903-913. 1995. 
[ 171 C. Sechen. VLSI Ploceriierir trritl Globrrl Rorrririg Usiii<q S i i i i i i / f i r d  
Ariirc(i/iiig. B. V. Deventer. Ed. kiisterdani. Netherlands: Kluwer. 
[IS] N. A. Shenvani. Algorirhrrrsfor VLSI Physictrl Dcsigrr Aiiroii i( iriori .  
3rd €d. Boston. MA: Kluwer. 1999. 
[I91 Y. C. Wei and C. K. Cheng. "An iniproved two-way partitioning 
algorithni with stable perfomiance." I€€€ Trcrris. Corripiter-Aided 
Dcsipr. pp. 1502-1S1 I .  1990. 
[20] Y. C. Wei and C. K. Cheng. 'Toward efficient hierarchical designs 
by ratio cut partitioning.'. in Pruc. Iri t .  Cor$ Cori i~~r i f~~r-Ai i l i~~l  
Drsigri. 1989. pp. 298-301. 
V- 170 
Authorized licensed use limited to: Tamkang University. Downloaded on March 23,2010 at 23:14:31 EDT from IEEE Xplore.  Restrictions apply. 
