




1.1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2
1.2 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
2 6
2.1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
2.2 LSI . . . . . . . . . . . . . . . . . 8
2.3 . . . . . . . . . . 12
2.3.1 . . . . . . . 12
2.3.2 FU FU . 16
2.4 . . . . . . . . . . . . . . . . . . . . . . . 17
2.4.1 DFG . . . . . . . . . 17
2.4.2 . . . . . 17
2.4.3
. . . . . . . . . . . . . . . . . . . . . . . . . . 18
2.5 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 20
3 21
3.1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 22
3.2 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23
3.3 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 25
3.4 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 26
3.4.1 . . . . . . . . . . . . . . . . . . . 26
3.4.2 . . . . . . . . . . . . . . . . . . . . . . 26
3.4.3 /FU . . . . . . . . . . . . . . . . . 26
3.4.4 . . . . . . . . . . . . . . . . . . . . . . . . . . . 28
3.5 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 29
4 30
4.1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 31
i
4.2 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 32
4.3 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 34
4.4 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 38













1.1 15% [14] 90[nm]

















































RT [ 2.2] 2
C C
















   x = a + b;
   y = b – c;
   s = x + y + d;


























































4. u1 u2 CS(Control Step)
1 4 u1 u2 (yield-equivalent)
1 2 u1 u2





















































κ = 0.8 1 0.7 2 1.0
1 1.0 2 1















2. ∆area cost/∆performance improvement resource sharing








































































0 CLKlower clk (i)
CLKlower ≤ clk (i) ≤ CLKupper (2.7)
j FUj 0 DelayjMin
DelayjMax clk (i) FUj S (clk (i) , FUj)
S (clk (i) , FUj) = dDelayjMax ÷ clk (i)e × clk (i)−DelayjMax (2.8)
18
27 E (clk (i) , FUj)
E (clk (i) , FUj) = dDelayjMax ÷ clk (i)e − dDelayjMin ÷ clk (i)e+ 1
DFG j numj clk (i)
AV Eslack (i)
AV Eslack (i) =
∑
numj × S (clk (i) , FUj)∑
numj
AV Eevent (i)
AV Eevent (i) =
∑
numj × E (clk (i) , FUj)∑
numj
clk (i) clksuitability (clk (i))
clksuitability (clk (i)) =
1





















0 ≤ Dj(t) ≤ 1,∀t ≥ 0 (3.1)
∞∑
t=0
Dj(t) = 1 (3.2)
Dj(t) = 0,∀t ≤ 0 (3.3)














































/FU CVLS(Condition Vector List Schedul-
ing) [19, 20] FU























3x = a + b – c + d;
if(a != 0)
y = x + c;
else if(a + b < c)
Conditionals: a != 0 ―(2)
a + b < c ―(4)
y = c + d;



































(b) CV (b) (3) (4) (3) (5) (3) (6)
FUVf (k) k FUf
CV 3 CV (a)
(3) (5) ADD1 FUVADD1(3) = [1, 1, 0]















C++ CPU Intel Xeon
3.4GHz 4GB OS Debian/Sarge 16bit




0.04[ns] 0.850 0.900 0.950
0.50[ns]
• DCT






EWF3 EWF 3 102
• FIR
FIR X (n) Y (n)





































[2, 13, 9] 0.85 16.7%




















clk[ns] exe step clk CS
( ) [ns] [ns]
DCT(+3,*3) 0.50 +:3 *:5 0.61 38 23.18
[2] 0.50 +:1 *:2 1.74 16 27.84
[13] 0.50 +:1 *:2 1.74 16 27.84
[9] 0.50 +:1 *:2 1.74 16 27.84
EWF(+3,*2) 0.50 +:1 *:2 1.55 18 27.90
[2] 0.50 +:1 *:2 1.72 18 30.96
[13] 0.50 +:1 *:2 1.72 18 30.96
[9] 0.50 +:1 *:2 1.72 18 30.96
EWF3(+3,*2) 0.50 +:1 *:2 1.63 52 84.76
[2] 0.50 +:1 *:2 1.74 52 90.48
[13] 0.50 +:1 *:2 1.74 52 90.48
[9] 0.50 +:1 *:2 1.74 52 90.48
FIR(+4,*4) 0.50 +:2 *:3 1.00 45 45.00
[2] 0.50 +:1 *:2 1.69 31 50.70
[13] 0.50 +:1 *:2 1.69 31 50.70
[9] 0.50 +:1 *:2 1.69 31 50.70
4.2: ( 0.90)
clk[ns] exe step clk CS
( ) [ns] [ns]
DCT(+3,*3) 0.50 +:3 *:5 0.61 38 23.18
[2] 0.50 +:1 *:2 1.74 16 27.84
[13] 0.50 +:1 *:2 1.74 16 27.84
[9] 0.50 +:1 *:2 1.74 16 27.84
EWF(+3,*2) 0.50 +:1 *:2 1.56 18 28.08
[2] 0.50 +:1 *:2 1.72 18 30.96
[13] 0.50 +:1 *:2 1.72 18 30.96
[9] 0.50 +:1 *:2 1.72 18 30.96
EWF3(+3,*2) 0.50 +:1 *:2 1.64 52 85.28
[2] 0.50 +:1 *:2 1.74 52 90.48
[13] 0.50 +:1 *:2 1.74 52 90.48
[9] 0.50 +:1 *:2 1.74 52 90.48
FIR(+4,*4) 0.50 +:2 *:3 1.00 45 45.00
[2] 0.50 +:1 *:2 1.69 31 50.70
[13] 0.50 +:1 *:2 1.69 31 50.70
[9] 0.50 +:1 *:2 1.69 31 50.70
36
44.3: ( 0.95)
clk[ns] exe step clk CS
( ) [ns] [ns]
DCT(+3,*3) 0.50 +:3 *:5 0.62 38 23.56
[2] 0.50 +:1 *:2 1.74 16 27.84
[13] 0.50 +:1 *:2 1.74 16 27.84
[9] 0.50 +:1 *:2 1.74 16 27.84
EWF(+3,*2) 0.50 +:1 *:2 1.57 18 28.26
[2] 0.50 +:1 *:2 1.72 18 30.96
[13] 0.50 +:1 *:2 1.72 18 30.96
[9] 0.50 +:1 *:2 1.72 18 30.96
EWF3(+3,*2) 0.50 +:1 *:2 1.65 52 85.80
[2] 0.50 +:1 *:2 1.74 52 90.48
[13] 0.50 +:1 *:2 1.74 52 90.48
[9] 0.50 +:1 *:2 1.74 52 90.48
FIR(+4,*4) 0.50 +:2 *:3 1.01 45 45.45
[2] 0.50 +:1 *:2 1.69 31 50.70
[13] 0.50 +:1 *:2 1.69 31 50.70























0.85 16.7% 11.0% 0.90 16.7%





















[1] K. Bernstein, D. J. Frank, A. E. Gattiker, W. Haensch, B. L. Ji, S. R. Nassif, E.
J. Nowak, D. J. Pearson and N. J. Rohrer, “High performance CMOS variability in
the 65nm regime and beyond,” in Electron Devices Meeting, 2007. IEDM 2007. IEEE
International, pp. 569–571, 2007.
[2] E.-S. Chang and D. D. Gajski, “An optimal clock period selection method based on slack
minimization criteria,” in Proc ACM Transactions on Design Automation of Electronic
Systems (TODAES), pp. 352–370, 1996.
[3] D. Chen and J. Cong, “Register binding and port assignment for multiplexer optimiza-
tion,” in Proc. ASP-DAC 2004, pp. 68–73, 2004.
[4] J. Cong and J. Xu, “Simultaneous FU and register binding based on network flow
method,” in Proc. DATE 2008, March 2008.
[5] , , , , ,“
,” , VLD2007–119, pp. 7–12,
2008.
[6] , , , , ,“
,” , VLD2008–
84, pp. 145–150, 2008.
[7] D.D Gajski, N.D Dutt, A. C-H Wu and S. Y-L Lin, High-level synthesis: Introduction
to chip and System Design, Kluwer Akademic Publishers, 1992.
[8] S.-H. Huan, C.-H. Cheng, Y.-T. Nieh and W.-C. Yu, “Register binding for clock period
minimization,” in Proc. Design Automation Conference, 2006 43rd ACM/IEEE, pp.
439–444, 2006.
[9] W.-L. Hung, X. Wu and Y. Xie, “Guaranteeing performance yield in high-level syn-
thesis,” in Proc. Computer-Aided Design, 2006. ICCAD ’06. IEEE/ACM International
Conference on, pp. 303–309, 2006.
[10] J. Jung and T. Kim, “Timing variation-aware high-level synthesis,” in Proc. Inter-
national Conference on Computer Aided Design, Proceedings of the 2007 IEEE/ACM
international conference on Computer-aided design, pp. 424–428, 2007.
44
[11] H.-F. Jyu and S. Malik, “Statistical timing optimization of combinational logic circuits,”
in Proc. of Int’l Conf. on Computer Design, pp. 77–80, 1993.
[12] T. Kim and X. Liu, “Compatibility path based binding algorithm for interconnect re-
duction in high level synthesis,” in Proc. ICCAD 2007, pp. 435–441, 2007.
[13] J. Ramanujam, S. Deshpande, J. Hong, and M. Kandemir, “A heuristic for clock selec-
tion in high-level synthesis,” ASP-DAC/VLSI Design 2002, pp. 414–419, 2002.
[14] N. J. Rohrer, “Introduction to statistical variation and techniques for design optimiza-
tion,” in ISSCC Tutorial, 2002.
[15] S. Tongsima, C. Chantrapornchai, E. H.-M. Sha and N. S. Passos, “Probabilistic re-
timing: A circuit optimization technique,” in Proc. Technical Report 96–15, Dept. of
Computer Science & Engineering, Univ. of Notre Dame, 1996.
[16] S. Tosun, N. Mansouri, E. Arvas, M. Kandemir and Y. Xie, “Reliability-Centric High-
Level Synthesis,” in Proc. Design Automation and Test in Europe (DATE’2005) Volume
2, pp. 1258–1263, 2005.
[17] , , , “
,” . DSP, , pp. 101–108,
1998.
[18] , , “ ,”
. FTS, , pp. 109–116, 1997.
[19] K. Wakabayashi and T. Yoshimura, “A resource sharing and control synthesis method
for conditional branches,” in Proc. ICCAD 1989, pp. 62–65, 1989.
[20] K. Wakabayashi and H. Tanaka, “Global scheduling independent of control dependencies
based on condition vectors,” in Proc. 29th ACM/IEEE DAC 1992, pp. 112–115, 1992.
[21] F. Wang, Y. Xie and A. Takach, “Variation-aware resource sharing and binding in
behavioral synthesis,” in Proc. ASP-DAC 2009, pp. 79–84, 2009.
[22] K.-C. Wu, C.-T. Hsieh and S.-C. Chang, “Delay variation tolerance for domino circuits,”
in Proc. Design Automation 2006, pp. 24–27, 2006.
45
