Studies on register allocation for instruction-level parallel processors by 片岡 正樹





2.1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
2.2 . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10
3 13
3.1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13
3.1.1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13
3.1.2 . . . . . . . . . . . . . . . . . . . . . . . . 14
3.2 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15
3.2.1 SSA . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15
3.2.2 GPDG . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16
3.2.3 GPDG . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 17
3.2.4 . . . . . . . . . . . . . . . . . . . . . . . 19
3.2.5 . . . . . . . . . . . . . . . . . . . . 19
4 23
4.1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23
4.2 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 24
4.2.1 . . . . . . . . . . . . . . . . . . . . . . . . . . 25
4.2.2 . . . . . . . . . . . . . . . 27
4.3 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 29
4.3.1 GPDG . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 31
4.3.2 . . . . . . . . . . . . . . . . . . . . . . . 41
4.3.3 . . . . . . . . . . . . . . . . . . . . . . . . . . 44
i
4.3.4 . . . . . . . . . . . . . . . . . . . . . . . . . 50
4.4 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 55
4.5 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 59
4.5.1 . . . . . . . . . . . . . . 59
4.5.2 Integrated Code-Scheduling . . . . . . . . . . . . . . . . . . . . . . 64
4.5.3 . . . . . . . . . . . . . . . . . . . 65
4.6 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 66
5 67
5.1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 67
5.2 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 68
5.2.1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . 69
5.2.2 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 69
5.2.3 . . . . . . . . . . . . . . . . . . . . . . . . . . . . 70
5.3 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 71
5.3.1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 72
5.3.2 . . . . . . . . . . . . . . . . . . . 77
5.3.3 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 80
5.3.4 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 83
5.4 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 91
5.4.1 . . . . . . . . . . . . . . . . . . . . . . . . . 92
5.4.2 . . . . . . . . . . . . . . . . . . . . . . . . . 94
5.5 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 96
5.5.1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 96
5.5.2 Optimistic Coalescing . . . . . . . . . . . . . . . . . . . . . . . . . 97
5.5.3 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 98
5.5.4 . . . . . . . . . . . . . . . . . . . . . . . . . . 99





2.1 5 . . . . . . . . . . . . . . . . . . . . . . . 9
2.2 . . . . . . . . . . . . . . . . . . . . . . . . . 12
3.1 SSA . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16
3.2 GPDG . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
3.3 . . . . . . . . . . . . . . . . . . . . . . . . 20
4.1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . 26
4.2 . . . . . . . . . . . . . 28
4.3 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 30
4.4 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 37
4.5 GPDG . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 40
4.6 . . . . . . . . . . . . . . . . . . . . . . . . . . . 43
4.7 1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 46
4.8 2 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 47
4.9 . . . . . . . . . . . . . . 49
4.10 . . . . . . . . . . . . . 52
4.11 GPDG . . . . . . . . . . . . . . 53
4.12 8 . . . . . . . . . . . . . . . . . . 56
4.13 32 . . . . . . . . . . . . . . . . . 57
4.14 32 . . . . . . . . . . . . . . . . . 58
4.15 . . . . . . . . . 61
4.16 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 62
4.17 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 63
4.18 Integrated Code-Scheduling . . . . . . . . . . . . . . . . . . . . . . 64
5.1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 71
iii
5.2 . . . . . . . . . . . . . . . . . . . . . . . . 74
5.3 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 75
5.4 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 76
5.5 . . . . . . . . . . . . . . . . . . . . . . 79
5.6 . . . . . . . . . . . . . . . . . . . . . . . . 80
5.7 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 83
5.8 . . . . . . . . . . . . . . . . . . . . . . . . . . . 86
5.9 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 87
5.10 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 90
5.11 . . . . . . . . . . . . . . . . . . . . . . 93
5.12 1000 . . . . . . . . . . . . . . . . . . . . . . 95
5.13 . . . . . . . . . . . . . . . . . . . . . . 100
iv
4.1 8 . . . . . . . . . . . . . . . . . . . . . 56
4.2 32 . . . . . . . . . . . . . . . . . . . . 57
4.3 32 . . . . . . . . . . . . . . . . . . . . 58
5.1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 75
5.2 ( ) . . . . . . . . . . . . . . . . . . . 92






























• (true dependency) a X b X
b a 2.1 S2
load A[i] S1 store A[i]
• (anti-dependency) a X b X
b a
2.1 S3 store A[i] S2 load A[i]
• (output dependency) a X b
X b a
2.1 S3 store A[i] S1 store A[i]
8 2
• (input dependency) a X b X
b a 2.1
S5 load B[i] S1 load B[i]



































3. VLIW (Very Long Instruction Word)
SISD (b) PC 1
1 1
1
4. SIMD (Single Instruction / Multiple Data)
2.2. 11
1



































データフロー MIMD スーパースカラ/ VLIW SISD
： 命令






































1. 1 ( a)








while (i < N) {
b = B[i];
tmp1 = y+2;











while (i1 =φ(i0, i2), i1 < N) {
b0 = B[i1];
tmp1 = y+2;












if-else b b1, b2
(b1, b2)














(cc1)i2 = i1+1(cc1)tmp1 = y+2(cc1)b0 = B[i1]
(cc1)cc2 = b0<tmp1
(*cc1) i1 =φ(i0, i2)


































































































































2 a, b 2 c = a+b, d =





c = a+ b d = a− b a, b c








































dominate) A 2 B
C C B B




































X Y dominance frontier
CFG succ[n] n idom[n] n
child[n] idom[k] = n k
x dominance frontier (DF [x])
DF [x] = {y ∈ succ[x] | idom[y] 6= x} ∪ ⋃
z∈child[x]
{y ∈ DF [z] | idom[y] 6= x}

















x1, x2, x3, . . .
dominance frontier









































































































































































































































































































































































2 4 2 4
gcc bc-optab.c 32 21 30 21
voltex tree0.c 16 7 11 7
twolf clean.c 27 17 26 17
























2 4 2 4
gcc bc-optab.c 28 19 28 19
voltex tree0.c 11 6 11 6
twolf clean.c 26 17 26 17
























2 4 2 4
gcc bc-optab.c 14 12 12 11
voltex tree0.c 10 9 9 9
twolf clean.c 11 11 11 11
























































































[8] [9] (DAG: Directed

































































n CFG in[n] n
out[n] n pred[n]
n CFG 1 def [n] n
use[n] n n
in[n] = out[n] = n in[n] out[n]
1
5.3. 73
CFG (in out ) CFG
(out in )
in out












1:      A=E+E
2:      A=A*B
3:      C=A/E
4:      F=1
5:      D=C+A
6: LOOP:D=D+F
7:      F=F+1
8:      IF F<E GOTO LOOP
9:      D=E/D
5.3:
5.1:
0 1 2 3
def use in out in out in out in out
1 A E BE ABE BE ABE BE ABE
2 A AB ABE AE ABE AE ABE AE
3 C AE AE ACE AE ACE AE ACE
4 F ACE ACEF ACE ACEF ACE ACEF
5 D AC ACEF DEF ACEF DEF ACEF DEF
6 D DF DEF DEF DEF DEF DEF DEF
7 F F DEF DEF DEF DEF DEF DEF
8 EF DEF DE DEF DEF DEF DEF
























ABI (Application Binary Interface)














(LOAD + STORE × 2)× 10
A B
2 (A B) ( 5.5 ) B



































3. N N N











































ii. A X B X
C X

































(b) A B C






















E C&E 5.3 1


















Stanford Benchmark Stanford Benchmark







































































































































































[1] Ferrante, J., Ottenstein, K.J. and Warren, J.D.: “The Program Dependence Graph
and Its Use in Optimization”, ACM Trans. Prog. Lang. Syst., Vol.9, No.3, pp. 319–
349 (1987)
[2] Chaitin, G.J., Auslander, M.A., Chandro, A.K., Cocke, J., Hopkins, M.E., and Mark-
stein, P.W.: “Register Allocation via Coloring”, Computer Languages, Vol.#6, pp.
47–57 (1981)
[3] Chaitin, G.J.: “Register Allocation & Spilling via Graph Coloring” Proceedings of
the ACM SIGPLAN ’82 Symposium on Compiler Construction, pp. 95–105 (1982)
[4] Cytron, R., Ferrante, J., Rosen, B., Wegman, M. and Zadeck, K.: “An Efficient
Method of Computing Static Single Assignment Form”, Conf. Record of the 16th
ACM Symposium on the Principles of Programming Languages, pp. 25–35 (1989)
[5] , , : “
”, , Vol.37, No.6, pp. 1149–1161 (1996)
[6] Norris, C. and Pollock, L.L.: “A Scheduler-Sensitive Global Register Allocation”,
Proc. of the ACM SIGPLAN ’93 Conf. on Supercomputing, pp. 804–813 (1993)
108
[7] Pinter, S.S.: “Register Allocation with Instruction Scheduling: a New Approach”,
Proc. of the ACM SIGPLAN ’93 Conf. on Programming Languages Design and Im-
plementation, pp. 248–257 (1993)
[8] Berson, D.A., Gupta, R. and Soffa, M.L.: “Resource Spackling: A Framework for In-
tegrating Register Allocation in Local and Global Schedulers”, IFIP WG 10.3 Work-
ing Conference on Parallel Architectures and Compilation Techniques, pp. 135–146
(1994)
[9] Berson, D.A., Gupta, R. and Soffa, M.L.: “GURRR: A Global Unified Resource
Requirements Representation”, Proc. of ACM Workshop on InterMediate Represen-
tations, Sigplan Notices, Vol.30, pp. 23–34 (1995)
[10] , , , : “
”, , Vol.38,
No.3, pp. 584–594 (1997)
[11] Inagaki, T., Komatsu, H. and Nakatani, T.: “Integrated prepass scheduling for a Java
Just-In-Time compiler on the IA-64 architecture”, Proceedings of the international
symposium on Code generation and optimization: feedback-directed and runtime
optimization, pp. 159–168 (2003)
[12] Knoop, J., Ru¨nthing, O. and Steffen, B.: “Lazy Code Motion”, Proc. of PLDI,
Vol.27, No.7, pp. 224–234 (1992)
[13] Aho, A.V. and Ullman, J.D.: “Principles of Compiler Design” Addison-Wesley,
(1977)
[14] Aho, A.V., Sethi, R. and Ullman, J.D.: “Compilers – Principles, Techniques, and
Tools” Addison-Wesley, (1986)
109
[15] Poletto, M. and Sarkar, V.: “Linear scan register allocation”, ACM TOPLAS, Vol.
21, No. 5, pp. 895–913 (1999)
[16] Briggs, P.: “Value Numbering” Software: Practice and Experience Vol.27, No.6 pp.
701–724 (1997)
[17] Bergner, P., Dahl, P., Engebretsen, D. and O’Keefe, M.: “Spill Code Minimization
via Interference Region Spilling”, Proc. of the ACM SIGPLAN ’97 Conf. on Pro-
gramming Language Design and Implementation, pp. 287–295 (1997)
[18] Park, J. and Moon, S.: “Optimistic register coalescing”, Proc. PACT ’98, pp. 196–204
(1998)
[19] Briggs, P., Cooper ,K.D. and Torczon, L.: “Improvements to graph coloring register
allocation” ACM TOPLAS, Vol. 16, No. 3, pp. 428–455 (1994)
[20] George, L. and Appel, A.W.: “Iterated register coalescing” ACM TOPLAS, Vol. 18,
No. 3, pp. 300–324 (1996)
[21] Koseki, A., Komatsu, H. and Nakatani, T.: “Spill Code Minimization by Spill Code
Motion”, 12th International Conf. on Parallel Architectures and Compilation Tech-




, Vol.49, No.SIG1(PRO35), pp.96-102,
January, 2008
, ,
, Vol.46, No.SIG1(PRO24), pp.10-18, Jan-
uary, 2005
Non-Retrial Register Allocation and Its Spill Optimization Method
Masaki Kataoka, Akira Koseki, Hideaki Komatsu, Yoshiaki Fukazawa
The 2007 International Conference on Parallel and Distributed Processing
Techniques and Applications, June 25-28, 2007
•
, , , ,
112
2007 (SWoPP






, , , ,
58 (PRO-2005-5), March 16-17, 2006
•
, , , ,
8 (PPL2006),
March 5-7, 2006, PPL2006 pp163-169
•
, , , ,
57 (PRO-2005-4), January 16-17, 2006
• Java
, , , ,
57 (PRO-2005-4), January 16-17, 2006
• invokeinterface
, , , ,




49 (PRO-2004-1), May 18-19 ,2004
•
, , , ,




/ / (SWoPP2002), August
21-23, 2002, Vol.2002 No.81 pp79-84
