How sensitive is processor customization to the workload's input datasets? by Breughe, Maximilien et al.
1) Customization is insensitive to the input data set.
2) Interval analysis is extremely fast and accurate.
1. Introduction
2. Methodology
3. Results
4. Conclusions
6. References
2.2. Technique
$\mathrm{IPC} = \#\text{instructions} / \#\text{cycles}$
$\#\text{cycles} = \#\text{instructions} + \sum_i \text{misses}_i \times \text{latency}_i$
Interval analysis: raise the abstraction level of the simulation.
Only 1.4 years (on a single machine)!
Accurate within 3.6%!
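The cycle-count formula in this section can be sketched in a few lines of Python. This is only an illustration of the first-order model as written on the poster (one cycle per instruction plus a penalty per miss event), not the authors' tool; the function names, the event names, and all numbers are hypothetical.

```python
def interval_cycles(num_instructions, miss_events):
    """First-order cycle estimate: one cycle per instruction plus miss penalties.

    miss_events maps a (hypothetical) event name to a pair
    (number of misses, latency in cycles per miss).
    """
    penalty = sum(count * latency for count, latency in miss_events.values())
    return num_instructions + penalty


def interval_ipc(num_instructions, miss_events):
    """IPC = #instructions / #cycles, using the cycle estimate above."""
    return num_instructions / interval_cycles(num_instructions, miss_events)


# Made-up profile, for illustration only.
example_events = {
    "icache_miss": (1_000, 10),
    "dcache_miss": (2_000, 100),
    "branch_mispredict": (5_000, 8),
}
example_ipc = interval_ipc(1_000_000, example_events)
```

Because the miss counts come from a profile rather than from cycle-by-cycle simulation, evaluating a new architecture configuration in this sketch only requires new miss counts and latencies.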
[Figure: modeling framework. Blocks: application, input, profiling, profile, architecture configuration, analytical performance model, McPAT power model. Outputs: performance estimates, power estimates.]
Spearman rank correlation: ranking processors with 1 data set vs. ranking with 1000 data sets.
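As a rough illustration of the metric, the sketch below computes the Spearman rank correlation as the Pearson correlation of the ranks, with tied values sharing their average rank. The helper names and the five example scores are hypothetical and are not data from the study.

```python
def average_ranks(values):
    """Return 1-based ranks; tied values share the average of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    pos = 0
    while pos < len(order):
        end = pos
        # Extend the group while the next value ties with the current one.
        while end + 1 < len(order) and values[order[end + 1]] == values[order[pos]]:
            end += 1
        shared = (pos + end) / 2 + 1
        for k in range(pos, end + 1):
            ranks[order[k]] = shared
        pos = end + 1
    return ranks


def spearman(xs, ys):
    """Spearman rank correlation: Pearson correlation of the two rank lists.

    Raises ZeroDivisionError if either list is constant.
    """
    rx, ry = average_ranks(xs), average_ranks(ys)
    n = len(rx)
    mean_x, mean_y = sum(rx) / n, sum(ry) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(rx, ry))
    std_x = sum((a - mean_x) ** 2 for a in rx) ** 0.5
    std_y = sum((b - mean_y) ** 2 for b in ry) ** 0.5
    return cov / (std_x * std_y)


# Hypothetical scores of five processor designs (illustrative numbers only).
score_one_input = [1.0, 2.5, 2.0, 4.0, 3.5]
score_many_inputs = [1.1, 2.0, 2.4, 3.9, 3.6]
rho = spearman(score_one_input, score_many_inputs)
```

A value close to 1 means the two rankings order the designs almost identically, which is the sense in which a single input would be "enough" for design decisions.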
Processor design depends on the application.
Does it also depend on the application's input?
Is one input enough to make design decisions?
Very high correlation!
This research is funded by the Flemish Fund for Scientific Research (FWO-Vlaanderen)
2.1. Experiment
1000 inputs (input 1, input 2, ..., input 1000) x 10 MiBench applications x 1024 processor designs = 10 million evaluations!
[Figure: simulator setup. Blocks: application, input, architecture configuration, simulator. Outputs: power, performance.]
How sensitive is processor customization to 
the workload's input data sets?
*ELIS, Ghent University, Belgium; ‡ICT, Beijing, China; ǂINRIA, Saclay, France
input 1: 24x, 24x, 38x, 38x, 0x, 1x
[Figure: D-cache miss rate (axis ticks 10^-6, 10^-5, 10^-4) vs. cache config number; branch misprediction rate (axis ticks 0.1, 0.2, 0.3) vs. branch predictor config number.]
Maximilien Breughe*, Zheng Liǂ, Yang Chen‡, Stijn Eyerman*,
 Olivier Temamǂ, Chengyong Wu‡, Lieven Eeckhout*
input 2: 3x, 3x, 5x, 5x, 1x, 1x
Using 1 input to make design decisions correlates well with using 1000 inputs.
Cycle-level simulation would take 4000 years (on a single machine)!
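A back-of-the-envelope check of these numbers, as a sketch only: it assumes that both the 4000-year and the 1.4-year figures on the poster cover the same full set of evaluations (1000 inputs x 10 applications x 1024 designs) and uses a 365-day year. The constant and variable names are hypothetical.

```python
NUM_INPUTS = 1000
NUM_APPLICATIONS = 10
NUM_DESIGNS = 1024

# Size of the evaluation space ("10 million evaluations" on the poster).
evaluations = NUM_INPUTS * NUM_APPLICATIONS * NUM_DESIGNS

CYCLE_LEVEL_YEARS = 4000   # poster figure for cycle-level simulation
INTERVAL_YEARS = 1.4       # poster figure for interval analysis
SECONDS_PER_YEAR = 365 * 24 * 3600

# Implied speedup, and average time per evaluation, under the stated assumption.
speedup = CYCLE_LEVEL_YEARS / INTERVAL_YEARS
cycle_level_seconds_per_eval = CYCLE_LEVEL_YEARS * SECONDS_PER_YEAR / evaluations
interval_seconds_per_eval = INTERVAL_YEARS * SECONDS_PER_YEAR / evaluations
```

Under that assumption the implied speedup is roughly three orders of magnitude, with an average cost of a few hours per cycle-level evaluation versus a few seconds per interval-analysis evaluation.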
[Figure: IPC (axis from 0 to 1) over time t, with miss events marked: I-cache miss, branch misprediction, hazard due to dependency.]
[1] M. Breughe, Z. Li, Y. Chen, S. Eyerman, O. Temam, C. Wu, and L. Eeckhout. How sensitive is processor customization to the workload's input datasets? In Proceedings of IEEE Symposium on Application Specific Processors (SASP), June 2011.
dijkstra.cc
5. Future work
1) Study the sensitivity of compiler optimization to microarchitectural design decisions.
2) Compare interval analysis to a neural network.
3) Validate interval analysis against an industrial simulator.
4) Simulate other processor families.
[Figure: correlation coefficient per benchmark: adpcm_c, adpcm_d, dijkstra, gsm, jpeg_c, jpeg_d, patricia, qsort, sha, stringsearch.]
