Systems and Networks on Chip - Challenges and Solutions by De Micheli, Giovanni
SYSTEMS and NETWORKS on CHIP
Challenges and Solutions
Giovanni De Micheli
Centre Systèmes Intégrés
De Micheli 2
What is a System on Chip?
• What is a system?
Why on one chip?
• Cost reduction
• Integration issues
Multi-processor SoCs
• Large-scale
– Billion-transistor chips
– Multi-cores, multi-threaded sw
• Power-consumption limited
• Very expensive to design
– Non recurring engineering costs
– Migration toward software
IBM Cell Multi-Processor
Application-specific SoCs
• Multi-core systems
• Embedded memory
– Embedded software
• Heterogeneous structure
– Flexibility vs optimality
• Example:
– Telecom EDGE baseband and
multimedia handset [STM]
Platforms
• Economic viability requires large production
volumes
– Domain-specific hardware
– Differentiation via software
• Examples
– Telecom:
• Philips Nexperia
• ST Nomadik
– Automotive
 
Where are we heading?
Outline
• Evolutionary technologies
– Variability and reliability
– Thermal management and networking
• Revolutionary technologies
– Nano and molecular electronics
• Summary and conclusions
[Diskobolos - Myron: 460 BC]
Medium term trends
• Feature size downscaling
• Increasing transistor density and clock frequency
– Power and thermal management
• Lower supply voltage
– Reduced noise immunity
• Increasing the spread of physical parameters
– Inaccurate modeling of physical behavior
– Variability and reliability
Variability: physical motivation
• 20 nm MOSFET (2010?): 50 Si atoms along the channel
• 4 nm MOSFET (2020?): 10 Si atoms along the channel
[Figure: dopant atoms in the MOSFET channel]
Prediction vs reality
[Figure: histogram of the number of nets vs. (real delay - predicted delay), spanning -100% to +100%. The average of the wireload model is what you designed for; the spread runs from fastest/best to slowest/worst, where the circuit does not work]
• Models used for design are not accurate enough
• Statistical spreading makes design over-conservative
Design space exploration
[Figure: delay vs. voltage curves for the min, typ and max cases under worst-case analysis; Pareto points lie on the w.c. curve]
Adaptive design space
[Figure: delay vs. voltage curves for the min, typ and max cases, compared with worst-case analysis, with a margin labeled Δ]
• As parameters spread, w.c. design is too pessimistic
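The pessimism of worst-case design can be illustrated with a small numeric sketch. Everything in it is an assumption made for illustration: the alpha-power-law delay model, the threshold voltages attached to the min/typ/max corners, and the 2.0 ns delay target are not taken from the slides. It searches for the lowest supply voltage that meets the delay target at each corner and reports the gap between the worst-case and typical answers.

```python
def delay_ns(v, vt, k=1.0, alpha=1.3):
    """Assumed alpha-power-law delay model: delay falls as supply v rises above vt."""
    return k * v / (v - vt) ** alpha


def min_supply_mv(vt, target_ns, lo_mv=600, hi_mv=1500, step_mv=10):
    """Lowest supply (in mV) on a coarse grid that meets the delay target, or None."""
    for mv in range(lo_mv, hi_mv + 1, step_mv):
        v = mv / 1000.0
        if v > vt and delay_ns(v, vt) <= target_ns:
            return mv
    return None


# Hypothetical threshold voltages (V); "max" stands for the slowest (max-delay) corner.
CORNERS = {"min": 0.25, "typ": 0.30, "max": 0.35}

supply = {name: min_supply_mv(vt, target_ns=2.0) for name, vt in CORNERS.items()}
delta_mv = supply["max"] - supply["typ"]  # extra voltage paid by designing for the worst case
print(supply, delta_mv)
```

Under these assumptions, a typical chip designed for the worst-case corner runs at a higher voltage than it needs, which is the margin an adaptive design could try to recover.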
Self-calibrating circuits
• Address variability and
robustness
• Design self-calibrating circuits
operating at the edge of failure
• Examples:
– Dynamic voltage scaling of bus
swings [Ienne –EPFL]
– Dynamic voltage scaling in
processors
• Razor [Austin – U Mich]
– Dynamic latency adjustment for
NoCs
• Terror [Stanford]
[Figure: block diagrams of the examples. Labels: Slow Pipeline A and Slow Pipeline B (clk/2) and a Fast Pipeline (clk) built from 18x18 multipliers, fed by 48-bit LFSRs and compared (!=) into a 40-bit error counter; a link with encoder, FIFOs, decoder and controller, with Ack and error signals and a channel voltage Vch derived from Vdd]
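The idea of operating at the edge of failure can be sketched as a simple calibration loop. This is a minimal illustration, not the Razor or bus-swing scheme cited above: the function names, the 50 mV step, the error budget and the toy error model are all hypothetical. The loop lowers the supply in steps and stops at the last voltage whose observed error rate stays within the budget.

```python
def calibrate_supply_mv(error_rate, v_max_mv=1200, v_min_mv=600,
                        step_mv=50, max_error_rate=0.01):
    """Step the supply down while the measured error rate stays within budget.

    error_rate is a callable (supply in mV -> observed error rate); in hardware
    it would be fed by an error counter, here it is supplied by the caller.
    """
    v = v_max_mv
    while v - step_mv >= v_min_mv and error_rate(v - step_mv) <= max_error_rate:
        v -= step_mv
    return v


def toy_error_rate(v_mv, v_crit_mv=850):
    """Hypothetical error model: error-free at or above v_crit_mv, rising linearly below."""
    if v_mv >= v_crit_mv:
        return 0.0
    return min(1.0, (v_crit_mv - v_mv) / 500)


print(calibrate_supply_mv(toy_error_rate))
```

A real controller would also raise the voltage again when errors reappear, since the critical voltage drifts with temperature and aging.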
Outline
• Evolutionary technologies
– Variability and reliability
– Thermal management and networking
• Revolutionary technologies
– Nano and molecular electronics
• Summary and conclusions
[Laocoön - Hagesandros, Athanadoros and Polydoros: 1st century BC]
Dealing with transient
malfunctions
• Soft errors
– Data corruption due to external
radiation exposure
• Crosstalk
– Data corruption due to internal
field exposure
• Both malfunctions manifest
themselves as timing errors
– Error containment
Sources of soft errors
[Figure: the solar wind (source) produces neutrons at sea level; their strong (nuclear) interaction with silicon yields ions (25Mg+α, 28Al+p, 24Mg+n+α); the electromagnetic interaction of the ions with silicon (silicon reaction) frees electrons, causing a transient pulse in the transistor]
Soft error rates
• Vary with altitude and latitude
Propagation of soft errors
Memory protection techniques
[Figure: code and data memories with their protection logic, in three groups: Error Correcting Codes, Shielding, Error Management]
• Error correcting codes: ECC logic protecting code memory and data memory
– Standard ECC, Low Area ECC, High Speed ECC
– Shared ECC, ex:
  - 4x4 bits: 5-bit ECC (instead of 4x3=12)
  - 4x8 bits: 6-bit ECC (instead of 4x5=20)
• Embedded protection: protection IP embedded in the memory IP
– Speed Optimized ECC
• System level protection techniques
– Bus control, parity, ERR CTRL
[Source IROC]
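The saving from a shared ECC can be checked with the Hamming bound for single-error correction: r check bits suffice for m data bits when 2^r >= m + r + 1. The sketch below assumes plain single-error-correcting codes, which the slide does not state. It reproduces the 3-bit, 5-bit and 6-bit figures; for an 8-bit word it yields 4 bits, one fewer than the slide's 5, so the slide may be counting an extra bit there (for example an overall parity bit).

```python
def sec_check_bits(data_bits):
    """Smallest r with 2**r >= data_bits + r + 1 (single-error-correcting Hamming code)."""
    r = 0
    while 2 ** r < data_bits + r + 1:
        r += 1
    return r


for words, width in [(4, 4), (4, 8)]:
    separate = words * sec_check_bits(width)  # one code per word
    shared = sec_check_bits(words * width)    # one code over all words together
    print(f"{words}x{width} bits: shared {shared} vs separate {separate}")
```

The trade-off is that a shared code corrects one error across the whole group of words rather than one error per word.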
Logic protection techniques
[Figure: logic protection techniques, in three groups: Redundancy, Shielding, Others]
• Redundancy (TMR): module, duplicated module and triplicated module feeding a voter
– 100% to 200% overhead
• Detection + system correction: a DFF and a duplicate DFF (with delay δ) after the combinational logic feed a comparator that raises ERR
– ERR signal used by system (hardware or software) to correct the error
• Hardened libraries: protection transistor embedded in the cell
• Other techniques: SCAN hardware used as DETECTION hardware in functional mode
[Source IROC]
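The two redundancy styles listed above can be sketched in a few lines. This is an illustrative software model, not a hardware description: the helper names, the fault-injection arguments and the toy module are hypothetical. Triplication masks a single faulty copy through a bitwise majority vote, while duplication only detects a mismatch and leaves correction to the system.

```python
def majority_vote(a, b, c):
    """Bitwise 2-of-3 majority of three integer outputs."""
    return (a & b) | (a & c) | (b & c)


def tmr(module, x, faulty_copy=None, flip_mask=0):
    """Run three copies of module; optionally corrupt one copy's output bits."""
    outs = [module(x) for _ in range(3)]
    if faulty_copy is not None:
        outs[faulty_copy] ^= flip_mask
    return majority_vote(*outs)


def duplicate_and_compare(module, x, flip_mask=0):
    """Run two copies; return (output, err). err only flags the fault, it does not fix it."""
    out_a = module(x)
    out_b = module(x) ^ flip_mask
    return out_a, out_a != out_b


def toy_module(x):
    """Hypothetical 8-bit combinational function used only for the demonstration."""
    return (3 * x + 1) & 0xFF


print(tmr(toy_module, 5, faulty_copy=1, flip_mask=0x04))
print(duplicate_and_compare(toy_module, 5, flip_mask=0x04))
```

The voter masks any fault confined to one copy; two copies corrupted in the same bit would outvote the correct one.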
Outline
• Evolutionary technologies
– Variability and reliability
– Thermal management and networking
• Revolutionary technologies
– Nano and molecular electronics
• Summary and conclusions
[Charioteer of Delphi - Sotades: 474 BC]
Aging of materials
• Failure mechanisms
– Electromigration
– Oxide Breakdown
– Thermo-mechanical stress
• Temperature dependence
– Arrhenius law
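The Arrhenius temperature dependence can be made concrete with an acceleration factor: if the failure rate scales as exp(-Ea / kT), then running at a higher temperature shortens lifetime by exp((Ea / k)(1/T_low - 1/T_high)). The sketch below assumes an activation energy of 0.7 eV purely for illustration; the actual value depends on the failure mechanism and is not given in the slides.

```python
import math

BOLTZMANN_EV_PER_K = 8.617333262e-5  # Boltzmann constant in eV/K


def arrhenius_acceleration(t_low_c, t_high_c, activation_energy_ev=0.7):
    """Factor by which failures speed up at t_high_c relative to t_low_c (both in Celsius)."""
    t_low = t_low_c + 273.15
    t_high = t_high_c + 273.15
    exponent = activation_energy_ev / BOLTZMANN_EV_PER_K * (1.0 / t_low - 1.0 / t_high)
    return math.exp(exponent)


# Example: a 120 C hot spot compared with a 60 C region, assumed Ea = 0.7 eV.
print(arrhenius_acceleration(60, 120))
```

Because the dependence is exponential, a few tens of degrees can change the expected lifetime by more than an order of magnitude under this assumption.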
Thermal maps
1.5 GHz Itanium-2
[Figure: thermal map, temperature in °C; labeled regions: execution core (120 °C), cache, AGU]
[Source: Intel Corporation and Prof. V. Oklobdzija]
Thermal maps
multiprocessors
Thermal effects
• Keep chip as cool as possible
– Reduce failure rates and power consumption
• In a multi-processor (multi-core) system,
power management shuts down idle cores
– The temperature distribution will change in time
– Thermal stress may increase
• Balance temperature reduction and thermal
stress
Component redundancy
• Use stand-by components to
replace faulty ones
– Provide for temporary or
permanent back-up
– Exploit power/thermal
management
– Load sharing
• A programmable and flexible
interconnection is required
[Figure: array of components with one marked faulty, three on standby, and a memory]
From power to system
management
• Analyze system-level reliability
– as a function of a power management policy
• Determine a system management policy
– to maximize reliability (over a time interval) and
minimize energy consumption
• Determine a system management policy and
system back-up topology
– to maximize reliability (over a time interval) and
minimize energy consumption
System-level management
• Reliability and energy management can be
modeled by stochastic processes
– Stochastic optimum control for policy design
– As more accurate models are required, policy
design is harder
• Simulation of system management policies is
useful for assessing effectiveness of
redundancy and energy cost
– Simulation results show dominant effect of
temperature and its cycling on system reliability
• Optimal policy design is also possible
Effect of DPM policy on MTTF
• Power and temperature gap
between active and sleep state
• Small gap
– Thermal cycle effects dominate
EM and TDDB only in the lower
temperature spectrum
– MTTF decreases/increases as
DPM gets more aggressive
• Wider gap
– Thermal cycle effects dominate
– MTTF always decreases as DPM
gets more aggressive
[Simunic – UCSD]
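The tension between energy saving and thermal cycling can be illustrated with a toy timeout policy. This is a sketch under simplifying assumptions, not the model behind the results above: the power numbers are arbitrary units, transitions cost nothing, and every entry into the sleep state is counted as one thermal cycle. A shorter timeout means a more aggressive policy.

```python
def simulate_timeout_policy(idle_periods, timeout, p_active=1.0, p_sleep=0.1):
    """Return (energy spent during idle periods, number of sleep transitions).

    The component stays active for `timeout` time units of each idle period and
    sleeps for the remainder; idle periods no longer than the timeout never sleep.
    """
    energy = 0.0
    sleep_transitions = 0
    for idle in idle_periods:
        if idle > timeout:
            energy += timeout * p_active + (idle - timeout) * p_sleep
            sleep_transitions += 1
        else:
            energy += idle * p_active
    return energy, sleep_transitions


idle_trace = [1, 5, 10, 2, 20]  # hypothetical idle-period lengths
print(simulate_timeout_policy(idle_trace, timeout=8))  # conservative policy
print(simulate_timeout_policy(idle_trace, timeout=1))  # aggressive policy
```

On this trace the aggressive policy uses less energy but doubles the number of sleep transitions, which is the kind of trade-off a system management policy has to weigh against reliability.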
Outline
• Evolutionary technologies
– Variability and reliability
– Thermal management and networking
• Revolutionary technologies
– Nano and molecular electronics
• Summary and conclusions
[Doryphoros - Polykleitos]
Component-based SoC design
• SoCs are designed (re)-using large macrocells
– Processors, controllers, memories…
– Plug and play methodology is very desirable
– Components are qualified before use
• Design challenge:
– Provide a functionally-correct, reliable operation of the
interconnected components
• Critical issue:
– Design of the communication fabric
Example:
Motorola MSC8126 platform
• Communication scalability is the bottleneck
Entire Chip is Not Reachable in One Clock Cycle!
[Source: Leblebici]
Why on-chip networking?
• Provide a structured methodology
 for realizing on-chip communication
– Modularity
– Flexibility
• Cope with inherent limitations of busses
– Performance and power of busses do not scale up
• Support reliable operation
– Layered approach to error detection and correction
[Figure: a processing element (PE) attached through a network interface; packets follow routes through the network]
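How a packet finds its route can be sketched with dimension-ordered (XY) routing on a 2D mesh. XY routing is chosen here only as a common, simple example; the slides do not prescribe a routing function, and the coordinate scheme is an assumption. The packet first travels along the x dimension until the column matches, then along y.

```python
def xy_route(src, dst):
    """Return the list of (x, y) switches visited from src to dst, x dimension first."""
    x, y = src
    path = [(x, y)]
    while x != dst[0]:
        x += 1 if dst[0] > x else -1
        path.append((x, y))
    while y != dst[1]:
        y += 1 if dst[1] > y else -1
        path.append((x, y))
    return path


print(xy_route((0, 0), (2, 1)))
```

Because every packet resolves x before y, the route depends only on source and destination, which keeps the switches simple and makes latency easy to predict.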
NoC multi-processors:
the RAW architecture [MIT]
• Fully programmable SoC
– Homogeneous array of tiles:
• Processor cores
with local storage
• Each tile has a router
• The raw architecture is exposed to the compiler
– Cores and routers are programmable
– Compiler determines which wires are used at each cycle
– Compiler pipelines long wires
The BONE Roadmap
[KAIST]
Metrics for NoC design
• Low communication latency
– Streamlined control protocols
– Data and control signals can be separate
• High communication bandwidth
– To support demanding SW applications
– Great match to stream computing
• Low energy consumption
– Wiring switched capacitance dominates
• Error resiliency
– To compensate/correct electrical-level errors
• Flexibility and programmability
Error resiliency
• Several implementation
styles:
– Local link-level
• ECC in switches
– Global end to end
• ECC at core interfaces
– Transaction level
• Software approach
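The difference between the link-level and end-to-end styles can be sketched with a single parity bit standing in for a real error-correcting code. This is a deliberate simplification: parity only detects an odd number of flipped bits and corrects nothing, so the link-level variant below recovers by retransmission. The function names and the error masks used to inject faults are hypothetical.

```python
def parity(word):
    """Even-parity bit of an integer word."""
    return bin(word).count("1") & 1


def hop_with_retry(word, error_masks):
    """Link-level check: resend over one link until the parity check passes.

    error_masks lists the bit flips injected on successive attempts; a final
    clean attempt is appended so the loop always terminates.
    Returns (received word, number of attempts).
    """
    sent_parity = parity(word)
    attempts = 0
    for mask in list(error_masks) + [0]:
        attempts += 1
        received = word ^ mask
        if parity(received) == sent_parity:
            return received, attempts


def end_to_end(word, hop_masks):
    """End-to-end check: errors accumulate over all hops, checked once at the destination."""
    sent_parity = parity(word)
    received = word
    for mask in hop_masks:
        received ^= mask
    return parity(received) == sent_parity, received


print(hop_with_retry(0b1011, [0b0100]))
print(end_to_end(0b1011, [0, 0b0100, 0]))
```

The link-level variant repairs the error where it occurs at the cost of a check in every switch, while the end-to-end variant keeps switches simple but must resend across the whole path.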
Flexibility in NoC design
• NoCs have modular structure
– Core Interfaces
– Switches/routers
– High-speed links
• NoCs can be tailored to
applications
– Topology selection
– Switch/link sizing
– Protocols
• Several parameters for optimization
and a large design space
[Figure: example NoC connecting CPU, DSP and memory cores through network interfaces, switches and links]
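Topology selection can be illustrated by comparing one simple metric, the average hop count between distinct nodes, for two candidate topologies. The choice of a 4x4 mesh and a 16-node ring is an assumption for illustration only; a real flow would also weigh area, power and the traffic pattern of the application.

```python
from itertools import product


def mesh_avg_hops(cols, rows):
    """Average Manhattan distance between distinct nodes of a cols x rows mesh."""
    nodes = list(product(range(cols), range(rows)))
    dists = [abs(ax - bx) + abs(ay - by)
             for (ax, ay) in nodes for (bx, by) in nodes
             if (ax, ay) != (bx, by)]
    return sum(dists) / len(dists)


def ring_avg_hops(n):
    """Average shortest-path distance between distinct nodes of a bidirectional ring."""
    dists = [min((a - b) % n, (b - a) % n)
             for a in range(n) for b in range(n) if a != b]
    return sum(dists) / len(dists)


print(mesh_avg_hops(4, 4), ring_avg_hops(16))
```

For 16 cores under uniform traffic the mesh needs fewer hops on average than the ring, at the price of more links and larger switches.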
xPipes NoC Synthesis Flow
[Figure: xpipes synthesis flow. SUNMAP (topology mapping and selection) takes the application code and system configuration from co-design, together with the topology library, area library, power library, routing function and floorplanner; the xpipesCompiler then uses the xpipes component library to generate SystemC code for platform generation, simulation and synthesis]
• Support for several topologies and routing functions
• Area, power, delay optimization
Comparisons
• 130nm UMC library
• Cores: 1mm² obstructions (ARM cores, 32kB SRAM)
• Wire routing over the cores was forbidden
[Figure: two floorplans with 1 mm² core obstructions: an AMBA AHB layer with shared slaves, and a mesh row with 2 NIs + 1 switch]
Summary of results:
• 2.7% vs. 17% post P&R timing degradation → Much improved physical scalability
• Clock frequency 885 vs. 400 MHz → Much faster
• 16% application speedup (longer latency, but more effective bandwidth)
• 7x more area and 5x more power (mostly due to flip-flops in buffers)
• Overall better energy efficiency for >4 Watts Proc&Mem power
• Predictability is highly enhanced
• Efficiency: competitive with state-of-the-art interconnect even in 130nm technology
[Source: Benini]
Outline
• Evolutionary technologies
– Variability and reliability
– Thermal management and networking
• Revolutionary technologies
– Nano and molecular electronics
• Summary and conclusions
[Scopas: Torso]
New computing materials
• When will current semiconductor technologies run out of steam?
• What factor will provide a radical change in technology?
– Performance, power density, cost?
• Several emerging technologies:
– Silicon nanowires, carbon nanotubes, single-electron devices,
molecular switches, quantum devices, biological computing, …
• Are these technologies compatible with silicon?
– What is the transition path?
• What are the common characteristics, from a design technology
standpoint?
Common characteristics of
nano-devices
• Self-assembly used to create structures
– Manufacturing paradigm is bottom-up
• Significant presence of physical defects
– Massively fault-tolerant design style
• Competitive advantage stems from the
high density of computing elements
– Two orders of magnitude higher than scaled CMOS
Nanoscale and molecular
electronics
• Nano-scale electronics
– Includes scaled-down traditional CMOS
– Challenges induced by nanometric scale
– Scaling limit?
• Molecular electronics
– Devices exploit molecular structure
• There is plenty of room at
the bottom
Micro/Nano scale compatibility
[Figure: PolySi nanowire integrated with CMOS]
[Source: Leblebici]
Reliable nano-design
• Massive parallelism and redundancy
• Local and global configuration
• Regular layout
– Exploit properties of crosspoint architectures
• E.g., Programmable Logic Arrays (PLAs)
– Wiring delays are predictable
– Regular structures support redundant logic design
Reliable nano-design:
logic synthesis
• Device level redundancy
– Duplicate transistors to achieve broader
coverage
– Cover Boolean implicants more than once
• New paradigm for testing
– Circuit with faulty devices may still be OK
– Exploit, rather than remove, redundancy
• Objective is overall yield
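Covering implicants more than once can be sketched with a PLA modeled as an OR of product terms. The example function, the term encoding and the stuck-at-0 defect model are assumptions chosen for illustration. With every implicant duplicated, a single defective term leaves the output unchanged, whereas the same defect in the non-redundant cover changes the function.

```python
from itertools import product


def pla_output(terms, inputs, faulty=()):
    """OR of AND terms. Each term maps a variable name to its required value.

    Terms whose index is in `faulty` are treated as stuck at 0 (assumed defect model).
    """
    return int(any(
        index not in faulty and all(inputs[var] == val for var, val in term.items())
        for index, term in enumerate(terms)
    ))


# f = a.b + (not a).c, first as a minimal cover, then with each implicant duplicated.
minimal = [{"a": 1, "b": 1}, {"a": 0, "c": 1}]
redundant = minimal + minimal


def matches_reference(terms, faulty):
    """True if the faulty PLA agrees with the fault-free function on all inputs."""
    return all(
        pla_output(terms, dict(zip("abc", bits)), faulty)
        == pla_output(minimal, dict(zip("abc", bits)))
        for bits in product((0, 1), repeat=3)
    )


print(matches_reference(minimal, faulty={0}))    # defect breaks the minimal cover
print(matches_reference(redundant, faulty={0}))  # defect is masked by the duplicate
```

This is why a circuit with faulty devices may still be acceptable: the test criterion becomes whether the function is preserved, not whether every device works.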
Reliable nano-design
Weighted Averaging
Fault tolerant architecture based on multiple layers:
input layer → logic layer (identical logic blocks) → averaging layer (weighted average blocks) → decision layer (threshold decision block)
[Figure: inputs x_1 ... x_n with weights k_1 ... k_n feeding an averaging block with output y]
General weighted averaging and re-scaling function used in the third layer:
$$y = \frac{V_{fs}}{\sum_{i} k_i} \sum_{i} k_i x_i$$
[Leblebici and Schmid]
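The averaging and decision layers can be sketched numerically. The weighted average follows the re-scaling function of the third layer, y = V_fs · Σ k_i x_i / Σ k_i; the decision threshold of V_fs / 2, the equal weights and the example outputs are assumptions for illustration and are not taken from the slide.

```python
def weighted_average(outputs, weights, v_fs=1.0):
    """Averaging layer: y = v_fs * sum(k_i * x_i) / sum(k_i)."""
    return v_fs * sum(k * x for k, x in zip(weights, outputs)) / sum(weights)


def decide(y, v_fs=1.0):
    """Decision layer: threshold at half of full scale (assumed threshold)."""
    return 1 if y >= v_fs / 2 else 0


# Four identical logic blocks should all output 1; the third one is faulty.
block_outputs = [1, 1, 0, 1]
y = weighted_average(block_outputs, weights=[1, 1, 1, 1])
print(y, decide(y))
```

With equal weights, the decision stays correct as long as fewer than half of the redundant blocks are faulty; unequal weights would let more trusted blocks count for more.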
Architectural implications
• Modularity, redundancy, regularity
• Cellular approach to computation
– Cellular nonlinear networks
– Stream computing
• Programming paradigms
– Designers need to think “parallel” to best exploit
these architectures
Outline
• Evolutionary technologies
– Variability and reliability
– Thermal management and networking
• Revolutionary technologies
– Nano and molecular electronics
• Summary and conclusions
Summary and conclusions
• The electronic market is driven by embedded applications where
performance and reliability are key figures of merit
• Hardware systems are becoming more prone to failure
– Variations in manufacturing
– Hard and soft malfunctions
• Reliability can be enhanced by component and communication redundancy
– System management is critical for long-lasting operation
– On-chip networks support redundancy
• Massive parallelism and redundancy are key to designing highly dependable
circuits with nano-technologies
– Sub 45nm CMOS technologies
– Novel silicon and non-silicon based nano-technologies
