A Lightweight NoC for the NOVA Packet Processing Platform by Sauer, Christian et al.
HEINZ NIXDORF INSTITUTE
University of Paderborn
System and Circuit Technology
Prof. Dr.-Ing. Ulrich Rückert
A Lightweight NoC for the
NOVA Packet Processing Platform
C. Sauer¹, M. Gries¹, S. Dirk¹, J.-C. Niemann², M. Porrmann², U. Rückert²
¹ Infineon Technologies, Access Communications, Munich   ² Heinz Nixdorf Institute, Paderborn
Motivation & Goal
Application Domain
Network-Optimized Versatile Architecture (NOVA) Platform: Concepts
Status/Next Steps
Partners/Funding
References:
[1] C. Sauer, M. Gries, S. Sonntag, "Modular Reference Implementation of an IP-DSLAM," 10th IEEE Symposium on Computers and Communications (ISCC 2005), Cartagena, Spain, Jun 2005
[2] J.-C. Niemann, M. Porrmann, C. Sauer, U. Rückert, "An Evaluation of the Scalable GigaNetIC Architecture for Access Networks," 2nd Advanced Networking and Communications Hardware Workshop (ANCHOR) along with ISCA 2005, Madison, Wisconsin, USA
[3] R. Eickhoff, J.-C. Niemann, M. Porrmann, U. Rückert, "Adaptable Switch boxes as on-chip routing nodes for networks-on-chip," In: Rettberg, Zanella, Mauro, Rammig, "From Specification to Embedded Systems Application," IESS 2005, Manaus, Brazil
Modular platform
• Number and type of processing cores
• Number and type of co-processors
• Number and type of I/O interfaces
• Heterogeneous memory hierarchy
• Communication architecture
Systematic development of platform building blocks
• Profiling of embedded general-purpose cores (and their 
compilers) and specialized packet processing engines
• Packet processing engines provide high performance; 
general-purpose cores are flexible and have mature compilers
⇒ Provides a first estimate of required parallelism
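How profiling data could be turned into a first parallelism estimate can be illustrated with a back-of-the-envelope calculation. This is a minimal sketch, not the authors' method: the function name, the packet rate, and the cycles-per-packet figure are hypothetical placeholders, and perfect load balancing is assumed. Only the clock value of 285 (assumed to be MHz) is borrowed from the N-Core figures on this poster.

```python
def cores_needed(packets_per_s: int, cycles_per_packet: int, clock_hz: int) -> int:
    """Lower bound on the number of cores, assuming perfect load balancing."""
    required_cycles_per_s = packets_per_s * cycles_per_packet
    # Integer ceiling division avoids floating-point rounding.
    return -(-required_cycles_per_s // clock_hz)


# Hypothetical workload: 1 Mpacket/s, 2000 cycles per packet (from profiling),
# on cores clocked at 285 MHz.
print(cores_needed(1_000_000, 2_000, 285_000_000))  # prints 8
```

A real estimate would also have to account for memory stalls and communication overhead, which this lower bound ignores.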
GigaNoC – hierarchical Network-on-chip
• Bus-based network for small clusters (< 10 components)
• Switch boxes for connecting clusters
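The two-level hierarchy can be sketched as a simple delivery decision: traffic within a cluster stays on the local bus, and everything else crosses the switch boxes. The address format (cluster ID plus local ID) and all names below are assumptions for illustration; the poster does not specify the GigaNoC addressing scheme.

```python
from typing import NamedTuple


class Address(NamedTuple):
    cluster: int  # cluster index (hypothetical field)
    local: int    # component index on the cluster's local bus (hypothetical field)


def route(src: Address, dst: Address) -> str:
    """Return which level of the hierarchy carries the message."""
    if src.cluster == dst.cluster:
        return "local bus"      # intra-cluster traffic never leaves the bus
    return "switch boxes"       # inter-cluster traffic crosses the switch boxes


print(route(Address(0, 1), Address(0, 3)))  # prints local bus
print(route(Address(0, 1), Address(2, 3)))  # prints switch boxes
```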
NoC principles
• Message passing interface with priorities: computation
can continue while message is transmitted to destination
• Incorporation of flow control by backpressure signaling to 
avoid overload and loss
• Lightweight one-word header, which contains destination, 
message type, and context ID for fast flow-through processing
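A one-word header with these three fields could be packed and unpacked as sketched below. The word size (32 bits) and the field widths (16-bit destination, 8-bit message type, 8-bit context ID) are assumptions chosen for illustration; the poster gives neither, and the function names are hypothetical.

```python
from typing import Tuple

# Assumed field widths; together they fill one 32-bit word.
DEST_BITS, TYPE_BITS, CTX_BITS = 16, 8, 8


def pack_header(dest: int, msg_type: int, ctx_id: int) -> int:
    """Pack destination, message type, and context ID into a single word."""
    for value, bits in ((dest, DEST_BITS), (msg_type, TYPE_BITS), (ctx_id, CTX_BITS)):
        if not 0 <= value < (1 << bits):
            raise ValueError("field does not fit its assumed width")
    return (dest << (TYPE_BITS + CTX_BITS)) | (msg_type << CTX_BITS) | ctx_id


def unpack_header(word: int) -> Tuple[int, int, int]:
    """Recover (destination, message type, context ID) from a header word."""
    ctx_id = word & ((1 << CTX_BITS) - 1)
    msg_type = (word >> CTX_BITS) & ((1 << TYPE_BITS) - 1)
    dest = word >> (TYPE_BITS + CTX_BITS)
    return dest, msg_type, ctx_id


header = pack_header(0x12, 0x03, 0x7F)
print(hex(header))  # prints 0x12037f
```

Because every field sits at a fixed bit position, a receiver can decode the destination with one shift, which is the kind of fast flow-through processing a single-word header allows.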
• Verification of approach and concepts
• Currently implementing initial NOVA platform prototype
comprising approx. 20 message passing clients
• Application-driven analysis using system-level benchmark
• Mapping IP-DSLAM reference application to the platform
• Enables detailed quantitative exploration of design trade-offs
• Demonstrator at CeBIT 2006 trade fair
• Evaluation of GigaNoC for larger systems [2]
Infineon Technologies
• Communications, Access System Engineering (COM AC SE NP)
• 81726 Munich, Germany
• E-mail: Christian.Sauer@infineon.com
Heinz Nixdorf Institute
• University of Paderborn, System and Circuit Technology
• Fürstenallee 11, 33102 Paderborn, Germany
• E-mail: niemann@hni.upb.de
This project is supported in part by the research grants GigaNetIC and 
PlaNetS of the German government (BMBF).
• Modular platform construction kit to investigate design 
criteria, such as flexibility, programmability, area, and 
performance 
• Reuse of off-the-shelf components (e.g., programmable 
embedded cores) and deployment tools where possible
• Refinements (instruction set, co-processors, etc.) where 
needed as determined by profiling of reference 
applications
• Synergy: Hardware platform directly supports message 
[Figure labels: PD/IP IF, SM/IP IF]
• Separation of IP-specific interfaces from 
interconnect and memory interfaces
Component                                Area [mm²]                          Frequency [MHz]
                                         130 nm            90 nm             130 nm   90 nm
32 cores [N-Core]                        32 x 0.16         32 x 0.12         205      285
8 switch boxes [with 5 ports]            8 x 1.129         8 x 0.53          560      650
32 local RAMs (32 KB)                    32 x 0.875        32 x 0.875        400      450
 + 8 local packet buffers (2 x 16 KB)     + 8 x 2 x 0.466   + 8 x 2 x 0.466
8 local on-chip buses                    8 x 0.05          8 x 0.02          211      290
Total                                    50.01             43.7              205      285
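The totals in the table can be cross-checked by summing instance count times per-instance value for the first two columns. The sketch below does only that arithmetic. The variable and function names are hypothetical, and the 8 x 2 packet buffers are entered as 16 instances.

```python
# (instance count, value per instance) for each table row, first two columns.
column_130nm = [(32, 0.16), (8, 1.129), (32, 0.875), (16, 0.466), (8, 0.05)]
column_90nm = [(32, 0.12), (8, 0.53), (32, 0.875), (16, 0.466), (8, 0.02)]


def total(entries):
    """Sum of count * per-instance value over all rows."""
    return sum(count * value for count, value in entries)


print(round(total(column_130nm), 2))  # prints 50.01
print(round(total(column_90nm), 1))   # prints 43.7
```

The memories contribute the same amount in both columns (28 + 7.456 = 35.456), so they account for most of the total in either technology. The last two columns of the Total row repeat the N-Core values, the smallest in each column.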
• Message passing semantics between computational kernels
• Subject to tight constraints on costs and performance
• Flexibility required to support broad variety of protocols and 
customer requirements
• Two NoC interfaces, each buffering 16 messages of 64 B, 
need less than 1/3 of the CPU subsystem area
[Figure: Relative area distribution]
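The buffering figure for the two NoC interfaces amounts to 2 x 16 x 64 B = 2048 B of message storage. The sketch below models one interface as a bounded queue whose "full" condition stands in for a backpressure signal, so the sender stalls instead of losing messages. Class and method names are hypothetical, message priorities are omitted, and the real interface is hardware, not software.

```python
from collections import deque

MSG_BYTES = 64  # message size, from the poster
DEPTH = 16      # messages buffered per interface, from the poster


class NocInterface:
    """Bounded message buffer with a backpressure flag (illustrative model)."""

    def __init__(self, depth: int = DEPTH):
        self.depth = depth
        self.queue = deque()

    @property
    def backpressure(self) -> bool:
        """True when the buffer is full and the sender must stall."""
        return len(self.queue) >= self.depth

    def push(self, msg: bytes) -> bool:
        """Accept a message unless backpressure is asserted; nothing is dropped."""
        if len(msg) > MSG_BYTES:
            raise ValueError("message exceeds 64 bytes")
        if self.backpressure:
            return False  # sender keeps the message and retries later
        self.queue.append(msg)
        return True

    def pop(self) -> bytes:
        """Remove the oldest message, freeing one buffer slot."""
        return self.queue.popleft()


total_buffer_bytes = 2 * DEPTH * MSG_BYTES  # two interfaces
print(total_buffer_bytes)  # prints 2048
```

A sender would check the return value of `push` and hold the message until a slot frees up, which is how backpressure avoids overload and loss.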
