4,051 research outputs found
Performance Implications of NoCs on 3D-Stacked Memories: Insights from the Hybrid Memory Cube
Memories that exploit three-dimensional (3D)-stacking technology, which
integrate memory and logic dies in a single stack, are becoming popular. These
memories, such as Hybrid Memory Cube (HMC), utilize a network-on-chip (NoC)
design for connecting their internal structural organizations. This novel usage
of NoC, in addition to aiding processing-in-memory capabilities, enables
numerous benefits such as high bandwidth and memory-level parallelism. However,
the implications of NoCs on the characteristics of 3D-stacked memories in terms
of memory access latency and bandwidth have not been fully explored. This paper
addresses this knowledge gap by (i) characterizing an HMC prototype on the
AC-510 accelerator board and revealing its access latency behaviors, and (ii)
by investigating the implications of such behaviors on system and software
designs
NoC Resource Allocation Based on Physical Design Techniques
Networks-on-Chip (NoC) has been recognized as a scalable approach for on-chip
communication. Quality-of-Service (QoS) is a fundamental part of application specific
NoCs. This thesis focuses on resource allocation on NoC, to improve the capability of
NoC for Guaranteed Service (GS). A graph model is adopted to describe physical and
temporal sources of a NoC. Based on the graph model, an RRR-based algorithm is
proposed for simultaneous routing and time slot allocation. In addition, a negotiation-based
algorithm is suggested for achieving power-efficient QoS for application-specific
NoCs. Last, a hybrid NoC architecture, which combines circuit switching and packet
switching, is developed and investigated. Experimental results show that our techniques
outperform previous works
Low-overhead hard real-time aware interconnect network router
The increasing complexity of embedded systems is accelerating the use of multicore processors in these systems. This trend gives rise to new problems such as the sharing of on-chip network resources among hard real-time and normal best effort data traffic. We propose a network-on-chip router that provides predictable and deterministic communication latency for hard real-time data traffic while maintaining high concurrency and throughput for best-effort/general-purpose traffic with minimal hardware overhead. The proposed router requires less area than non-interfering networks, and provides better Quality of Service (QoS) in terms of predictability and determinism to hard real-time traffic than priority-based routers. We present a deadlock-free algorithm for decoupled routing of the two types of traffic. We compare the area and power estimates of three different router architectures with various QoS schemes using the IBM 45-nm SOI CMOS technology cell library. Performance evaluations are done using three realistic benchmark applications: a hybrid electric vehicle application, a utility grid connected photovoltaic converter system, and a variable speed induction motor drive application
- …