13 research outputs found
Medium access control in wireless network-on-chip: a context analysis
© 2018 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes,creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.Wireless on-chip communication is a promising candidate to address the performance and efficiency issues that arise when scaling current NoC techniques to manycore processors. A WNoC can serve global and broadcast traffic with ultra-low latency even in thousand-core chips, thus acting as a natural complement to conventional and throughput-oriented wireline NoCs. However, the development of MAC strategies needed to efficiently share the wireless medium among the increasing number of cores remains a considerable challenge given the singularities of the environment and the novelty of the research area. In this position article, we present a context analysis describing the physical constraints, performance objectives, and traffic characteristics of the on-chip communication paradigm. We summarize the main differences with respect to traditional wireless scenarios, and then discuss their implications on the design of MAC protocols for manycore WNoC, with the ultimate goal of kickstarting this arguably unexplored research area.Peer ReviewedPostprint (author's final draft
Systematic Physics-Compliant Analysis of Over-the-Air Channel Equalization in RIS-Parametrized Wireless Networks-on-Chip
Wireless networks-on-chip (WNoCs) are an enticing complementary interconnect
technology for multi-core chips but face severe resource constraints. Being
limited to simple on-off-keying modulation, the reverberant nature of the chip
enclosure imposes limits on allowed modulation speeds in sight of inter-symbol
interference, casting doubts on the competitiveness of WNoCs as interconnect
technology. Fortunately, this vexing problem was recently overcome by
parametrizing the on-chip radio environment with a reconfigurable intelligent
surface (RIS). By suitably configuring the RIS, selected channel impulse
responses (CIRs) can be tuned to be (almost) pulse-like despite rich scattering
thanks to judiciously tailored multi-bounce path interferences. However, the
exploration of this "over-the-air" (OTA) equalization is thwarted by (i) the
overwhelming complexity of the propagation environment, and (ii) the non-linear
dependence of the CIR on the RIS configuration, requiring a costly and lengthy
full-wave simulation for every optimization step. Here, we show that a
reduced-basis physics-compliant model for RIS-parametrized WNoCs can be
calibrated with a single full-wave simulation. Thereby, we unlock the
possibility of predicting the CIR for any RIS configuration almost
instantaneously without any additional full-wave simulation. We leverage this
new tool to systematically explore OTA equalization in RIS-parametrized WNoCs
regarding the optimal choice of delay time for the RIS-shaped CIR's peak. We
also study the simultaneous optimization of multiple on-chip wireless links for
broadcasting. Looking forward, the introduced tools will enable the efficient
exploration of various types of OTA analog computing in RIS-parametrized WNoCs.Comment: 10 pages, 7 figures, submitted to an IEEE Journa
WHYPE: A Scale-Out Architecture with Wireless Over-the-Air Majority for Scalable In-memory Hyperdimensional Computing
Hyperdimensional computing (HDC) is an emerging computing paradigm that
represents, manipulates, and communicates data using long random vectors known
as hypervectors. Among different hardware platforms capable of executing HDC
algorithms, in-memory computing (IMC) has shown promise as it is very efficient
in performing matrix-vector multiplications, which are common in the HDC
algebra. Although HDC architectures based on IMC already exist, how to scale
them remains a key challenge due to collective communication patterns that
these architectures required and that traditional chip-scale networks were not
designed for. To cope with this difficulty, we propose a scale-out HDC
architecture called WHYPE, which uses wireless in-package communication
technology to interconnect a large number of physically distributed IMC cores
that either encode hypervectors or perform multiple similarity searches in
parallel. In this context, the key enabler of WHYPE is the opportunistic use of
the wireless network as a medium for over-the-air computation. WHYPE implements
an optimized source coding that allows receivers to calculate the bit-wise
majority of multiple hypervectors (a useful operation in HDC) being transmitted
concurrently over the wireless channel. By doing so, we achieve a joint
broadcast distribution and computation with a performance and efficiency
unattainable with wired interconnects, which in turn enables massive
parallelization of the architecture. Through evaluations at the on-chip network
and complete architecture levels, we demonstrate that WHYPE can bundle and
distribute hypervectors faster and more efficiently than a hypothetical wired
implementation, and that it scales well to tens of receivers. We show that the
average error rate of the majority computation is low, such that it has
negligible impact on the accuracy of HDC classification tasks.Comment: Accepted at IEEE Journal on Emerging and Selected Topics in Circuits
and Systems (JETCAS). arXiv admin note: text overlap with arXiv:2205.1088
Assessment of data rates on the internal and external CPU interfaces and its applications for Wireless Network-on-Chip development
Nowadays central processing units (CPUs) are the major part of the personal computers, and usually their progress defines personal computers (PCs) progress. However, modern CPU architecture has a set of limitations mentioned in this thesis. As a result, new CPU architectures are now under development. Most prospective solution in this field are based on a proposed concept of Wireless Networks-on-Chip (WNoCs), where part of wired connections is changed into wireless links. However in order to design and develop this kind of system, information about data rates on the internal and external CPU interfaces of modern CPUs is needed. Main goals set in the beginning of working on this thesis were to get this data rates assessment and give an assessment of suitable wireless technologies for milticore CPUs with different number of cores.
In this thesis CPU evolution is described and peculiarities of modern CPU architectures are mentioned. Besides state-of-the-art overview for Wireless Networks-on-Chip is provided. Moreover, full methodology of measuring intra-CPU counters and getting data rates on cache bus between second and third level caches and third level cache and random access memory (RAM) controller bus are provided. Dependencies of data rates on interfaces of interest on the number of active CPU cores and CPU clock frequency are studied and provided in a form of plots. Also differences in the traffic for different types of CPU load are provided as bar diagrams. For testing we used several real-life tasks that are typical for CPUs and artificial tests which are represented as programs written in C programming language. In addition, extrapolation model for CPUs with bigger amount of cores is provided and assumption about suitable wireless technologies for different number of CPU cores is made
Time-Randomized Wormhole NoCs for Critical Applications
Wormhole-based NoCs (wNoCs) are widely accepted in high-performance domains as the most appropriate solution to interconnect an increasing number of cores in the chip. However, wNoCs suitability in the context of critical real-time applications has not been demonstrated yet.
In this article, in the context of probabilistic timing analysis (PTA), we propose a PTA-compatible wNoC design that provides tight time-composable contention bounds. The proposed wNoC design builds on PTA ability to reason in probabilistic terms about hardware events impacting execution time (e.g., wNoC contention), discarding those sequences of events occurring with a negligible low probability. This allows our wNoC design to deliver improved guaranteed performance w.r.t. conventional time-deterministic setups. Our results show that performance guarantees of applications running on top of probabilistic wNoC designs improve by 40% and 93% on average for 4 Ă— 4 and 6 Ă— 6 wNoC setups, respectively.The research leading to these results has received funding from the European Community's Seventh Framework Programme [FP7/2007-2013] under the PROXIMA Project (www.proxima-project.eu), grant agreement no 611085. This work has also been partially supported by the Spanish Ministry of Science and Innovation under grant TIN2015-65316-P and the HiPEAC Network of Excellence.
Mladen Slijepcevic is funded by the Obra Social FundaciĂłn la Caixa under grant
Doctorado \la Caixa" - Severo Ochoa. Carles Hernández is jointly funded by the Spanish Ministry of Economy and Competitiveness (MINECO) and FEDER funds through grant TIN2014-60404-JIN. Jaume Abella has been partially supported by the MINECO under Ramon y Cajal postdoctoral fellowship number
RYC-2013-14717.Peer ReviewedPostprint (author's final draft
Performance comparison of selected wired and wireless networks on chip architectures
In this paper we compare performance intra-core communications in network on chips.We consider two alternative architectures, wired and wireless. The wired on is based on a common bus (ring) with all the cores attached to it. We compare it to the mesh (point-to-point) architecture based on THz wireless links operating in 0.1-0.54 frequency band. Using reference latencies of inter-core communications in modern CPUs we perform an applicability assessment of considered schemes. As performance metrics of interest we consider both delay and capacity. Our results indicate that the latter architecture outperforms the former by a singificant margin. The proposed system can be realized implementing directional antennas at all cores and ensuring that cores are placed on a chip such that there is no interference between them
Computing and communications for the software-defined metamaterial paradigm: a context analysis
Metamaterials are artificial structures that have recently enabled the realization of novel electromagnetic components with engineered and even unnatural functionalities. Existing metamaterials are specifically designed for a single application working under preset conditions (e.g., electromagnetic cloaking for a fixed angle of incidence) and cannot be reused. Software-defined metamaterials (SDMs) are a much sought-after paradigm shift, exhibiting electromagnetic properties that can be reconfigured at runtime using a set of software primitives. To enable this new technology, SDMs require the integration of a network of controllers within the structure of the metamaterial, where each controller interacts locally and communicates globally to obtain the programmed behavior. The design approach for such controllers and the interconnection network, however, remains unclear due to the unique combination of constraints and requirements of the scenario. To bridge this gap, this paper aims to provide a context analysis from the computation and communication perspectives. Then, analogies are drawn between the SDM scenario and other applications both at the micro and nano scales, identifying possible candidates for the implementation of the controllers and the intra-SDM network. Finally, the main challenges of SDMs related to computing and communications are outlined.Peer ReviewedPostprint (published version
Engineer the channel and adapt to it: enabling wireless intra-chip communication
© 2020 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes,creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.The authors gratefully acknowledge support from the Spanish MINECO under grant PCIN-2015-012, from the EU’s H2020 FET-OPEN program under grants No. 736876 and No. 863337, and by the Catalan Institution for Research and Advanced Studies (ICREA).Peer ReviewedPostprint (author's final draft
Engineer the Channel and Adapt to it: Enabling Wireless Intra-Chip Communication
Ubiquitous multicore processors nowadays rely on an integrated
packet-switched network for cores to exchange and share data. The performance
of these intra-chip networks is a key determinant of the processor speed and,
at high core counts, becomes an important bottleneck due to scalability issues.
To address this, several works propose the use of mm-wave wireless
interconnects for intra-chip communication and demonstrate that, thanks to
their low-latency broadcast and system-level flexibility, this new paradigm
could break the scalability barriers of current multicore architectures.
However, these same works assume 10+ Gb/s speeds and efficiencies close to 1
pJ/bit without a proper understanding on the wireless intra-chip channel. This
paper first demonstrates that such assumptions do not hold in the context of
commercial chips by evaluating losses and dispersion in them. Then, we leverage
the system's monolithic nature to engineer the channel, this is, to optimize
its frequency response by carefully choosing the chip package dimensions.
Finally, we exploit the static nature of the channel to adapt to it, pushing
efficiency-speed limits with simple tweaks at the physical layer. Our methods
reduce the path loss and delay spread of a simulated commercial chip by 47 dB
and 7.3x, respectively, enabling intra-chip wireless communications over 10
Gb/s and only 3.1 dB away from the dispersion-free case.Comment: 12 pages, 10 figures. IEEE Transactions on Communications Journal,
202