Search CORE

1,920 research outputs found

An On-line BIST RAM Architecture with Self Repair Capabilities

Author: Benso Alfredo
Chiusano Silvia Anna
Di Natale Giorgio
Prinetto Paolo Ernesto
Publication venue: IEEE
Publication date: 01/01/2002
Field of study

The emerging field of self-repair computing is expected to have a major impact on deployable systems for space missions and defense applications, where high reliability, availability, and serviceability are needed. In this context, RAM (random access memories) are among the most critical components. This paper proposes a built-in self-repair (BISR) approach for RAM cores. The proposed design, introducing minimal and technology-dependent overheads, can detect and repair a wide range of memory faults including: stuck-at, coupling, and address faults. The test and repair capabilities are used on-line, and are completely transparent to the external user, who can use the memory without any change in the memory-access protocol. Using a fault-injection environment that can emulate the occurrence of faults inside the module, the effectiveness of the proposed architecture in terms of both fault detection and repairing capability was verified. Memories of various sizes have been considered to evaluate the area-overhead introduced by this proposed architectur

Crossref

PORTO@iris (Publications Open Repository TOrino - Politecnico di Torino)

PORTO Publications Open Repository TOrino

Bridging the Gap: FPGAs as Programmable Switches

Author: da Silva Jeferson Santiago
Langlois J. M. Pierre
Luinaud Thomas
Savaria Yvon
Stimpfling Thibaut
Publication venue
Publication date: 01/01/2020
Field of study

The emergence of P4, a domain specific language, coupled to PISA, a domain specific architecture, is revolutionizing the networking field. P4 allows to describe how packets are processed by a programmable data plane, spanning ASICs and CPUs, implementing PISA. Because the processing flexibility can be limited on ASICs, while the CPUs performance for networking tasks lag behind, recent works have proposed to implement PISA on FPGAs. However, little effort has been dedicated to analyze whether FPGAs are good candidates to implement PISA. In this work, we take a step back and evaluate the micro-architecture efficiency of various PISA blocks. We demonstrate, supported by a theoretical and experimental analysis, that the performance of a few PISA blocks is severely limited by the current FPGA architectures. Specifically, we show that match tables and programmable packet schedulers represent the main performance bottlenecks for FPGA-based programmable switches. Thus, we explore two avenues to alleviate these shortcomings. First, we identify network applications well tailored to current FPGAs. Second, to support a wider range of networking applications, we propose modifications to the FPGA architectures which can also be of interest out of the networking field.Comment: To be published in : IEEE International Conference on High Performance Switching and Routing 202

arXiv.org e-Print Archive

PolyPublie

Feature Study on a Programmable Network Traffic Classifier

Author: Guerra Perez Keissy
Scott-Hayward Sandra
Sezer Sakir
Yang Xin
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 24/04/2017
Field of study

Queen's University Belfast Research Portal

X-TIME: An in-memory engine for accelerating machine learning on tabular data with CAMs

Author: Bruel Pedro
Buonanno Luca
Faraboschi Paolo
Foltin Martin
Graves Catherine E.
Ignowski Jim
Moon John
Pedretti Giacomo
Roth Ron M.
Serebryakov Sergey
Xu Cong
Ziegler Tobias
Publication venue
Publication date: 05/04/2023
Field of study

Structured, or tabular, data is the most common format in data science. While deep learning models have proven formidable in learning from unstructured data such as images or speech, they are less accurate than simpler approaches when learning from tabular data. In contrast, modern tree-based Machine Learning (ML) models shine in extracting relevant information from structured data. An essential requirement in data science is to reduce model inference latency in cases where, for example, models are used in a closed loop with simulation to accelerate scientific discovery. However, the hardware acceleration community has mostly focused on deep neural networks and largely ignored other forms of machine learning. Previous work has described the use of an analog content addressable memory (CAM) component for efficiently mapping random forests. In this work, we focus on an overall analog-digital architecture implementing a novel increased precision analog CAM and a programmable network on chip allowing the inference of state-of-the-art tree-based ML models, such as XGBoost and CatBoost. Results evaluated in a single chip at 16nm technology show 119x lower latency at 9740x higher throughput compared with a state-of-the-art GPU, with a 19W peak power consumption

arXiv.org e-Print Archive

On using content addressable memory for packet classiﬁcation

Author: Spitznagel Edward W.
Taylor David E.
Publication venue: Washington University Open Scholarship
Publication date: 03/03/2005
Field of study

Packet switched networks such as the Internet require packet classiﬁcation at every hop in order to ap-ply services and security policies to trafﬁc ﬂows. The relentless increase in link speeds and trafﬁc volume imposes astringent constraints on packet classiﬁcation solutions. Ternary Content Addressable Memory (TCAM) devices are favored by most network component and equipment vendors due to the fast and de-terministic lookup performance afforded by their use of massive parallelism. While able to keep up with high speed links, TCAMs suffer from exorbitant power consumption, poor scalability to longer search keys and larger ﬁlter sets, and inefﬁcient support of multiple matches. The research community has responded with algorithms that seek to meet the lookup rate constraint with greater efﬁciency through the use of com-modity Random Access Memory (RAM) technology. The most promising algorithms efﬁciently achieve high lookup rates by leveraging the statistical structure of real ﬁlter sets. Due to their dependence on ﬁlter set characteristics, it is difﬁcult to provision processing and memory resources for implementations that support a wide variety of ﬁlter sets. We show how several algorithmic advances may be leveraged to im-prove the efﬁciency, scalability, incremental update and multiple match performance of CAM-based packet classiﬁcation techniques without degrading the lookup performance. Our approach, Label Encoded Content Addressable Memory (LECAM), represents a hybrid technique that utilizes decomposition, label encoding, and a novel Content Addressable Memory (CAM) architecture. By reducing the number of implementation parameters, LECAM provides a vehicle to carry several of the recent algorithmic advances into practice. We provide a thorough overview of CAM technologies and packet classiﬁcation algorithms, along with a detailed discussion of the scaling issues that arise with longer search keys and larger ﬁlter sets. We also provide a comparative analysis of LECAM and standard TCAM using a collection of real and synthetic ﬁlter sets of various sizes and compositions

Washington University St. Louis: Open Scholarship

P4-enabled Smart NIC:Enabling Sliceable and Service-Driven Optical Data Centres

Author: Farhadi Beldachi Arash
Nejabati Reza
Simeonidou Dimitra
Yan Yan
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/05/2020
Field of study

Explore Bristol Research