Search CORE

232 research outputs found

The FNL+MMA Instruction Cache Prefetcher

Author: Seznec André
Publication venue: HAL CCSD
Publication date: 30/05/2020
Field of study

International audienceWhen designing a prefetcher, the computer architect has to define which event should trigger a prefetch action and which blocks should be prefetched. We propose to trigger prefetch requests on I-Shadow cache misses. The I-Shadow cache is a small tag-only cache that monitors only demand misses. FNL+MMA combines two prefetchers that exploit two characteristics of the I-cache usage. In many cases, the next line is used by the application in the near future. But systematic next-line prefetching leads to overfetching and cache pollution. The Footprint Next Line prefetcher, FNL, overcomes this difficulty through predicting if the next line will be used in the "not so long" future. Prefetching up to 5 next lines, FNL achieves a 16.5% speed-up on the championship public traces. If no prefetching is used, the sequence of I-cache misses is partially predictable and in advance. That is, when block B is missing, the nth next miss after the miss on block B is often on the same block B (n). This property holds for relatively large n up to 30. The Multiple Miss Ahead prefetcher, MMA, leverages the property. We predict the nth next miss on the I-Shadow cache and predict if it might miss the overall I-cache. A 96KB FNL+MMA achieves a 28.7% speed-up and decreases the I-cache miss rate by 91.8%

HAL-CentraleSupelec

INRIA a CCSD electronic archive server

HAL Descartes

Hal-Diderot

HAL-Rennes 1

Scientific History of Incipit in the period 2010-2016

Author: CSIC - Instituto de Ciencias del Patrimonio (Incipit)
Publication venue: CSIC - Instituto de Ciencias del Patrimonio (Incipit)
Publication date: 01/02/2017
Field of study

Historial de la actividad científica y técnica del Instituto de Ciencias del Patrimonio (Incipit) del CSIC, basado en Santiago de Compostela, desde su fecha de creación (2010) hasta el año 2016. Se presentan la misión y las líneas de investigación del Incipit, centradas principalmente en el estudio de los procesos de patrimonialización y de valorización social del patrimonio cultural realizadas con una perspectiva transdisciplinar. Se relacionan las publicaciones, proyectos de investigación, actividades de ciencia pública, eventos de comunicación y productos de divulgación que su personal investigador ha producido a lo largo de estos años.General introduction to the Incipit. Presentation of the Research Line: Cultural Heritage Studies: Sub-Theme: Landscape Archaeology and Cultural Landscapes, Sub-theme: Heritagization Processes: Memory, Power and Ethnicity, Sub-theme: Socioeconomics of Cultural Heritage, Sub-theme: Archaeology of the Contemporary Past, Sub-theme: Material culture and formalization processes of cultural heritage. Scientific Contributions. Transfer of Knowledge. International Activities. Other Activities and Results. Scientific DisseminationN

Digital.CSIC

BOOM: Broadcast Optimizations for On-chip Meshes

Author: Beckmann Bradford M.
Krishna Tushar
Peh Li-Shiuan
Reinhardt Steven K.
Publication venue
Publication date: 14/03/2011
Field of study

Future many-core chips will require an on-chip network that can support broadcasts and multicasts at good power-performance. A vanilla on-chip network would send multiple unicast packets for each broadcast packet, resulting in latency, throughput and power overheads. Recent research in on-chip multicast support has proposed forking of broadcast/multicast packets within the network at the router buffers, but these techniques are far from ideal, since they increase buffer occupancy which lowers throughput, and packets incur delay and power penalties at each router. In this work, we analyze an ideal broadcast mesh; show the substantial gaps between state-of-the-art multicast NoCs and the ideal; then propose BOOM, which comprises a WHIRL routing protocol that ideally load balances broadcast traffic, a mXbar multicast crossbar circuit that enables multicast traversal at similar energy-delay as unicasts, and speculative bypassing of buffering for multicast flits. Together, they enable broadcast packets to approach the delay, energy, and throughput of the ideal fabric. Our simulations show BOOM realizing an average network latency that is 5% off ideal, attaining 96% of ideal throughput, with energy consumption that is 9% above ideal. Evaluations using synthetic traffic show BOOM achieving a latency reduction of 61%, throughput improvement of 63%, and buffer power reduction of 80% as compared to a baseline broadcast. Simulations with PARSEC benchmarks show BOOM reducing average request and network latency by 40% and 15% respectively

DSpace@MIT

Spotlight on Essex County: 2010 Summer

Author: Essex Free Press
Publication venue: Scholarship at UWindsor
Publication date: 01/01/2010
Field of study

Articles about Essex County on topics such as migrant workers, festivals, steamships, photography, fishing, wineries, birding, County Road 50, War of 1812.https://scholar.uwindsor.ca/swoda-windsor-region/1071/thumbnail.jp

Scholarship at UWindsor

State of the art baseband DSP platforms for Software Defined Radio: A survey

Author: Claudio Brunelli
Fabio Garzia
Heikki Berg
Jari Nurmi
Omer Anjum
Tapani Ahonen
Publication venue: Springer Nature
Publication date: 01/01/2011
Field of study

Software Defined Radio (SDR) is an innovative approach which is becoming a more and more promising technology for future mobile handsets. Several proposals in the field of embedded systems have been introduced by different universities and industries to support SDR applications. This article presents an overview of current platforms and analyzes the related architectural choices, the current issues in SDR, as well as potential future trends.Peer reviewe

Springer - Publisher Connector

Trepo - Institutional Repository of Tampere University

Adding Parallelism to Sequential Programs – a Combined Method

Author: Czejdo Denny Bogdan
Daszczuk Wiktor Bohdan
Grześkowiak Wojciech
Publication venue: Electronics and Telecommunications Committee
Publication date: 15/04/2024
Field of study

The article outlines a contemporary method for creating software for multi-processor computers. It describes the identification of parallelizable sequential code structures. Three structures were found and then carefully examined. The algorithms used to determine whether or not certain parts of code may be parallelized result from static analysis. The techniques demonstrate how, if possible, existing sequential structures might be transformed into parallel-running programs. A dynamic evaluation is also a part of our process, and it can be used to assess the efficiency of the parallel programs that are developed. As a tool for sequential programs, the algorithms have been implemented in C#. All proposed methods were discussed using a common benchmark

International Journal of Electronics and Telecommunications (Warsaw University of Technology)

2007-2011

Author: Xavier University - Cincinnati
Publication venue: 'Institute of Science Culture Exhibition and Design'
Publication date: 01/01/2011
Field of study

Xavier University

Proving the Absence of Microarchitectural Timing Channels

Author: Buckley Scott
Heiser Gernot
Klein Gerwin
Millar Curtis
Murray Toby
Sison Robert
Wistoff Nils
Publication venue
Publication date: 25/10/2023
Field of study

Microarchitectural timing channels are a major threat to computer security. A set of OS mechanisms called time protection was recently proposed as a principled way of preventing information leakage through such channels and prototyped in the seL4 microkernel. We formalise time protection and the underlying hardware mechanisms in a way that allows linking them to the information-flow proofs that showed the absence of storage channels in seL4.Comment: Scott Buckley and Robert Sison were joint lead author

arXiv.org e-Print Archive