4 research outputs found

    FPGA-accelerated group-by aggregation using synchronizing caches

    Recent trends in hardware have dramatically dropped the price of RAM and shifted focus from systems operating on disk-resident data to in-memory solutions. In this environment, high memory access latency, also known as the memory wall, becomes the biggest data processing bottleneck. Traditional CPU-based architectures address this problem with large cache hierarchies. However, algorithms with poor locality can limit the benefits of caching. Hardware multithreading, in turn, provides a generic solution that does not rely on algorithm-specific locality properties. In this paper we present an FPGA-accelerated implementation of in-memory group-by hash aggregation. Our design relies on hardware multithreading to efficiently mask long memory access latency by implementing a custom operation datapath on the FPGA. We propose using CAMs (Content Addressable Memories) as a mechanism for synchronization and local pre-aggregation. To the best of our knowledge, this is the first work that uses CAMs as a synchronizing cache. We evaluate aggregation throughput against state-of-the-art multithreaded software implementations and demonstrate that the FPGA-accelerated approach significantly outperforms them at large grouping-key cardinalities, yielding speedups of up to 10x.
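
    For reference, the sketch below is a minimal software analogue of the operation being accelerated: group-by hash aggregation (SUM per key) over key/value rows. The types, data, and the `aggregate` function name are illustrative assumptions; the paper's FPGA design replaces this probe-and-update loop with a multithreaded custom datapath and CAM-based pre-aggregation.

```cpp
#include <cstdint>
#include <iostream>
#include <unordered_map>
#include <utility>
#include <vector>

// Software analogue of group-by hash aggregation (SUM per grouping key).
// The probe + update of a large hash table is the random-access,
// memory-latency-bound step that the FPGA datapath is designed to hide.
std::unordered_map<uint64_t, uint64_t>
aggregate(const std::vector<std::pair<uint64_t, uint64_t>>& rows) {
    std::unordered_map<uint64_t, uint64_t> groups;
    for (const auto& [key, value] : rows) {
        groups[key] += value;  // probe hash table, update running sum
    }
    return groups;
}

int main() {
    // Tiny hypothetical input: (grouping key, value) pairs.
    std::vector<std::pair<uint64_t, uint64_t>> rows = {
        {1, 10}, {2, 5}, {1, 7}, {3, 2}, {2, 1}};
    for (const auto& [key, sum] : aggregate(rows))
        std::cout << "key " << key << " -> sum " << sum << '\n';
}
```

    As the abstract describes, a CAM acting as a synchronizing cache lets in-flight updates to the same key be combined locally before reaching memory, so concurrent hardware threads can issue these random accesses without conflicting on hot keys.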

    DRAM Aware Last-Level-Cache Policies for Multi-core Systems

    Two important parameters of a DRAM cache are the miss rate and the hit latency, as both strongly influence performance. This thesis investigates the latency and miss-rate trade-offs in designing a DRAM cache hierarchy. It proposes novel application-aware and DRAM-aware policies that simultaneously reduce the miss rate (by considering the cache access patterns of concurrently running applications) and the hit latency (by considering DRAM characteristics).
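
    A rough way to see why both parameters matter is the standard average-memory-access-time (AMAT) model, AMAT = hit latency + miss rate x miss penalty. The sketch below evaluates it for a few miss rates; all latency figures are hypothetical placeholders, not results from the thesis.

```cpp
#include <iostream>

// AMAT model: illustrates how DRAM-cache miss rate and hit latency
// jointly determine effective access time. Numbers are hypothetical.
int main() {
    double hit_latency_ns  = 50.0;   // assumed DRAM-cache hit latency
    double miss_penalty_ns = 150.0;  // assumed off-chip memory access on a miss
    for (double miss_rate : {0.1, 0.3, 0.5}) {
        double amat = hit_latency_ns + miss_rate * miss_penalty_ns;
        std::cout << "miss rate " << miss_rate
                  << " -> AMAT " << amat << " ns\n";
    }
}
```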