Hardware-only stream prediction + cache prefetching + dynamic access ordering

Mckee, Sally A.; Zhang, Chengqiang

Hardware-only stream prediction + cache prefetching + dynamic access ordering

Authors: Sally A. Mckee
Chengqiang Zhang
Publication date: 1 January 1999
Publisher: University of Utah

Abstract

Journal ArticleThe speed gap between processors and memory system is becoming the performance bottleneck for many applications, and computations with strided access patterns are among those that suffer most. The vectors used in such applications lack temporal and often spatial locality, and are usually too large to cache. In spite of their poor cache behavior, these access patterns have the advantage of being, predictable, which can be exploited to improve the efficiency of the memory subsystem. As a promising technique to relieve memory system bottleneck, prefetching has been studied in its various forms, and so is dynamic memory scheduling. This study builds on these results, combining a stride-based reference prediction table, a mechanism that prefetches L2 cache lines, and a memory controller that dynamically schedules accesses to a Direct Rambus memory subsystem. We find that such a system delivers impressive speedups for scientific applications with regular access patterns (reducing execution time by almost a factor of two) without negatively affecting the performance of non-streaming programs

Similar works

Full text

Open in the Core reader

Download PDF

Available Versions

The University of Utah: J. Willard Marriott Digital Library

oai:collections.lib.utah.edu:i...

Last time updated on 01/01/2020