Search CORE

49 research outputs found

Exploiting Inter- and Intra-Memory Asymmetries for Data Mapping in Hybrid Tiered-Memories

Author: Antognetti P.
Arafa M.
Arjomand M.
Bhattacharyya A.
Blagodurov S.
Cao Y.
Chang Y.-M.
Cho B.-H.
Das A.
Das A.
Dray C.
Goda A.
Huang Y.
Jayasena N. S.
Kang U.
Kim Y.
Lee D.
Mallik A.
Mutlu O.
Mutlu O.
Pourshirazi B.
Qureshi M. K.
Qureshi M. K.
Redaelli A.
Rixner S.
Sandhu B. S.
Seong N. H.
Seshadri V.
Srinivasan J.
Stuecheli J.
Yoon H.
Yue J.
Zhang L.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 10/05/2020
Field of study

Modern computing systems are embracing hybrid memory comprising of DRAM and non-volatile memory (NVM) to combine the best properties of both memory technologies, achieving low latency, high reliability, and high density. A prominent characteristic of DRAM-NVM hybrid memory is that it has NVM access latency much higher than DRAM access latency. We call this inter-memory asymmetry. We observe that parasitic components on a long bitline are a major source of high latency in both DRAM and NVM, and a significant factor contributing to high-voltage operations in NVM, which impact their reliability. We propose an architectural change, where each long bitline in DRAM and NVM is split into two segments by an isolation transistor. One segment can be accessed with lower latency and operating voltage than the other. By introducing tiers, we enable non-uniform accesses within each memory type (which we call intra-memory asymmetry), leading to performance and reliability trade-offs in DRAM-NVM hybrid memory. We extend existing NVM-DRAM OS in three ways. First, we exploit both inter- and intra-memory asymmetries to allocate and migrate memory pages between the tiers in DRAM and NVM. Second, we improve the OS's page allocation decisions by predicting the access intensity of a newly-referenced memory page in a program and placing it to a matching tier during its initial allocation. This minimizes page migrations during program execution, lowering the performance overhead. Third, we propose a solution to migrate pages between the tiers of the same memory without transferring data over the memory channel, minimizing channel occupancy and improving performance. Our overall approach, which we call MNEME, to enable and exploit asymmetries in DRAM-NVM hybrid tiered memory improves both performance and reliability for both single-core and multi-programmed workloads.Comment: 15 pages, 29 figures, accepted at ACM SIGPLAN International Symposium on Memory Managemen

arXiv.org e-Print Archive

Crossref

Modelling of soft ground consolidation via combined surcharge and vacuum preloading

Author: Bo M.W.
Chu J.
Chu J.
Indraratna B.
Indraratna B.
It
Kjellman W.
Long R.P.
Qian J.H.
Rixner J.J.
Rujikiatkamjorn C.
Rujikiatkamjorn C.
Publication venue: 'Informa UK Limited'
Publication date
Field of study

Crossref

Aging-Aware Request Scheduling for Non-Volatile Main Memory

Author: Arjomand M.
Balaji A.
Balaji A.
Balaji A.
Bolchini C.
Bucek J.
Burr G. W.
Chandrasekar K.
Das A.
Das A.
Das A.
Das A.
Das A.
Das A.
Das A.
Das A.
Das A.
David H.
Gao R.
Hassan H.
Hisamoto D.
Huang L.
Jiang L.
K.
Khan S.
Kim J.
Kim J. S.
Kim J. S.
Kim J. S.
Kim Y.
Kim Y.
Kim Y.
Kim Y.
Kraak D.
Kültürsay E.
Lalam A.
Lee B.
Lee B.
Lee B.
Lee D.
Lee D.
Lu Y.
Mallik A.
Mandelman J. A.
Meza J.
Meza J.
Meza J.
Meza J.
Mutlu O.
Mutlu O.
Mutlu O.
Mutlu O.
Nesbit K. J.
Patel M.
Pelley S.
Poremba M.
Qureshi M. K.
Qureshi M. K.
Qureshi M. K.
Qureshi M. K.
Rixner S.
Sadasivam S. K.
Seshadri V.
Song S.
Song S.
Song S.
Song S.
Song S.
Song S.
Srinivasan J.
Subramanian L.
Subramanian L.
Titirsha T.
Titirsha T.
Usui H.
Wong H.-S. P.
Xia F.
Xiong F.
Yavits L.
Yilmaz C.
Yoon H.
Yoon H.
Zhang J.
Zhao J.
Zuravleff W. K.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 30/11/2020
Field of study

Modern computing systems are embracing non-volatile memory (NVM) to implement high-capacity and low-cost main memory. Elevated operating voltages of NVM accelerate the aging of CMOS transistors in the peripheral circuitry of each memory bank. Aggressive device scaling increases power density and temperature, which further accelerates aging, challenging the reliable operation of NVM-based main memory. We propose HEBE, an architectural technique to mitigate the circuit aging-related problems of NVM-based main memory. HEBE is built on three contributions. First, we propose a new analytical model that can dynamically track the aging in the peripheral circuitry of each memory bank based on the bank's utilization. Second, we develop an intelligent memory request scheduler that exploits this aging model at run time to de-stress the peripheral circuitry of a memory bank only when its aging exceeds a critical threshold. Third, we introduce an isolation transistor to decouple parts of a peripheral circuit operating at different voltages, allowing the decoupled logic blocks to undergo long-latency de-stress operations independently and off the critical path of memory read and write accesses, improving performance. We evaluate HEBE with workloads from the SPEC CPU2017 Benchmark suite. Our results show that HEBE significantly improves both performance and lifetime of NVM-based main memory.Comment: To appear in ASP-DAC 202

arXiv.org e-Print Archive

Crossref

Cluster assignment for high-performance embedded VLIW processors

Author: Akturan C.
Capitanio A.
Colwell R.
Dixit K.
Ebcioğlu K.
Faraboschi P.
Faraboschi P.
Fernandes M. M.
Fritts J.
Gustavo A. De Veciana
Hanno S.
Kailas K.
Lee C.
Leupers R.
Margarida F. Jacome
Mattson P.
Nystrom E.
Paulin P. G.
Rau B. R.
Rixner S.
Sánchez J.
Viktor S. Lapinskii
Özer E.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date
Field of study

Crossref

Network Virtualization: Breaking the Performance Barrier

Author: Scot Rixner
Sugerman J.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date
Field of study

Crossref

Memory Access Scheduling

Author: Dally William J.
Dally William J.
Kapasi Ujval J.
Kapasi Ujval J.
Mattson Peter
Mattson Peter
Owens John D.
Owens John D.
Rixner Scott
Rixner Scott
Publication venue
Publication date: 28/03/2002
Field of study

Conference PaperThe bandwidth and latency of a memory system are strongly dependent on the manner in which accesses interact with the "3-D" structure of banks, rows, and columns characteristic of contemporary DRAM chips. There is nearly an order of magnitude difference in bandwidth between successive references to different columns within a row and different rows within a bank. This paper introduces memory access scheduling, a technique that improves the performance of a memory system by reordering memory references to exploit locality within the 3-D memory structure. Conservative reordering, in which the first ready reference in a sequence is performed, improves bandwidth by 40% for traces from five media benchmarks. Aggressive reordering, in which operations are scheduled to optimize memory bandwidth, improves bandwidth by 93% for the same set of applications. Memory access scheduling is particularly important for media processors where it enables the processor to make the most efficient use of scarce memory bandwidth

DSpace at Rice University

Memory Access Scheduling

Author: John D. Owens
Peter Mattson
Scott Rixner
Ujval J. Kapasi
William J. Dally
Publication venue
Publication date: 01/01/2000
Field of study

The bandwidth and latency of a memory system are strongly dependent on the manner in which accesses interact with the "3-D" structure of banks, rows, and columns characteristic of contemporary DRAM chips. There is nearly an order of magnitude difference in bandwidth between successive references to different columns within a row and different rows within a bank. This paper introduces memory access scheduling, a technique that improves the performance of a memory system by reordering memory references to exploit locality within the 3-D memory structure. Conservative reordering, in which the first ready reference in a sequence is performed, improves bandwidth by 40% for traces from five media benchmarks. Aggressive reordering, in which operations are scheduled to optimize memory bandwidth, improves bandwidth by 93% for the same set of applications. Memory access scheduling is particularly important for media processors where it enables the processor to make the most efficient use of scar..

CiteSeerX

Communication scheduling

Author: Capitanio A.
Ellis J.
Fernandes M.
Grossman J.
John D. Owens
Nystrom E.
Ozer E.
Peter Mattson
Rau B.
Rixner S.
Scott Rixner
Ujval J. Kapasi
William J. Dally
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date
Field of study

Crossref