91 research outputs found

    Transparent pointer compression for linked data structures

    64-bit address spaces are increasingly important for modern applications, but they come at a price: pointers use twice as much memory, reducing the effective cache capacity and memory bandwidth of the system (compared to 32-bit address spaces). This paper presents a sophisticated, automatic transformation that shrinks pointers from 64 bits to 32 bits. The approach is “macroscopic,” i.e., it operates on an entire logical data structure in the program at a time. It allows an individual data structure instance, or even a subset thereof, to grow up to 2^32 bytes in size, and can compress pointers to some data structures but not others. Together, these properties allow efficient usage of a large (64-bit) address space. We also describe (but have not implemented) a dynamic version of the technique that can transparently expand the pointers in an individual data structure if it exceeds the 4GB limit. For a collection of pointer-intensive benchmarks, we show that the transformation reduces peak heap sizes substantially (by 20% to 2x) for several of these benchmarks and improves overall performance significantly in some cases.
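
    For illustration, the C++ sketch below shows the general idea behind index-based pointer compression, hand-written rather than compiler-generated: nodes of one linked structure live in a dedicated pool, and links are stored as 32-bit offsets into that pool instead of 64-bit pointers. The names (Pool, NodeRef, kNull) are illustrative assumptions, not taken from the paper.

```cpp
// Hedged sketch: hand-written index-based pointer compression for a singly
// linked list. The paper's transformation derives an equivalent layout
// automatically; Pool, NodeRef, and kNull are illustrative names only.
#include <cstdint>
#include <iostream>
#include <vector>

using NodeRef = std::uint32_t;              // 32-bit "pointer" into the pool
constexpr NodeRef kNull = 0xFFFFFFFFu;      // sentinel for a null link

struct Node {
    int     value;
    NodeRef next;                           // 4 bytes instead of an 8-byte Node*
};

class Pool {
public:
    NodeRef alloc(int value, NodeRef next) {    // all nodes of one list share this pool
        nodes_.push_back({value, next});
        return static_cast<NodeRef>(nodes_.size() - 1);
    }
    Node& at(NodeRef r) { return nodes_[r]; }
private:
    std::vector<Node> nodes_;               // contiguous storage, at most 2^32 nodes
};

int main() {
    Pool pool;
    NodeRef head = kNull;
    for (int i = 3; i >= 1; --i)            // build the list 1 -> 2 -> 3
        head = pool.alloc(i, head);
    for (NodeRef r = head; r != kNull; r = pool.at(r).next)
        std::cout << pool.at(r).value << ' ';
    std::cout << '\n';
    return 0;
}
```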

    Semi-Supervised Object Detection in the Open World

    Existing approaches for semi-supervised object detection assume a fixed set of classes present in the training and unlabeled datasets, i.e., in-distribution (ID) data. The performance of these techniques degrades significantly when they are deployed in the open world, because the unlabeled and test data may contain objects that were not seen during training, i.e., out-of-distribution (OOD) data. The two key questions that we explore in this paper are: can we detect these OOD samples and, if so, can we learn from them? With these considerations in mind, we propose the Open World Semi-supervised Detection framework (OWSSD), which effectively detects OOD data along with a semi-supervised learning pipeline that learns from both ID and OOD data. We introduce an ensemble-based OOD detector consisting of lightweight auto-encoder networks trained only on ID data. Through extensive evaluation, we demonstrate that our method performs competitively against state-of-the-art OOD detection algorithms and also significantly boosts semi-supervised learning performance in open-world scenarios.
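
    The abstract does not spell out the ensemble's decision rule; the C++ sketch below assumes a simple scheme in which each auto-encoder reports a reconstruction error, per-model thresholds are calibrated on ID data, and a proposal is flagged OOD when a majority of models exceed their thresholds. The actual OWSSD scoring rule may differ.

```cpp
// Hedged sketch of an ensemble OOD decision rule. Assumption: each
// auto-encoder in the ensemble reconstructs ID data well (low error), so a
// proposal is voted OOD by a model when its reconstruction error exceeds a
// threshold calibrated on ID data; a majority of votes flags it as OOD.
#include <cstddef>
#include <iostream>
#include <vector>

bool isOutOfDistribution(const std::vector<double>& reconstructionErrors,
                         const std::vector<double>& thresholds) {
    std::size_t votes = 0;
    for (std::size_t i = 0; i < reconstructionErrors.size(); ++i)
        if (reconstructionErrors[i] > thresholds[i])   // poor reconstruction -> OOD vote
            ++votes;
    return votes * 2 > reconstructionErrors.size();    // simple majority
}

int main() {
    // Errors from three auto-encoders for one region proposal (illustrative numbers).
    std::vector<double> errors     {0.91, 0.72, 0.15};
    std::vector<double> thresholds {0.50, 0.60, 0.55};
    std::cout << (isOutOfDistribution(errors, thresholds) ? "OOD" : "ID") << '\n';
    return 0;
}
```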

    Technical Report: Region and Effect Inference for Safe Parallelism

    In this paper, we present the first full regions-and-effects inference algorithm for explicitly parallel fork-join programs. We infer annotations inspired by Deterministic Parallel Java (DPJ) for a type-safe subset of C++. We chose the DPJ annotations because they give the strongest safety guarantees of any existing concurrency-checking approach we know of, static or dynamic, and DPJ is also the most expressive static checking system we know of that gives strong safety guarantees. This expressiveness, however, makes manual annotation difficult and tedious, which motivates the need for automatic inference, but it also makes the inference problem very challenging: the code may use region polymorphism, imperative updates with complex aliasing, arbitrary recursion, hierarchical region specifications, and wildcard elements to describe potentially infinite sets of regions. We express the inference as a constraint satisfaction problem and develop, implement, and evaluate an algorithm for solving it. The region and effect annotations inferred by the algorithm constitute a checkable proof of safe parallelism, and they can be recorded both for documentation and for fast and modular safety checking.
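
    As a hedged illustration of the kind of code the inference targets, the C++ sketch below shows a fork-join tree update with DPJ-style region and effect annotations written as comments; the tool's concrete annotation syntax for its C++ subset may differ. The two recursive calls write disjoint regions, which is exactly the property the inferred annotations let a checker verify.

```cpp
// Hedged sketch: fork-join code with DPJ-style region/effect annotations
// shown as comments (illustrative, not the paper's concrete syntax). The
// parallel calls write disjoint regions (Left and Right under each node),
// so they do not interfere.
#include <iostream>
#include <thread>

struct Tree {                       // region parameter: Tree<region R>
    int   value;                    // value  in R
    Tree* left  = nullptr;          // *left  in R::Left
    Tree* right = nullptr;          // *right in R::Right
};

// effect: writes R::* (the whole subtree rooted in region R)
void increment(Tree* t) {
    if (!t) return;
    ++t->value;                                   // writes R
    std::thread lhs(increment, t->left);          // writes R::Left::*
    increment(t->right);                          // writes R::Right::* (disjoint)
    lhs.join();
}

int main() {
    Tree l{1}, r{2}, root{0, &l, &r};
    increment(&root);
    std::cout << root.value << ' ' << l.value << ' ' << r.value << '\n';  // prints: 1 2 3
    return 0;
}
```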

    The influence of random delays on parallel execution times

    Automatic Pool Allocation: Compile-Time Control of Data Structure Layout in the Heap

    Despite the potential importance of data structure layouts and traversal patterns, compiler transformations on pointer-intensive programs are performed primarily using pointer analysis, not by controlling and using information about the layout of high-level data structures. This paper describes a compiler transformation called Automatic Pool Allocation that segregates instances of “logical” data structures in the heap into distinct pools, and allows different heuristics to be used to partially control the internal layout of those data structures. Because these are rigorous transformations, their results, combined with pointer analysis information, can be used to perform further compiler analyses and transformations, and we briefly list a few examples. Automatic Pool Allocation also provides several direct performance benefits for pointer-intensive programs, most importantly that traversals of a logical data structure allocated to a separate pool can have better spatial locality and smaller working sets. We evaluate the performance and cache behavior of code transformed by Automatic Pool Allocation on a series of heap-intensive and general-purpose benchmarks, and find that it speeds up several C programs by 10-40% or more, and does not hurt (or help) other programs.
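
    The C++ sketch below is a hand-written analogue of what the transformation arranges automatically: nodes of one logical list are carved out of a contiguous per-structure pool rather than interleaved general-purpose allocations, so a traversal touches consecutive memory. The NodePool class and its fixed capacity are illustrative assumptions, not the paper's runtime interface.

```cpp
// Hedged sketch: a per-data-structure pool, written by hand to mimic the
// layout Automatic Pool Allocation produces. All nodes of one logical list
// are bump-allocated from one contiguous chunk, improving spatial locality
// for traversals. NodePool and the capacity limit are illustrative only.
#include <cstddef>
#include <iostream>
#include <vector>

struct Node {
    int   value;
    Node* next;
};

class NodePool {                          // one pool per logical data structure
public:
    explicit NodePool(std::size_t capacity) { chunk_.reserve(capacity); }
    Node* alloc(int value, Node* next) {
        chunk_.push_back({value, next});  // bump allocation, contiguous layout
        return &chunk_.back();            // stable as long as capacity is not exceeded
    }
private:
    std::vector<Node> chunk_;
};

int main() {
    NodePool pool(1000);
    Node* head = nullptr;
    for (int i = 999; i >= 0; --i)
        head = pool.alloc(i, head);       // all nodes contiguous in the pool
    long sum = 0;
    for (Node* n = head; n; n = n->next)  // cache-friendly traversal
        sum += n->value;
    std::cout << sum << '\n';             // prints: 499500
    return 0;
}
```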

    Parallel Programming Must Be Deterministic By Default

    We examine the problem of providing a parallel programming model that guarantees deterministic semantics. We propose a research agenda focusing on the following questions: (1) how to guarantee determinism in a modern object-oriented language; (2) how to provide sound guarantees when parts of the program either cannot be proved deterministic or have "harmless" nondeterminism; (3) how to specify explicit nondeterminism when needed; and (4) how to make it easier to port programs to the language.

    An Empirical Study of Reported Bugs in Server Software with Implications for Automated Bug Diagnosis

    Reproducing bug symptoms is a prerequisite for performing automatic bug diagnosis. Do bugs have characteristics that ease or hinder automatic bug diagnosis? In this paper, we conduct a thorough empirical study of several key characteristics of bugs that affect reproducibility at the production site. We examine randomly selected bug reports of six server applications and consider their implications for automatic bug diagnosis tools. Our results are promising. From the study, we find that nearly 82% of bug symptoms can be reproduced deterministically by re-running with the same set of inputs at the production site. We further find that very few input requests are needed to reproduce most failures; in fact, just one input request after session establishment suffices to reproduce the failure in nearly 77% of the cases. We describe the implications of the results on reproducing software failures and designing automated diagnosis tools for production runs.