Search CORE

3,794 research outputs found

SmartTrack: Efficient Predictive Race Detection

Author: Biswas Swarnendu
Biswas Swarnendu
Blackshear Sam
Boehm J.
Boehm J.
Bond Michael D.
Cao Man
Flanagan Cormac
Flanagan Cormac
Flanagan Cormac
Flanagan Cormac
Genç Kaan
Gorogiannis Nikos
Huang Jeff
Huang Jeff
Huang Shiyou
Kasikci Baris
Liu Peng
Luo Peng
Manson Jeremy
Mattern Friedemann
Pozniansky Eli
Roemer Jake
Roemer Jake
Roemer Jake
Segulja Cedomir
von Praun Christoph
Wood Benjamin P.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 08/04/2020
Field of study

Widely used data race detectors, including the state-of-the-art FastTrack algorithm, incur performance costs that are acceptable for regular in-house testing, but miss races detectable from the analyzed execution. Predictive analyses detect more data races in an analyzed execution than FastTrack detects, but at significantly higher performance cost. This paper presents SmartTrack, an algorithm that optimizes predictive race detection analyses, including two analyses from prior work and a new analysis introduced in this paper. SmartTrack's algorithm incorporates two main optimizations: (1) epoch and ownership optimizations from prior work, applied to predictive analysis for the first time; and (2) novel conflicting critical section optimizations introduced by this paper. Our evaluation shows that SmartTrack achieves performance competitive with FastTrack-a qualitative improvement in the state of the art for data race detection.Comment: Extended arXiv version of PLDI 2020 paper (adds Appendices A-E) #228 SmartTrack: Efficient Predictive Race Detectio

arXiv.org e-Print Archive

Crossref

Dynamic Race Prediction in Linear Time

Author: Farchi E.
Mattern F.
Musuvathi M.
Serbănu¸tă T. F.
Smith L. A.
Surendran R.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 16/04/2017
Field of study

Writing reliable concurrent software remains a huge challenge for today's programmers. Programmers rarely reason about their code by explicitly considering different possible inter-leavings of its execution. We consider the problem of detecting data races from individual executions in a sound manner. The classical approach to solving this problem has been to use Lamport's happens-before (HB) relation. Until now HB remains the only approach that runs in linear time. Previous efforts in improving over HB such as causally-precedes (CP) and maximal causal models fall short due to the fact that they are not implementable efficiently and hence have to compromise on their race detecting ability by limiting their techniques to bounded sized fragments of the execution. We present a new relation weak-causally-precedes (WCP) that is provably better than CP in terms of being able to detect more races, while still remaining sound. Moreover it admits a linear time algorithm which works on the entire execution without having to fragment it.Comment: 22 pages, 8 figures, 1 algorithm, 1 tabl

arXiv.org e-Print Archive

Crossref

OPR

Author: Hargrove Paul
Iancu Costin
Qian Xuehai
Sen Koushik
Publication venue: eScholarship, University of California
Publication date: 09/11/2016
Field of study

The ability to reproduce a parallel execution is desirable for debugging and program reliability purposes. In debugging (13), the programmer needs to manually step back in time, while for resilience (6) this is automatically performed by the the application upon failure. To be useful, replay has to faithfully reproduce the original execution. For parallel programs the main challenge is inferring and maintaining the order of conflicting operations (data races). Deterministic record and replay (R&R) techniques have been developed for multithreaded shared memory programs (5), as well as distributed memory programs (14). Our main interest is techniques for large scale scientific (3; 4) programming models

Crossref

eScholarship - University of California

Space Efficient Breadth-First and Level Traversals of Consistent Global States of Parallel Programs

Author: B Ganter
G Pruesse
G Steiner
KM Chandy
L Bianco
L Lamport
L Lamport
M Chein
M Habib
MM Sysło
S Alagar
S Alagar
T Ball
VK Garg
Publication venue
Publication date: 24/07/2017
Field of study

Enumerating consistent global states of a computation is a fundamental problem in parallel computing with applications to debug- ging, testing and runtime verification of parallel programs. Breadth-first search (BFS) enumeration is especially useful for these applications as it finds an erroneous consistent global state with the least number of events possible. The total number of executed events in a global state is called its rank. BFS also allows enumeration of all global states of a given rank or within a range of ranks. If a computation on n processes has m events per process on average, then the traditional BFS (Cooper-Marzullo and its variants) requires

\mathcal{O}(\frac{m^{n-1}}{n})

space in the worst case, whereas ou r algorithm performs the BFS requires

\mathcal{O}(m^2n^2)

space. Thus, we reduce the space complexity for BFS enumeration of consistent global states exponentially. and give the first polynomial space algorithm for this task. In our experimental evaluation of seven benchmarks, traditional BFS fails in many cases by exhausting the 2 GB heap space allowed to the JVM. In contrast, our implementation uses less than 60 MB memory and is also faster in many cases

arXiv.org e-Print Archive

Crossref

Dynamic Analysis of Embedded Software

Author
Publication venue
Publication date: 01/01/2015
Field of study

abstract: Most embedded applications are constructed with multiple threads to handle concurrent events. For optimization and debugging of the programs, dynamic program analysis is widely used to collect execution information while the program is running. Unfortunately, the non-deterministic behavior of multithreaded embedded software makes the dynamic analysis difficult. In addition, instrumentation overhead for gathering execution information may change the execution of a program, and lead to distorted analysis results, i.e., probe effect. This thesis presents a framework that tackles the non-determinism and probe effect incurred in dynamic analysis of embedded software. The thesis largely consists of three parts. First of all, we discusses a deterministic replay framework to provide reproducible execution. Once a program execution is recorded, software instrumentation can be safely applied during replay without probe effect. Second, a discussion of probe effect is presented and a simulation-based analysis is proposed to detect execution changes of a program caused by instrumentation overhead. The simulation-based analysis examines if the recording instrumentation changes the original program execution. Lastly, the thesis discusses data race detection algorithms that help to remove data races for correctness of the replay and the simulation-based analysis. The focus is to make the detection efficient for C/C++ programs, and to increase scalability of the detection on multi-core machines.Dissertation/ThesisDoctoral Dissertation Computer Science 201

ASU Digital Repository

Fast casual multicast

Author: Birman Kenneth
Schiper Andre
Stephenson Pat
Publication venue
Publication date
Field of study

A new protocol is presented that efficiently implements a reliable, causally ordered multicast primitive and is easily extended into a totally ordered one. Intended for use in the ISIS toolkit, it offers a way to bypass the most costly aspects of ISIS while benefiting from virtual synchrony. The facility scales with bounded overhead. Measured speedups of more than an order of magnitude were obtained when the protocol was implemented within ISIS. One conclusion is that systems such as ISIS can achieve performance competitive with the best existing multicast facilities--a finding contradicting the widespread concern that fault-tolerance may be unacceptably costly

NASA Technical Reports Server