Search CORE

6 research outputs found

ThreadScan: Automatic and Scalable Memory Reclamation

Author: Alistarh Dan
Leiserson William Mitchell
Matveev Alexander
Shavit Nir N.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/06/2015
Field of study

The concurrent memory reclamation problem is that of devising a way for a deallocating thread to verify that no other concurrent threads hold references to a memory block being deallocated. To date, in the absence of automatic garbage collection, there is no satisfactory solution to this problem. Existing tracking methods like hazard pointers, reference counters, or epoch-based techniques like RCU, are either prohibitively expensive or require significant programming expertise, to the extent that implementing them efficiently can be worthy of a publication. None of the existing techniques are automatic or even semi-automated. In this paper, we take a new approach to concurrent memory reclamation: instead of manually tracking access to memory locations as done in techniques like hazard pointers, or restricting shared accesses to specific epoch boundaries as in RCU, our algorithm, called ThreadScan, leverages operating system signaling to automatically detect which memory locations are being accessed by concurrent threads. Initial empirical evidence shows that ThreadScan scales surprisingly well and requires negligible programming effort beyond the standard use of Malloc and Free

DSpace@MIT

Crossref

Between Convergence and Exceptionalism: Americans and the British Model of Labor Relations, c. 1867–1920

Author: Akin William E
Babson Roger W
Bayles James C
Blanshard Paul
Blewett Mary H
Boller Paul F
Burn D. L
Chang Ducksoo
Cohen Julius H
Cohen Julius H
Commons John R
Commons John R
Denning Arthur Du Pre
Derber Milton
Dibblee G. Binney
Durand E. Dana
Ernst Daniel R
Felt Dorr E
Fox Alan
Friedman Gerald
Furner Mary O
Furner Mary O
Gannett Frank E
Gerber Larry G
Gilbert James
Gilson Mary B
Going Charles B
Gray Howard L
Haydu Jeffrey
Haydu Jeffrey
Heindel Richard H
Hewitt Abram S
Hewitt Abram S
Hewitt Abram S
Hewitt Abram S
Hewitt Abram S
Hewitt Abram S
Hoagland Henry E
Hodgson James G
Hogue Richard W
Howell Chris
Howell Harris
Hunt Alfred E
Jeans J. Stephen
Jefferys James B
Kaufman Bruce
Kelley Robert
Kirk Neville
Knoeppel Charles E
Lambert Josiah B
Laslett John H M
Laughlin J. Laurence
Leiby James
Leiserson William M
Low A. Maurice
Low A. Maurice
Lyddon William G
Mackenzie F. A
Manly Basil M
Martellone Anna Maria
McCormick Cyrus
Meine Franklyn
Merchants' Association of New York
Merritt Walter G
Mitchell Charlotte
Montgomery David
Myers Charles S
National Association of Employment Managers
NCF
NCF Commission on Foreign Inquiry
Nearing Scott
Nearing Scott
Nevins Allan
NICB
NICB
Perlman Mark
Perlman Selig
Phelps Brown Henry
Pratt Edwin A
Rapson Richard L
Rice Herbert H
Robertson David Brian
Rockefeller John D
Rodgers Daniel T
Russett Bruce M
Searle G. R
Selekman Benjamin M
Shadwell Arthur
Spain Jonathan
Stead William T
Strout Cushing
Taylor Benjamin
Towne Henry R
US Commission on Industrial Relations in Great Britain and Sweden
US Commissioner of Labor
US Congress Senate
US Department of Labor
US Department of Labor Information and Education Service
US Industrial Commission
Voss Kim
Weeks Joseph D
Weeks Joseph D
Weeks Joseph D
Wells Herbert G
Witte Edwin E
Wright Carroll D
Wuest Robert
Wunderlin Clarence E
Yearley Clifton K
Publication venue: 'Informa UK Limited'
Publication date
Field of study

Crossref

Defining scalable high performance programming with DEF

Author: Leiserson William Mitchell.
Publication venue: Massachusetts Institute of Technology
Publication date: 01/01/2020
Field of study

Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Science, 2020Cataloged from PDF of thesis.Includes bibliographical references (pages 149-156).Performance engineering is performed in languages that are close to the machine, especially C and C++, but these languages have little native support for concurrency. We're deep into the multicore era of computer hardware, however, meaning that scalability is dependent upon concurrent data structures. Contrast this with modern systems languages, like Go, that provide support for concurrency but incur invisible, sometimes unavoidable, overheads on basic operations. Many applications, particularly in scientific computing, require something in between. In this thesis, I present DEF, a language that's close to the machine for the sake of performance engineering, but which also has features that provide support for concurrency. These features are designed with costs that don't impede code that doesn't use them, and preserve the flexibility enjoyed by C programmers in organizing memory layout and operations. DEF occupies the excluded middle between the two categories of languages and is suitable for high performance, scalable applications.by William Mitchell Leiserson.Ph. D.Ph.D. Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Scienc

DSpace@MIT

The Cilkprof Scalability Profiler

Author: Kuszmaul Bradley C
Lee I-Ting Angelina
Leiserson Charles E
Leiserson William Mitchell
Schardl Tao Benjamin
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/06/2015
Field of study

Cilkprof is a scalability profiler for multithreaded Cilk computations. Unlike its predecessor Cilkview, which analyzes only the whole-program scalability of a Cilk computation, Cilkprof collects work (serial running time) and span (critical-path length) data for each call site in the computation to assess how much each call site contributes to the overall work and span. Profiling work and span in this way enables a programmer to quickly diagnose scalability bottlenecks in a Cilk program. Despite the detail and quantity of information required to collect these measurements, Cilkprof runs with only constant asymptotic slowdown over the serial running time of the parallel computation. As an example of Cilkprof's usefulness, we used Cilkprof to diagnose a scalability bottleneck in an 1800-line parallel breadth-first search (PBFS) code. By examining Cilkprof's output in tandem with the source code, we were able to zero in on a call site within the PBFS routine that imposed a scalability bottleneck. A minor code modification then improved the parallelism of PBFS by a factor of 5. Using Cilkprof, it took us less than two hours to find and fix a scalability bug which had, until then, eluded us for months. This paper describes the Cilkprof algorithm and proves theoretically using an amortization argument that Cilkprof incurs only constant overhead compared with the application's native serial running time. Cilkprof was implemented by compiler instrumentation, that is, by modifying the LLVM compiler to insert instrumentation into user programs. On a suite of 16 application benchmarks, Cilkprof incurs a geometric-mean multiplicative overhead of only 1.9 and a maximum multiplicative overhead of only 7.4 compared with running the benchmarks without instrumentation

Crossref

DSpace@MIT

Conservative Memory Reclamation for Modern Operating Systems

Author: Alistarh Dan
Leiserson William Mitchell
Matveev Alexander
Shavit Nir N.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 03/07/2019
Field of study

The problem of efficient concurrent memory reclamation in unmanaged languages such as C or C++ is one of the major challenges facing the parallelization of billions of lines of legacy code. Garbage collectors for C/C++ can be inefficient; thus, programmers are often forced to use finely-crafted concurrent memory reclamation techniques. These techniques can provide good performance, but require considerable programming effort to deploy, and have strict requirements, allowing the programmer very little room for error. In this work, we present Forkscan, a new conservative concurrent memory reclamation scheme which is fully automatic and surprisingly scalable. Forkscan's semantics place it between automatic garbage collectors (it requires the programmer to explicitly retire nodes before they can be reclaimed), and concurrent memory reclamation techniques (as it does not assume that nodes are completely unlinked from the data structure for correctness). Forkscan's implementation exploits these new semantics for efficiency: we leverage parallelism and optimized implementations of signaling and copy-on-write in modern operating systems to efficiently obtain and process consistent snapshots of memory that can be scanned concurrently with the normal program operation. Empirical evaluation on a range of classical concurrent data structure microbenchmarks shows that Forkscan can preserve the scalability of the original code, while maintaining an order of magnitude lower latency than automatic garbage collection, and demonstrating competitive performance with finely crafted memory reclamation techniques

DSpace@MIT

Employee Voice Before Hirschman: Its Early History, Conceptualization, and Practice

Author: ______
______
______
______
______
______
______
Adam Smith
Albert Hirschman
Bloomfield
Bruce Evan Kaufman
Bruce Kaufman
C Balderston
Chad Brinsfield
Daniel Nelson
Daniel Willard
David Fairris
Edwards
Elizabeth Morrison
Eugene Benge
Gerald Zahavi
Gordon Watkins
H Morgan
H Porter
Harvey Levenstein
Henry Ford
Henry Lloyd
Henry Roland
Henry Seager
Howard Gospel
John Addison
John Commons
John D Rockefeller
John Fitch
John Leitch
John Mill
John Mitchell
John Pencavel
Karl Marx
La Dame
M Macnamara
Marcus Alexander
Milton Derber
Nathan Godfried
Nelson Lichtenstein
Ordway Tead
Paul Douglas
Paul Litchfield
Peter Holland
Richard Freeman
Robert Dunn
Robert Hoxie
Robert Porter
Robert Valentine
S M Jelley
Sam Lewisohn
Samuel Gompers
Sanford Jacoby
Sidney Webb
Sidney Williams
Sumner Slichter
W Atterbury
Walter Merritt
Willard Hotchkiss
William Chenery
William Green
William Leiserson
Publication venue: 'Elsevier BV'
Publication date: 01/01/2013
Field of study

Crossref