
83 research outputs found

Beltway: Getting Around Garbage Collection Gridlock

Author: J Eliot B Moss
Kathryn S. McKinley
Richard Jones
Stephen M. Blackburn
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2002

We present the design and implementation of a new garbage collection framework that significantly generalizes existing copying collectors. The Beltway framework exploits and separates object age and incrementality. It groups objects in one or more increments on queues called belts, collects belts independently, and collects increments on a belt in first-in-first-out order. We show that Beltway configurations, selected by command line options, act and perform the same as semi-space, generational, and older-first collectors, and encompass all previous copying collectors of which we are aware. The increasing reliance on garbage collected languages such as Java requires that the collector perform well. We show that the generality of Beltway enables us to design and implement new collectors that are robust to variations in heap size and improve total execution time over the best generational copying collectors of which we are aware by up to 40%, and on average by 5 to 10%, for small to moderate heap sizes. New garbage collection algorithms are rare, and yet we define not just one, but a new family of collectors that subsumes previous work. This generality enables us to explore a larger design space and build better collectors.
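The belt-and-increment organization described in this abstract can be pictured with a toy model. The sketch below is only an illustration under stated assumptions, not the authors' implementation: the names `Belt`, `add_increment`, `collect_oldest`, and `collect` are hypothetical, and a caller-supplied liveness predicate stands in for real tracing from roots.

```python
from collections import deque


class Belt:
    """A FIFO queue of increments; each increment is a list of objects (hypothetical model)."""

    def __init__(self, name):
        self.name = name
        self.increments = deque()

    def add_increment(self, objects):
        # New increments join the back of the belt.
        self.increments.append(list(objects))

    def collect_oldest(self, is_live):
        """Remove the oldest increment and return the objects that survive it."""
        if not self.increments:
            return []
        oldest = self.increments.popleft()  # first-in-first-out order
        return [obj for obj in oldest if is_live(obj)]


def collect(source, target, is_live):
    """Collect the oldest increment on `source`, copying survivors onto `target`.

    Assumption: liveness is given by a predicate rather than computed by tracing.
    """
    survivors = source.collect_oldest(is_live)
    if survivors:
        target.add_increment(survivors)
    return survivors


# Two belts used in a generational-style arrangement (one possible configuration).
nursery, mature = Belt("nursery"), Belt("mature")
nursery.add_increment(["a", "b"])
nursery.add_increment(["c"])
live = {"b", "c"}
promoted = collect(nursery, mature, lambda obj: obj in live)
```

Here only the oldest nursery increment is collected, so `"c"` stays on the nursery belt until a later collection reaches its increment.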

CiteSeerX

Kent Academic Repository

An Algorithm for Static Tracing of Message Passing Interface Programs Using Data Flow Analysis

Publication venue: 'MECS Publisher'

Crossref

When Dataflow Analysis Meets Large Language Models

Author: Su Zian
Wang Chengpeng
Xie Xiaoheng
Xu Xiangzhe
Zhang Wuqi
Zhang Xiangyu
Publication date: 16/02/2024

Dataflow analysis is a powerful code analysis technique that reasons about dependencies between program values, offering support for code optimization, program comprehension, and bug detection. Existing approaches require the successful compilation of the subject program and customizations for downstream applications. This paper introduces LLMDFA, an LLM-powered dataflow analysis framework that analyzes arbitrary code snippets without requiring a compilation infrastructure and automatically synthesizes downstream applications. Inspired by summary-based dataflow analysis, LLMDFA decomposes the problem into three sub-problems, which are effectively resolved by several essential strategies, including few-shot chain-of-thought prompting and tool synthesis. Our evaluation has shown that the design can mitigate the hallucination and improve the reasoning ability, obtaining high precision and recall in detecting dataflow-related bugs upon benchmark programs, outperforming state-of-the-art (classic) tools, including a very recent industrial analyzer.

Comment: 15 pages, 16 figures, 5 tables

arXiv.org e-Print Archive

A Transparent Runtime Data Distribution Engine for OpenMP

Author: Ayguade E.
Labarta J.
Nikolopoulos Dimitrios
Papatheodorou T.S.
Polychronopoulos C.D.
Publication date: 01/12/2000

Queen's University Belfast Research Portal

Scheduling Transformation and Dependence Tests for Recursive Programs

Author: Kulkarni Milind
Sundararajah Kirshanthan
Publication venue: 'Purdue University (bepress)'
Publication date: 01/11/2018

Scheduling transformations reorder the execution of operations in a program to improve locality and/or parallelism. The polyhedral model provides a general framework for performing instance-wise scheduling transformations for regular programs, reordering the iterations of loops that operate over dense arrays through transformations like tiling. There is no analogous framework for recursive programs, despite recent interest in optimizations like tiling and fusion for recursive applications. This paper presents PolyRec, the first general framework for applying scheduling transformations (like inlining, interchange, and code motion) to nested recursive programs and reasoning about their correctness. We describe the phases of PolyRec (representing dynamic instances, applying transformations, reasoning about correctness) and show that PolyRec is able to apply sophisticated, composed transformations to complex, nested recursive programs and improve performance through enhanced locality.
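As a small illustration of what "reordering the iterations of loops" means for the regular-program case this abstract mentions, the sketch below enumerates the iteration instances of a two-deep loop nest in the original order and in a tiled order. It illustrates ordinary loop tiling only, not PolyRec; the function names and the `tile` parameter are hypothetical.

```python
def original_order(n, m):
    """Iteration instances (i, j) in plain row-major order."""
    return [(i, j) for i in range(n) for j in range(m)]


def tiled_order(n, m, tile):
    """The same instances, visited one tile x tile block at a time."""
    order = []
    for ii in range(0, n, tile):          # iterate over tile origins
        for jj in range(0, m, tile):
            for i in range(ii, min(ii + tile, n)):      # iterate inside one tile
                for j in range(jj, min(jj + tile, m)):
                    order.append((i, j))
    return order


plain = original_order(4, 4)
tiled = tiled_order(4, 4, 2)
```

Both schedules execute exactly the same set of instances; only their order changes. The transformation is legal when no dependence between instances is violated by the new order.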

Purdue E-Pubs

Self-plagiarism in computer science

Author: Bird S.J.
Brogan M.
Carpenter S.
Christian Collberg
Stephen Kobourov
Publication venue: 'Association for Computing Machinery (ACM)'

Crossref

Optimizing Local Memory Allocation and Assignment Through a Decoupled Approach

Author: Cohen Albert
Diouf Boubacar
Özturk Özcan
Publication venue: HAL CCSD
Publication date: 01/10/2009

Software-controlled local memories (LMs) are widely used to provide fast, scalable, power efficient and predictable access to critical data. While many studies addressed LM management, keeping hot data in the LM continues to cause major headaches. This paper revisits LM management of arrays in light of recent progress in register allocation, supporting multiple live-range splitting schemes through a generic integer linear program. These schemes differ in the grain of decision points. The model can also be extended to address fragmentation, assigning live ranges to precise offsets. We show that the links between LM management and register allocation have been underexploited, leaving many fundamental questions open and effective applications to be explored.

INRIA a CCSD electronic archive server

Macroservers: An Execution Model for DRAM Processor-In-Memory Arrays

Author: Sterling Thomas L.
Zima Hans P.
Publication venue: 'California Institute of Technology Library'
Publication date: 01/01/2000

The emergence of semiconductor fabrication technology allowing a tight coupling between high-density DRAM and CMOS logic on the same chip has led to the important new class of Processor-In-Memory (PIM) architectures. Newer developments provide powerful parallel processing capabilities on the chip, exploiting the facility to load wide words in single memory accesses and supporting complex address manipulations in the memory. Furthermore, large arrays of PIMs can be arranged into a massively parallel architecture. In this report, we describe an object-based programming model based on the notion of a macroserver. Macroservers encapsulate a set of variables and methods; threads, spawned by the activation of methods, operate asynchronously on the variables' state space. Data distributions provide a mechanism for mapping large data structures across the memory region of a macroserver, while work distributions allow explicit control of bindings between threads and data. Both data and work distributions are first-class objects of the model, supporting the dynamic management of data and threads in memory. This offers the flexibility required for fully exploiting the processing power and memory bandwidth of a PIM array, in particular for irregular and adaptive applications. Thread synchronization is based on atomic methods, condition variables, and futures. A special type of lightweight macroserver allows the formulation of flexible scheduling strategies for the access to resources, using a monitor-like mechanism.

CiteSeerX

Caltech Authors

Compiler architecture using a portable intermediate language

Author: Reig Galilea Fermín Javier
Publication date: 01/01/2002

The back end of a compiler performs machine-dependent tasks and low-level optimisations that are laborious to implement and difficult to debug. In addition, in languages that require run-time services such as garbage collection, the back end must interface with the run-time system to provide those services. The net result is that building a compiler back end entails a high implementation cost. In this dissertation I describe reusable code generation infrastructure that enables the construction of a complete programming language implementation (compiler and run-time system) with reduced effort. The infrastructure consists of a portable intermediate language, a compiler for this language and a low-level run-time system. I provide an implementation of this system and I show that it can support a variety of source programming languages, it reduces the overall effort required to implement a programming language, it can capture and retain information necessary to support run-time services and optimisations, and it produces efficient code.

Glasgow Theses Service