
    Remote-scope Promotion: Clarified, Rectified, and Verified

    Modern accelerator programming frameworks, such as OpenCL, organise threads into work-groups. Remote-scope promotion (RSP) is a language extension recently proposed by AMD researchers that is designed to enable applications, for the first time, both to optimise for the common case of intra-work-group communication (using memory scopes to provide consistency only within a work-group) and to allow occasional inter-work-group communication (as required, for instance, to support the popular load-balancing idiom of work stealing). We present the first formal, axiomatic memory model of OpenCL extended with RSP. We have extended the Herd memory model simulator with support for OpenCL kernels that exploit RSP, and used it to discover bugs in several litmus tests and a work-stealing queue that have previously been used in the study of RSP. We have also formalised the proposed GPU implementation of RSP. The formalisation process allowed us to identify bugs in the description of RSP that could result in well-synchronised programs experiencing memory inconsistencies. We present and prove sound a new implementation of RSP that incorporates bug fixes and requires less non-standard hardware than the original implementation. This work, a collaboration between academia and industry, clearly demonstrates how, when designing hardware support for a new concurrent language feature, the early application of formal tools and techniques can help to prevent errors, such as those we have found, from making it into silicon.
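    As a concrete flavour of the scoped atomics that RSP builds on, here is a minimal kernel sketch of our own, using only standard OpenCL 2.0 C (the abstract does not fix RSP's concrete syntax, so none is invented here):

        // Sketch only: a flag handoff using OpenCL 2.0 scoped atomics.
        // The work-group-scoped release is cheap but orders memory only
        // for threads in the same work-group; RSP is designed to let a
        // remote work-group promote such accesses to a wider scope on
        // demand, rather than paying for device scope on every access.
        kernel void handoff(global atomic_int *flag, global int *data) {
            data[0] = 42;  // plain store, published by the release below
            atomic_store_explicit(flag, 1, memory_order_release,
                                  memory_scope_work_group);
            // A device-scoped store would instead be visible to all
            // work-groups, at a higher cost:
            // atomic_store_explicit(flag, 1, memory_order_release,
            //                       memory_scope_device);
        }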

    Weak persistency semantics from the ground up: formalising the persistency semantics of ARMv8 and transactional models

    Emerging non-volatile memory (NVM) technologies promise the durability of disks with the performance of volatile memory (RAM). To describe the persistency guarantees of NVM, several memory persistency models have been proposed in the literature. However, the formal persistency semantics of mainstream hardware is unexplored to date. To close this gap, we present a formal declarative framework for describing concurrency models in the NVM context, and then develop the PARMv8 persistency model as an instance of our framework, formalising the persistency semantics of the ARMv8 architecture for the first time. To facilitate correct persistent programming, we study transactions as a simple abstraction for concurrency and persistency control. We thus develop the PSER (persistent serialisability) persistency model, formalising transactional semantics in the NVM context for the first time, and demonstrate that PSER correctly compiles to PARMv8. This then enables programmers to write correct, concurrent and persistent programs, without having to understand the low-level architecture-specific persistency semantics of the underlying hardware.
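    For a rough sense of the low-level persist operations such models describe, here is a C sketch of our own (the helper name and usage pattern are illustrative assumptions, not the paper's formalism) using ARMv8.2's DC CVAP instruction, which cleans a cache line to the point of persistence:

        #include <stdint.h>

        // Sketch only: write a value and explicitly persist it on
        // ARMv8.2+. 'dc cvap' cleans the line holding *p to the point
        // of persistence; 'dsb sy' orders the clean with surrounding
        // accesses. Absent such fences, the order in which stores
        // reach NVM is governed only by the persistency model
        // (e.g. PARMv8), not by program order.
        static inline void persist_u64(uint64_t *p, uint64_t v) {
            *p = v;
            __asm__ volatile("dc cvap, %0" :: "r"(p) : "memory");
            __asm__ volatile("dsb sy" ::: "memory");
        }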

    Loop splitting for efficient pipelining in high-level synthesis

    Loop pipelining is widely adopted as a key optimization method in high-level synthesis (HLS). However, when complex memory dependencies appear in a loop, commercial HLS tools are still not able to maximize pipeline performance. In this paper, we leverage parametric polyhedral analysis to reason about memory dependence patterns that are uncertain (i.e., parameterised by an undetermined variable) and/or nonuniform (i.e., varying between loop iterations). We develop an automated source-to-source code transformation to split the loop into pieces, which are then synthesised by Vivado HLS as the hardware generation back-end. Our technique allows generated loops to run with a minimal initiation interval, automatically inserting statically-determined parametric pipeline breaks at those iterations violating dependencies. Our experiments on seven representative benchmarks show that, compared to default loop pipelining, our parametric loop splitting improves pipeline performance by 4.3× in terms of clock cycles per iteration. The optimized pipelines consume 2.0× as many LUTs, 1.8× as many registers, and 1.1× as many DSP blocks. Hence the area-time product is improved by nearly a factor of 2.
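    To give a flavour of the transformation, consider this simplified example of our own (not one of the paper's benchmarks), in which the dependence distance is the undetermined parameter m:

        // Sketch only: a[i] depends on a[i-m], where the distance m is
        // a run-time parameter, so a conservative pipeline must assume
        // the worst-case (shortest) distance and stall accordingly.
        void kernel_original(float *a, const float *b, int N, int m) {
            for (int i = m; i < N; i++)
                a[i] = a[i - m] + b[i];
        }

        // Split into chunks of m iterations: no iteration reads a value
        // written within its own chunk, so each chunk pipelines with a
        // minimal initiation interval, and the chunk boundary acts as
        // the statically determined parametric pipeline break.
        void kernel_split(float *a, const float *b, int N, int m) {
            for (int lo = m; lo < N; lo += m)
                for (int i = lo; i < lo + m && i < N; i++)
                    a[i] = a[i - m] + b[i];
        }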

    Simulating operational memory models using off-the-shelf program analysis tools

    Memory models allow reasoning about the correctness of multithreaded programs. Constructing and using such models is facilitated by simulators that reveal which behaviours of a given program are allowed. While extensive work has been done on simulating axiomatic memory models, there has been less work on simulation of operational models. Operational models are often considered more intuitive than axiomatic models, but are challenging to simulate due to the vast number of paths through the model’s transition system. Observing that a similar path-explosion problem is tackled by program analysis tools, we investigate the idea of reducing the decision problem of “whether a given memory model allows a given behaviour” to the decision problem of “whether a given C program is safe”, which can be handled by a variety of off-the-shelf tools. We report on our experience using multiple program analysis tools for C for this purpose—a model checker (CBMC), a symbolic execution tool (KLEE), and three coverage-guided fuzzers (libFuzzer, Centipede and AFL++)—presenting two case studies. First, we evaluate the performance and scalability of these tools in the context of the x86 memory model, showing that fuzzers offer performance competitive with that of RMEM, a state-of-the-art bespoke memory model simulator. Second, we study a more complex, recently developed memory model for hybrid CPU/FPGA devices for which no bespoke simulator is available. We highlight how different encoding strategies can aid the various tools and show how our approach allows us to simulate the CPU/FPGA model twice as deeply as in prior work, leading to us finding and fixing several infidelities in the model. We also experimented with applying three analysis tools that won the “falsification” category in the 2023 Annual Software Verification Competition (SV-COMP). We found that these tools do not scale to our use cases, motivating us to submit example C programs arising from our work for inclusion in the set of SV-COMP benchmarks, so that they can serve as challenge examples.
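    The reduction can be pictured with a toy harness of our own (it uses the SV-COMP-style __VERIFIER_nondet_int convention; it is not the paper's x86 or CPU/FPGA encoding): nondeterministic choices drive the transitions of an operational model, and reaching the behaviour of interest fails an assertion, so any safety checker that reports a counterexample has shown the behaviour to be allowed:

        #include <assert.h>

        // The analysis tool explores all values returned here.
        extern int __VERIFIER_nondet_int(void);

        int main(void) {
            int state = 0;        // state of a toy transition system
            int observed = 0;     // behaviour of interest reached?
            for (int step = 0; step < 8; step++) {
                int choice = __VERIFIER_nondet_int();  // pick a transition
                // Toy transition relation standing in for the memory model.
                if (choice == 0 && state == 0) state = 1;
                else if (choice == 1 && state == 1) state = 2;
                else if (choice == 2 && state == 2) observed = 1;
            }
            // A reported assertion failure means some path exhibits the
            // behaviour, i.e. the model allows it.
            assert(!observed);
            return 0;
        }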

    Concurrency-aware thread scheduling for high-level synthesis

    When mapping C programs to hardware, high-level synthesis (HLS) tools seek to reorder instructions so they can be packed into as few clock cycles as possible. However, when synthesising multi-threaded C, instruction reordering is inhibited by the presence of atomic operations (‘atomics’), such as compare-and-swap. Atomics, the fundamental concurrency primitive in C, are the basis of more abstract concurrency mechanisms such as locks, and also of efficient lock-free data structures. Whether a particular atomic can be legally reordered within a thread can depend on the memory access patterns of other threads. Existing HLS tools that support atomics typically schedule each thread independently, and so must be conservative when optimising around atomics. Yet HLS tools are distinguished from conventional compilers by having the entire program available. Can this information be exploited to allow more reorderings within each thread, and hence to obtain more efficient schedules? In this work, we propose a global analysis that determines, for each thread, which pairs of instructions must not be reordered. Our analysis is sensitive to the C consistency mode of the atomics involved (e.g. relaxed, release, acquire, and sequentially consistent). We have used the Alloy model checker to validate our analysis against the C language standard, and have implemented it in the LegUp HLS tool. An evaluation on several lock-free data structure benchmarks indicates that our analysis leads to a 1.6× average global speedup.
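    A small C11 example of our own shows why the consistency mode matters to the scheduler: a release store forbids hoisting earlier memory operations past it, whereas a relaxed store imposes no such intra-thread constraint:

        #include <stdatomic.h>

        int data;
        atomic_int flag;

        // The store to 'data' must not be reordered after the release
        // store: a thread that acquires 'flag' may then read 'data'.
        // The scheduler must therefore preserve this order.
        void publish(void) {
            data = 42;
            atomic_store_explicit(&flag, 1, memory_order_release);
        }

        // With relaxed ordering there is no such constraint, so the
        // scheduler is free to reorder or overlap the two stores.
        void publish_relaxed(void) {
            data = 42;
            atomic_store_explicit(&flag, 1, memory_order_relaxed);
        }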

    Hardware synthesis of weakly consistent C concurrency

    Lock-free algorithms, in which threads synchronise not via coarse-grained mutual exclusion but via fine-grained atomic operations ('atomics'), have been shown empirically to be the fastest class of multi-threaded algorithms in the realm of conventional processors. This paper explores how these algorithms can be compiled from C to reconfigurable hardware via high-level synthesis (HLS). We focus on the scheduling problem, in which software instructions are assigned to hardware clock cycles. We first show that typical HLS scheduling constraints are insufficient to implement atomics, because they permit some instruction reorderings that, though sound in a single-threaded context, demonstrably cause erroneous results when synthesising multi-threaded programs. We then show that correct behaviour can be restored by imposing additional intra-thread constraints among the memory operations. We implement our approach in the open-source LegUp HLS framework, and provide both sequentially consistent (SC) and weakly consistent ('weak') atomics. Weak atomics necessitate fewer constraints than SC atomics, but suffice for many concurrent algorithms. We confirm, via automatic model-checking, that we correctly implement the semantics defined by the 2011 revision of the C standard. A case study on a circular buffer suggests that circuits synthesised from programs that use atomics can be 2.5× faster than those that use locks, and that weak atomics can yield a further 1.5× speedup.
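    The flavour of the case study can be sketched with a single-producer, single-consumer circular buffer in C11 (a minimal version of our own, assuming a power-of-two capacity; not the paper's benchmark code), in which weak acquire/release atomics on the indices suffice and no SC fences are needed:

        #include <stdatomic.h>
        #include <stdbool.h>

        #define CAP 16  // assumed power of two, so masking wraps indices

        int buf[CAP];
        atomic_uint head;  // advanced only by the consumer
        atomic_uint tail;  // advanced only by the producer

        // Producer: a release store to 'tail' publishes the element.
        bool enqueue(int v) {
            unsigned t = atomic_load_explicit(&tail, memory_order_relaxed);
            unsigned h = atomic_load_explicit(&head, memory_order_acquire);
            if (t - h == CAP) return false;        // full
            buf[t & (CAP - 1)] = v;
            atomic_store_explicit(&tail, t + 1, memory_order_release);
            return true;
        }

        // Consumer: the acquire load of 'tail' synchronises with the
        // producer's release store, making the element visible.
        bool dequeue(int *out) {
            unsigned h = atomic_load_explicit(&head, memory_order_relaxed);
            unsigned t = atomic_load_explicit(&tail, memory_order_acquire);
            if (t == h) return false;              // empty
            *out = buf[h & (CAP - 1)];
            atomic_store_explicit(&head, h + 1, memory_order_release);
            return true;
        }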