Python is a popular language for end-user software development in many application domains. End-users want to harness parallel compute resources effectively, by exploiting commodity manycore technology including GPUs. However, existing approaches to parallelism in Python are esoteric, and generally seem too complex for the typical end-user developer. We argue that implicit, or automatic, parallelization is the best way to deliver the benefits of manycore to end-users, since it avoids domain-specific languages, specialist libraries, complex annotations or restrictive language subsets. Auto-parallelization fits the Python philosophy, provides effective performance, and is convenient for non-expert developers.



Although Python is a dynamic language, we show that it is a suitable target for auto-parallelization. In an empirical study of 3000+ open-source Python notebooks, we demonstrate that typical loop behaviour ‘in the wild’ is amenable to auto-parallelization. We show that staging the dependence analysis is an effective way to maximize performance. We first apply classical dependence analysis techniques, then leverage the Python runtime’s rich introspection capabilities to resolve additional loop bounds and variable types in a just-in-time manner. The parallel loop nest code is then converted to CUDA kernels for GPU execution. Across 12 loop-intensive standard benchmarks, we achieve orders of magnitude speedup over baseline interpreted execution and some speedup (up to 50x, although not consistently) over CPU JIT-compiled execution.
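As an illustration of the staging idea only, the following minimal sketch decides whether a loop of the form `for i in range(n): a[i] = a[i + k] + 1` carries a dependence between iterations. It is not the authors' implementation: the function names (`carried_dependence`, `run_loop`) and the single-subscript distance test are assumptions chosen for brevity. A static stage that does not know `k` or `n` must answer "unknown" (`None`); a runtime stage that can inspect their actual values can give a definite answer.

```python
from typing import Optional


def carried_dependence(write_offset: Optional[int],
                       read_offset: Optional[int],
                       trip_count: Optional[int]) -> Optional[bool]:
    """Dependence test for a write to a[i + write_offset] and a read of
    a[i + read_offset] inside `for i in range(trip_count)`.

    Returns True or False when the answer is known, and None when a
    value is still unknown and the decision must be deferred.
    """
    if write_offset is None or read_offset is None:
        return None                     # offsets not yet resolved
    distance = write_offset - read_offset
    if distance == 0:
        return False                    # same element, same iteration only
    if trip_count is None:
        return None                     # need the loop bound to decide
    # Two distinct iterations touch the same element only if the
    # dependence distance fits inside the iteration space.
    return abs(distance) < trip_count


def run_loop(a, n, k, order):
    """Execute the example loop body in the given iteration order."""
    out = list(a)
    for i in order:
        out[i] = out[i + k] + 1
    return out


# Stage 1 (static): k and n are symbolic, so no verdict is possible yet.
static_verdict = carried_dependence(0, None, None)

# Stage 2 (runtime): the actual values are visible just before execution.
n, k = 64, 100
runtime_verdict = carried_dependence(0, k, n)

# With no carried dependence, iteration order does not affect the result,
# which is the property a parallel (e.g. GPU) execution relies on.
data = list(range(n + k))
forward = run_loop(data, n, k, range(n))
backward = run_loop(data, n, k, reversed(range(n)))
```

With `k = 1` the same test reports a carried dependence, and running the loop forwards and backwards gives different results, so that loop would have to stay sequential.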

Jacob, Dejice

Trinder, Phil

Singer, Jeremy

English

Enlighten: Publications

Python Programmers Have GPUs Too: Automatic Python Loop Parallelization with Staged Dependence Analysis

Electronic coordination may drastically reduce transport costs, especially for digital or digitalizable products, where formerly regional local markets may shrink to a single point in space. In this paper we use a model with differentiated products to analyze the impact of declining transport costs on firm profits and consumer surplus. While consumers always gain, the effect on producers depends on the degree of product differentiation and the magnitude of transport costs in the electronic market mode. Profits rise only if products are substantially differentiated; in that case the positive effect of an extended consumer base, due to consumers' preference for product differentiation, dominates the negative effect of intensified competition. The condition becomes stricter if transport costs in the electronic market mode remain substantial: profits then increase only if products are almost independent.

Morasch, Karl

Welzel, Peter

Online-Publikationserver Augsburg

Emergence of electronic markets: implication of declining transport costs on firm profits and consumer surplus

Enlighten

OPUS Augsburg

OPUS - Augsburg University Publication Server

https://opus.bibliothek.uni-augsburg.de/opus4/files/71265/196.pdf
