Search CORE

23,338 research outputs found

Dynamic Control Flow in Large-Scale Machine Learning

Author: Abadi Martín
Barham Paul
Brevdo Eugene
Burrows Mike
Davis Andy
Dean Jeff
Ghemawat Sanjay
Harley Tim
Hawkins Peter
Isard Michael
Kudlur Manjunath
Monga Rajat
Murray Derek
Yu Yuan
Zheng Xiaoqiang
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 04/05/2018
Field of study

Many recent machine learning models rely on fine-grained dynamic control flow for training and inference. In particular, models based on recurrent neural networks and on reinforcement learning depend on recurrence relations, data-dependent conditional execution, and other features that call for dynamic control flow. These applications benefit from the ability to make rapid control-flow decisions across a set of computing devices in a distributed system. For performance, scalability, and expressiveness, a machine learning system must support dynamic control flow in distributed and heterogeneous environments. This paper presents a programming model for distributed machine learning that supports dynamic control flow. We describe the design of the programming model, and its implementation in TensorFlow, a distributed machine learning system. Our approach extends the use of dataflow graphs to represent machine learning models, offering several distinctive features. First, the branches of conditionals and bodies of loops can be partitioned across many machines to run on a set of heterogeneous devices, including CPUs, GPUs, and custom ASICs. Second, programs written in our model support automatic differentiation and distributed gradient computations, which are necessary for training machine learning models that use control flow. Third, our choice of non-strict semantics enables multiple loop iterations to execute in parallel across machines, and to overlap compute and I/O operations. We have done our work in the context of TensorFlow, and it has been used extensively in research and production. We evaluate it using several real-world applications, and demonstrate its performance and scalability.Comment: Appeared in EuroSys 2018. 14 pages, 16 figure

arXiv.org e-Print Archive

Crossref

GeNN: a code generation framework for accelerated brain simulations

Author: AJ Cope
C Rossant
DF Goodman
DF Goodman
E Ros
EM Izhikevich
EM Izhikevich
HÜ Dinkelbach
I Raikov
J Baladron
JM Nageswaran
MA Swertz
ML Hines
NF Rulkov
P Gleeson
R Brette
SC Eisenstat
T Nowotny
T Nowotny
VK Pallipuram
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/05/2015
Field of study

Large-scale numerical simulations of detailed brain circuit models are important for identifying hypotheses on brain functions and testing their consistency and plausibility. An ongoing challenge for simulating realistic models is, however, computational speed. In this paper, we present the GeNN (GPU-enhanced Neuronal Networks) framework, which aims to facilitate the use of graphics accelerators for computational models of large-scale neuronal networks to address this challenge. GeNN is an open source library that generates code to accelerate the execution of network simulations on NVIDIA GPUs, through a flexible and extensible interface, which does not require in-depth technical knowledge from the users. We present performance benchmarks showing that 200-fold speedup compared to a single core of a CPU can be achieved for a network of one million conductance based Hodgkin-Huxley neurons but that for other models the speedup can differ. GeNN is available for Linux, Mac OS X and Windows platforms. The source code, user manual, tutorials, Wiki, in-depth example projects and all other related information can be found on the project website http://genn-team.github.io/genn/

Crossref

PubMed Central

Sussex Research Online