Search CORE

27,177 research outputs found

AdaComp : Adaptive Residual Gradient Compression for Data-Parallel Distributed Training

Author: Agrawal Ankur
Brand Daniel
Chen Chia-Yu
Choi Jungwook
Gopalakrishnan Kailash
Zhang Wei
Publication venue
Publication date: 07/12/2017
Field of study

Highly distributed training of Deep Neural Networks (DNNs) on future compute platforms (offering 100 of TeraOps/s of computational capacity) is expected to be severely communication constrained. To overcome this limitation, new gradient compression techniques are needed that are computationally friendly, applicable to a wide variety of layers seen in Deep Neural Networks and adaptable to variations in network architectures as well as their hyper-parameters. In this paper we introduce a novel technique - the Adaptive Residual Gradient Compression (AdaComp) scheme. AdaComp is based on localized selection of gradient residues and automatically tunes the compression rate depending on local activity. We show excellent results on a wide spectrum of state of the art Deep Learning models in multiple domains (vision, speech, language), datasets (MNIST, CIFAR10, ImageNet, BN50, Shakespeare), optimizers (SGD with momentum, Adam) and network parameters (number of learners, minibatch-size etc.). Exploiting both sparsity and quantization, we demonstrate end-to-end compression rates of ~200X for fully-connected and recurrent layers, and ~40X for convolutional layers, without any noticeable degradation in model accuracies.Comment: IBM Research AI, 9 pages, 7 figures, AAAI18 accepte

arXiv.org e-Print Archive

Association for the Advancement of Artificial Intelligence: AAAI Publications

BLADE: Filter Learning for General Purpose Computational Photography

Author: Choi Sungjoon
Garcia-Dorado Ignacio
Getreuer Pascal
Isidoro John
Milanfar Peyman
Ong Frank
Publication venue
Publication date: 07/12/2017
Field of study

The Rapid and Accurate Image Super Resolution (RAISR) method of Romano, Isidoro, and Milanfar is a computationally efficient image upscaling method using a trained set of filters. We describe a generalization of RAISR, which we name Best Linear Adaptive Enhancement (BLADE). This approach is a trainable edge-adaptive filtering framework that is general, simple, computationally efficient, and useful for a wide range of problems in computational photography. We show applications to operations which may appear in a camera pipeline including denoising, demosaicing, and stylization

arXiv.org e-Print Archive

Crossref

Body randomization reduces the sim-to-real gap for compliant quadruped locomotion

Author: Dambre Joni
Mahmud Hossain
Urbain Gabriel
Vandesompele Alexander
wyffels Francis
Publication venue: 'Frontiers Media SA'
Publication date: 01/01/2019
Field of study

Designing controllers for compliant, underactuated robots is challenging and usually requires a learning procedure. Learning robotic control in simulated environments can speed up the process whilst lowering risk of physical damage. Since perfect simulations are unfeasible, several techniques are used to improve transfer to the real world. Here, we investigate the impact of randomizing body parameters during learning of CPG controllers in simulation. The controllers are evaluated on our physical quadruped robot. We find that body randomization in simulation increases chances of finding gaits that function well on the real robot

Ghent University Academic Bibliography

Directory of Open Access Journals

Generative Compression

Author: Budden David
Santurkar Shibani
Shavit Nir
Publication venue
Publication date: 04/06/2017
Field of study

Traditional image and video compression algorithms rely on hand-crafted encoder/decoder pairs (codecs) that lack adaptability and are agnostic to the data being compressed. Here we describe the concept of generative compression, the compression of data using generative models, and suggest that it is a direction worth pursuing to produce more accurate and visually pleasing reconstructions at much deeper compression levels for both image and video data. We also demonstrate that generative compression is orders-of-magnitude more resilient to bit error rates (e.g. from noisy wireless channels) than traditional variable-length coding schemes

arXiv.org e-Print Archive

Crossref