Search CORE

10 research outputs found

Probabilistic Adaptive Computation Time

Author: Figurnov Michael
Sobolev Artem
Vetrov Dmitry
Publication venue
Publication date: 01/12/2017
Field of study

We present a probabilistic model with discrete latent variables that control the computation time in deep learning models such as ResNets and LSTMs. A prior on the latent variables expresses the preference for faster computation. The amount of computation for an input is determined via amortized maximum a posteriori (MAP) inference. MAP inference is performed using a novel stochastic variational optimization method. The recently proposed Adaptive Computation Time mechanism can be seen as an ad-hoc relaxation of this model. We demonstrate training using the general-purpose Concrete relaxation of discrete variables. Evaluation on ResNet shows that our method matches the speed-accuracy trade-off of Adaptive Computation Time, while allowing for evaluation with a simple deterministic procedure that has a lower memory footprint

arXiv.org e-Print Archive

Biblioteka Nauki - repozytorium artykuÅÃ³w

Spatially Adaptive Computation Time for Residual Networks

Author: Collins Maxwell D.
Figurnov Michael
Huang Jonathan
Salakhutdinov Ruslan
Vetrov Dmitry
Zhang Li
Zhu Yukun
Publication venue
Publication date: 02/07/2017
Field of study

This paper proposes a deep learning architecture based on Residual Network that dynamically adjusts the number of executed layers for the regions of the image. This architecture is end-to-end trainable, deterministic and problem-agnostic. It is therefore applicable without any modifications to a wide range of computer vision problems such as image classification, object detection and image segmentation. We present experimental results showing that this model improves the computational efficiency of Residual Networks on the challenging ImageNet classification and COCO object detection datasets. Additionally, we evaluate the computation time maps on the visual saliency dataset cat2000 and find that they correlate surprisingly well with human eye fixation positions.Comment: CVPR 201

arXiv.org e-Print Archive

Crossref