Dynamic Distribution Pruning for Efficient Network Architecture Search

Ji, Rongrong; Shao, Ling; Tang, Lang; Wan, Yan; Wu, Yongjian; Wu, Yunsheng; Zhang, Baochang; Zheng, Xiawu

Authors: Rongrong Ji, Ling Shao, Lang Tang, Yan Wan, Yongjian Wu, Yunsheng Wu, Baochang Zhang, Xiawu Zheng
Publication date: 9 June 2019

Abstract

Network architectures obtained by Neural Architecture Search (NAS) have shown state-of-the-art performance in various computer vision tasks. Despite the exciting progress, the computational complexity of the forward-backward propagation and the search process makes it difficult to apply NAS in practice. In particular, most previous methods require thousands of GPU days for the search process to converge. In this paper, we propose a dynamic distribution pruning method towards extremely efficient NAS, which samples architectures from a joint categorical distribution. The search space is dynamically pruned every few epochs to update this distribution, and the optimal neural architecture is obtained when only one structure remains. We conduct experiments on two widely used datasets in NAS. On CIFAR-10, the optimal structure obtained by our method achieves a state-of-the-art 1.9% test error, while the search process is more than 1,000 times faster (only 1.5 GPU hours on a Tesla V100) than state-of-the-art NAS algorithms. On ImageNet, our model achieves 75.2% top-1 accuracy under the MobileNet settings, with a time cost of only 2 GPU days, a 100% acceleration over the fastest NAS algorithm. The code is available at https://github.com/tanglang96/DDPNAS.
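
The sample-then-prune loop described in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: the abstract does not state how the distribution is updated or which candidates are pruned, so the softmax update over mean scores, the "drop the worst operation per edge" rule, the operation names, and the `evaluate` callback are all assumptions made for illustration. In a real search, `evaluate` would train and validate the sampled network.

```python
import math
import random


def sample_architecture(probs, rng):
    """Draw one operation per edge from the per-edge categorical distributions."""
    arch = []
    for edge_probs in probs:
        ops = list(edge_probs)
        weights = [edge_probs[op] for op in ops]
        arch.append(rng.choices(ops, weights=weights, k=1)[0])
    return arch


def prune_search(num_edges, ops, evaluate,
                 epochs_per_prune=3, samples_per_epoch=8, seed=0):
    """Toy search: sample, score, prune one op per edge, repeat until one is left.

    `evaluate(arch) -> float` is a hypothetical scoring callback (higher is better).
    """
    rng = random.Random(seed)
    # Start from a uniform categorical distribution on every edge.
    probs = [{op: 1.0 / len(ops) for op in ops} for _ in range(num_edges)]

    while any(len(p) > 1 for p in probs):
        totals = [{op: 0.0 for op in p} for p in probs]
        counts = [{op: 0 for op in p} for p in probs]

        # Sample architectures for a few epochs and record their scores.
        for _ in range(epochs_per_prune * samples_per_epoch):
            arch = sample_architecture(probs, rng)
            score = evaluate(arch)
            for edge, op in enumerate(arch):
                totals[edge][op] += score
                counts[edge][op] += 1

        for edge, p in enumerate(probs):
            if len(p) == 1:
                continue
            # Mean score per op; ops that were never sampled rank last.
            means = {
                op: totals[edge][op] / counts[edge][op]
                if counts[edge][op] else float("-inf")
                for op in p
            }
            # Assumed pruning rule: remove the worst-scoring op on this edge.
            del means[min(means, key=means.get)]
            # Assumed update rule: softmax over the surviving mean scores.
            top = max(means.values())
            exp = {op: math.exp(m - top) for op, m in means.items()}
            norm = sum(exp.values())
            probs[edge] = {op: v / norm for op, v in exp.items()}

    # Exactly one operation remains on each edge.
    return [next(iter(p)) for p in probs]


# Hypothetical operation set and a stand-in score, purely for demonstration.
OPS = ["skip", "max_pool", "conv3x3", "conv5x5"]
OP_VALUE = {"skip": 0.0, "max_pool": 1.0, "conv3x3": 2.0, "conv5x5": 3.0}


def toy_evaluate(arch):
    return sum(OP_VALUE[op] for op in arch)


final_arch = prune_search(num_edges=4, ops=OPS, evaluate=toy_evaluate)
```

Because each round removes one candidate per edge, the loop ends after `len(ops) - 1` rounds, which is what keeps the cost of this kind of search bounded. The sketch gives no guarantee that the best operation survives; that depends on the scoring noise and on the update rule chosen.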

Available Versions

arXiv.org e-Print Archive

oai:arXiv.org:1905.13543

Last time updated on 12/10/2020