Structured channel pruning has been shown to significantly accelerate
inference time for convolutional neural networks (CNNs) on modern hardware, with
a relatively minor loss of network accuracy. Recent works permanently zero
these channels during training, which we observe to significantly hamper final
accuracy, particularly as the fraction of the network being pruned increases.
We propose Soft Masking for cost-constrained Channel Pruning (SMCP) to allow
pruned channels to adaptively return to the network while simultaneously
pruning towards a target cost constraint. By adding a soft mask
re-parameterization of the weights and treating channel pruning as the removal
of input channels, we allow gradient updates to previously pruned channels and
give those channels the opportunity to later return to the network.
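As a rough illustration of this idea (not the authors' implementation; the module, the straight-through gradient, and the mask-update details below are assumptions), a soft-masked convolution whose pruned input channels still receive gradient updates might look like:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class SoftMaskedConv2d(nn.Module):
    """Conv layer whose *input* channels are gated by a re-parameterized mask.

    Illustrative sketch only: a straight-through-style gradient is assumed so
    that weights of masked (pruned) channels keep receiving updates and can
    re-enter the network when the pruning schedule flips their gate back to 1.
    """

    def __init__(self, in_channels, out_channels, kernel_size, stride=1, padding=0):
        super().__init__()
        self.conv = nn.Conv2d(in_channels, out_channels, kernel_size,
                              stride=stride, padding=padding, bias=False)
        # One binary gate per input channel, set externally by the pruning schedule.
        self.register_buffer("mask", torch.ones(in_channels))

    def forward(self, x):
        w = self.conv.weight                      # (out_ch, in_ch, kH, kW)
        m = self.mask.view(1, -1, 1, 1)
        # Forward uses the masked weights; the .detach() trick keeps the backward
        # pass dense, so previously pruned channels still accumulate gradients.
        w_masked = w + (w * m - w).detach()
        return F.conv2d(x, w_masked, stride=self.conv.stride, padding=self.conv.padding)
```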
We then formulate input channel pruning as a global resource allocation
problem. Our method outperforms prior works on both the ImageNet classification
and PASCAL VOC detection datasets.
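The global resource allocation step above can be pictured as a knapsack-style selection of input channels under a shared cost budget. The sketch below is a hypothetical greedy illustration; the names, the importance and cost estimates, and the solver are assumptions, not the paper's exact formulation:

```python
from dataclasses import dataclass

@dataclass
class Channel:
    layer: str          # which layer the input channel belongs to
    index: int          # channel index within that layer
    importance: float   # estimated accuracy contribution if kept
    cost: float         # estimated latency/FLOP contribution if kept

def allocate_channels(channels, budget):
    """Keep the channels with the best importance-per-cost ratio across all
    layers until the global cost budget is exhausted (greedy knapsack)."""
    kept, spent = set(), 0.0
    for ch in sorted(channels, key=lambda c: c.importance / max(c.cost, 1e-12),
                     reverse=True):
        if spent + ch.cost <= budget:
            kept.add((ch.layer, ch.index))
            spent += ch.cost
    # Channels not selected here would have their soft mask set to 0 for the
    # next training interval, but may be re-selected at a later step.
    return kept
```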