Search CORE

587 research outputs found

Sparsely Aggregated Convolutional Networks

Author: Deng Ruizhi
Deng Zhiwei
Maire Michael
Mori Greg
Tan Ping
Zhu Ligeng
Publication venue
Publication date: 07/02/2019
Field of study

We explore a key architectural aspect of deep convolutional neural networks: the pattern of internal skip connections used to aggregate outputs of earlier layers for consumption by deeper layers. Such aggregation is critical to facilitate training of very deep networks in an end-to-end manner. This is a primary reason for the widespread adoption of residual networks, which aggregate outputs via cumulative summation. While subsequent works investigate alternative aggregation operations (e.g. concatenation), we focus on an orthogonal question: which outputs to aggregate at a particular point in the network. We propose a new internal connection structure which aggregates only a sparse set of previous outputs at any given depth. Our experiments demonstrate this simple design change offers superior performance with fewer parameters and lower computational requirements. Moreover, we show that sparse aggregation allows networks to scale more robustly to 1000+ layers, thereby opening future avenues for training long-running visual processes.Comment: Accepted to ECCV 201

arXiv.org e-Print Archive

Crossref

Dual Skipping Networks

Author: Cheng Changmao
Feng Jianfeng
Fu Yanwei
Jiang Yu-Gang
Liu Wei
Lu Wenlian
Xue Xiangyang
Publication venue
Publication date: 27/05/2018
Field of study

Inspired by the recent neuroscience studies on the left-right asymmetry of the human brain in processing low and high spatial frequency information, this paper introduces a dual skipping network which carries out coarse-to-fine object categorization. Such a network has two branches to simultaneously deal with both coarse and fine-grained classification tasks. Specifically, we propose a layer-skipping mechanism that learns a gating network to predict which layers to skip in the testing stage. This layer-skipping mechanism endows the network with good flexibility and capability in practice. Evaluations are conducted on several widely used coarse-to-fine object categorization benchmarks, and promising results are achieved by our proposed network model.Comment: CVPR 2018 (poster); fix typ

arXiv.org e-Print Archive

Crossref