Search CORE

65 research outputs found

A Review of Object Detection Models based on Convolutional Neural Network

Author: DG Lowe
E Shelhamer
G Wolberg
K Fukushima
M Everingham
MA Hearst
O Russakovsky
PF Felzenszwalb
T-Y Lin
W Li
Y LeCun
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/10/2019
Field of study

Convolutional Neural Network (CNN) has become the state-of-the-art for object detection in image task. In this chapter, we have explained different state-of-the-art CNN based object detection models. We have made this review with categorization those detection models according to two different approaches: two-stage approach and one-stage approach. Through this chapter, it has shown advancements in object detection models from R-CNN to latest RefineDet. It has also discussed the model description and training details of each model. Here, we have also drawn a comparison among those models.Comment: 17 pages, 11 figures, 1 tabl

arXiv.org e-Print Archive

Crossref

MegDet: A Large Mini-Batch Object Detector

Author: Jia Kai
Jiang Yuning
Li Zeming
Peng Chao
Sun Jian
Xiao Tete
Yu Gang
Zhang Xiangyu
Publication venue
Publication date: 11/04/2018
Field of study

The improvements in recent CNN-based object detection works, from R-CNN [11], Fast/Faster R-CNN [10, 31] to recent Mask R-CNN [14] and RetinaNet [24], mainly come from new network, new framework, or novel loss design. But mini-batch size, a key factor in the training, has not been well studied. In this paper, we propose a Large MiniBatch Object Detector (MegDet) to enable the training with much larger mini-batch size than before (e.g. from 16 to 256), so that we can effectively utilize multiple GPUs (up to 128 in our experiments) to significantly shorten the training time. Technically, we suggest a learning rate policy and Cross-GPU Batch Normalization, which together allow us to successfully train a large mini-batch detector in much less time (e.g., from 33 hours to 4 hours), and achieve even better accuracy. The MegDet is the backbone of our submission (mmAP 52.5%) to COCO 2017 Challenge, where we won the 1st place of Detection task

arXiv.org e-Print Archive

Crossref