Single-Shot Refinement Neural Network for Object Detection

Author: Bian Xiao
Lei Zhen
Li Stan Z.
Wen Longyin
Zhang Shifeng
Publication date: 03/01/2018

For object detection, the two-stage approach (e.g., Faster R-CNN) has been achieving the highest accuracy, whereas the one-stage approach (e.g., SSD) has the advantage of high efficiency. To inherit the merits of both while overcoming their disadvantages, in this paper, we propose a novel single-shot based detector, called RefineDet, that achieves better accuracy than two-stage methods and maintains efficiency comparable to one-stage methods. RefineDet consists of two inter-connected modules, namely, the anchor refinement module and the object detection module. Specifically, the former aims to (1) filter out negative anchors to reduce the search space for the classifier, and (2) coarsely adjust the locations and sizes of anchors to provide better initialization for the subsequent regressor. The latter module takes the refined anchors from the former as input to further improve the regression and predict multi-class labels. Meanwhile, we design a transfer connection block to transfer the features in the anchor refinement module to predict locations, sizes and class labels of objects in the object detection module. The multi-task loss function enables us to train the whole network in an end-to-end way. Extensive experiments on PASCAL VOC 2007, PASCAL VOC 2012, and MS COCO demonstrate that RefineDet achieves state-of-the-art detection accuracy with high efficiency. Code is available at https://github.com/sfzhang15/RefineDet. Comment: 14 pages, 7 figures, 7 tables
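The two jobs of the anchor refinement module described above (dropping confident negatives, then coarsely adjusting the remaining anchors) can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `refine_anchors`, the threshold value `0.99`, and the center-size offset encoding are assumptions chosen for illustration only.

```python
import math
from typing import List, Sequence, Tuple

# A box is (center_x, center_y, width, height); this layout is an assumption.
Box = Tuple[float, float, float, float]


def refine_anchors(
    anchors: Sequence[Box],
    offsets: Sequence[Box],
    neg_scores: Sequence[float],
    neg_threshold: float = 0.99,  # assumed value, not taken from the abstract
) -> List[Tuple[int, Box]]:
    """Return (anchor_index, refined_box) for anchors that survive filtering.

    Step 1: discard anchors whose background (negative) confidence exceeds
            the threshold, shrinking the search space for the classifier.
    Step 2: coarsely shift and rescale the remaining anchors using predicted
            offsets (dx, dy, dw, dh) in a common center-size encoding.
    """
    kept: List[Tuple[int, Box]] = []
    for i, (anchor, offset, p_neg) in enumerate(zip(anchors, offsets, neg_scores)):
        if p_neg > neg_threshold:
            continue  # confidently background: filtered out
        cx, cy, w, h = anchor
        dx, dy, dw, dh = offset
        refined = (cx + dx * w, cy + dy * h, w * math.exp(dw), h * math.exp(dh))
        kept.append((i, refined))
    return kept


if __name__ == "__main__":
    demo_anchors = [(0.5, 0.5, 0.2, 0.2), (0.3, 0.3, 0.1, 0.1)]
    demo_offsets = [(0.1, 0.0, 0.0, 0.0), (0.0, 0.0, 0.0, 0.0)]
    demo_neg = [0.2, 0.995]
    # Only the first anchor survives; its center is nudged to the right.
    print(refine_anchors(demo_anchors, demo_offsets, demo_neg))
```

In this sketch the surviving refined boxes would then be handed to a second stage (the object detection module in the abstract's terms) for further regression and multi-class labeling.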

arXiv.org e-Print Archive

Crossref

A Review of Object Detection Models based on Convolutional Neural Network

Author: DG Lowe
E Shelhamer
G Wolberg
K Fukushima
M Everingham
MA Hearst
O Russakovsky
PF Felzenszwalb
T-Y Lin
W Li
Y LeCun
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/10/2019

Convolutional neural networks (CNNs) have become the state of the art for object detection in images. In this chapter, we describe several state-of-the-art CNN-based object detection models. We categorize these models according to two approaches: the two-stage approach and the one-stage approach. The chapter traces advances in object detection models from R-CNN to the more recent RefineDet. It also discusses the model description and training details of each model, and draws a comparison among them. Comment: 17 pages, 11 figures, 1 table

arXiv.org e-Print Archive

Crossref

S$^3$FD: Single Shot Scale-invariant Face Detector

Author: Lei Zhen
Li Stan Z.
Shi Hailin
Wang Xiaobo
Zhang Shifeng
Zhu Xiangyu
Publication date: 15/11/2017

This paper presents a real-time face detector, named Single Shot Scale-invariant Face Detector (S$^3$FD), which performs strongly on faces of various scales with a single deep neural network, especially for small faces. Specifically, we try to solve the common problem that anchor-based detectors deteriorate dramatically as the objects become smaller. We make contributions in the following three aspects: 1) proposing a scale-equitable face detection framework to handle different scales of faces well. We tile anchors on a wide range of layers to ensure that all scales of faces have enough features for detection. Besides, we design anchor scales based on the effective receptive field and a proposed equal proportion interval principle; 2) improving the recall rate of small faces by a scale compensation anchor matching strategy; 3) reducing the false positive rate of small faces via a max-out background label. As a consequence, our method achieves state-of-the-art detection performance on all the common face detection benchmarks, including the AFW, PASCAL face, FDDB and WIDER FACE datasets, and can run at 36 FPS on an Nvidia Titan X (Pascal) for VGA-resolution images. Comment: Accepted by ICCV 2017 + its supplementary materials; Updated the latest results on WIDER FACE
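As a rough illustration of the third contribution, a max-out background label can be read as predicting several background scores per anchor and keeping only the largest one before normalizing against the face score. The sketch below follows that reading; the function name `max_out_background`, the number of background scores, and the two-way softmax are assumptions for illustration, not details confirmed by the abstract.

```python
import math
from typing import Sequence, Tuple


def max_out_background(bg_logits: Sequence[float], face_logit: float) -> Tuple[float, float]:
    """Return (p_background, p_face) for a single anchor.

    Several background logits are predicted for the anchor (how many is an
    assumption of this sketch); the maximum is used as the background logit,
    and a two-way softmax converts it and the face logit into probabilities.
    """
    if not bg_logits:
        raise ValueError("at least one background logit is required")
    bg = max(bg_logits)              # the max-out step
    shift = max(bg, face_logit)      # subtract the max for numerical stability
    e_bg = math.exp(bg - shift)
    e_face = math.exp(face_logit - shift)
    total = e_bg + e_face
    return e_bg / total, e_face / total


if __name__ == "__main__":
    # One strong background vote is enough to pull the face probability down.
    print(max_out_background([0.0, 2.0, 1.0], 2.0))
```

The intent of such a design is that an anchor must beat every background score to be called a face, which would tend to suppress false positives among the many small anchors.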

arXiv.org e-Print Archive

Crossref