A Set of Criteria for Face Detection Preprocessing

Author: Ayatollahi Seyed Morteza
Nafchi Hossein Ziaei
Publication venue: Published by Elsevier B.V.
Publication date: 31/12/2012

Abstract: The goal of this paper is to provide a robust set of preprocessing steps that can be used with any face detection system. Preprocessing steps in a face detection system are usually meant to speed up the detection process and reduce false positives. A preprocessing step should reject an acceptable number of non-face windows. The first proposed criterion is based on a linear image transform (LIT), which skips scanning a number of non-face windows. The second criterion uses regional minima (RM) to reject non-face windows. The last one uses a modified adaptive thresholding (ADT) technique to convert the input image into a binary representation and performs an exclusion process on that binary form. The proposed criteria have been used in conjunction with a version of the Viola-Jones face detector. Experimental results show a significant advantage over the early exclusion criterion and the variance classifier in terms of speed and rejection rate. The CMU-MIT and BioID datasets were used in the experiments.
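The abstract does not give the details of the LIT, RM, or ADT criteria, so they are not reproduced here. As a point of reference, the variance classifier it names as a baseline can be sketched as follows: a window whose intensity variance falls below a threshold is rejected before the detector runs, and integral images make each variance lookup constant time. The function names, the fixed window size, and the threshold value are assumptions made for illustration only.

```python
def integral_image(img):
    """Summed-area table of size (h+1) x (w+1) with a zero top row and left column."""
    h, w = len(img), len(img[0])
    sat = [[0] * (w + 1) for _ in range(h + 1)]
    for y in range(h):
        row_sum = 0
        for x in range(w):
            row_sum += img[y][x]
            sat[y + 1][x + 1] = sat[y][x + 1] + row_sum
    return sat


def window_sum(sat, x, y, size):
    """Sum of the size x size window whose top-left pixel is (x, y)."""
    return (sat[y + size][x + size] - sat[y][x + size]
            - sat[y + size][x] + sat[y][x])


def surviving_windows(img, size, min_variance):
    """Return top-left corners of windows that pass the variance test.

    min_variance is a hypothetical threshold, not a value from the paper.
    """
    squared = [[p * p for p in row] for row in img]
    sat, sat_sq = integral_image(img), integral_image(squared)
    n = size * size
    kept = []
    for y in range(len(img) - size + 1):
        for x in range(len(img[0]) - size + 1):
            mean = window_sum(sat, x, y, size) / n
            variance = window_sum(sat_sq, x, y, size) / n - mean * mean
            if variance >= min_variance:
                kept.append((x, y))  # only these go on to the full detector
    return kept


# Toy image: flat on the left, textured on the right.
image = [
    [10, 10, 0, 255],
    [10, 10, 255, 0],
    [10, 10, 0, 255],
    [10, 10, 255, 0],
]
kept = surviving_windows(image, size=2, min_variance=100)
```

In this toy image the windows that lie entirely in the flat left-hand region are rejected, and every window that touches the textured region is kept.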

Elsevier - Publisher Connector

Incremental Training of a Detector Using Online Sparse Eigen-decomposition

Author: Paisitkriangkrai Sakrapee
Shen Chunhua
Zhang Jian
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 22/05/2010

The ability to detect objects efficiently and accurately plays a crucial role in many computer vision tasks. Recently, offline object detectors have shown tremendous success. However, one major drawback of offline techniques is that a complete set of training data has to be collected beforehand. In addition, once learned, an offline detector cannot make use of newly arriving data. To alleviate these drawbacks, online learning has been adopted with the following objectives: (1) the technique should be computationally and storage efficient; (2) the updated classifier must maintain its high classification accuracy. In this paper, we propose an effective and efficient framework for learning an adaptive online greedy sparse linear discriminant analysis (GSLDA) model. Unlike many existing online boosting detectors, which usually apply exponential or logistic loss, our online algorithm makes use of LDA's learning criterion, which not only aims to maximize the class-separation criterion but also incorporates the asymmetrical property of training data distributions. We provide a better alternative to online boosting algorithms in the context of training a visual object detector. We demonstrate the robustness and efficiency of our methods on handwritten digit and face data sets. Our results confirm that object detection tasks benefit significantly when trained in an online manner. Comment: 14 page

arXiv.org e-Print Archive

Crossref

Adelaide Research & Scholarship

OPUS - University of Technology Sydney

The Australian National University

A Comprehensive Performance Evaluation of Deformable Face Tracking "In-the-Wild"

Author: Antonakos Epameinondas
Asthana Akshay
Chrysos Grigorios G.
Snape Patrick
Zafeiriou Stefanos
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 10/02/2017

Recently, technologies such as face detection, facial landmark localisation and face recognition and verification have matured enough to provide effective and efficient solutions for imagery captured under arbitrary conditions (referred to as "in-the-wild"). This is partially attributed to the fact that comprehensive "in-the-wild" benchmarks have been developed for face detection, landmark localisation and recognition/verification. A very important technology that has not been thoroughly evaluated yet is deformable face tracking "in-the-wild". Until now, the performance has mainly been assessed qualitatively by visually assessing the result of a deformable face tracking technology on short videos. In this paper, we perform the first, to the best of our knowledge, thorough evaluation of state-of-the-art deformable face tracking pipelines using the recently introduced 300VW benchmark. We evaluate many different architectures focusing mainly on the task of on-line deformable face tracking. In particular, we compare the following general strategies: (a) generic face detection plus generic facial landmark localisation, (b) generic model-free tracking plus generic facial landmark localisation, as well as (c) hybrid approaches using state-of-the-art face detection, model-free tracking and facial landmark localisation technologies. Our evaluation reveals future avenues for further research on the topic. Comment: E. Antonakos and P. Snape contributed equally and have joint second authorshi

arXiv.org e-Print Archive

Springer - Publisher Connector

Spiral - Imperial College Digital Repository

Grid Loss: Detecting Occluded Faces

Author: C Dubout
C Garcia
D Chen
H Rowley
J Yan
M Everingham
M Mathias
N Srivastava
P Dollár
P Viola
PF Felzenszwalb
R Vaillant
S Zafeiriou
Publication date: 01/09/2016

Detection of partially occluded objects is a challenging computer vision problem. Standard Convolutional Neural Network (CNN) detectors fail if parts of the detection window are occluded, since not every sub-part of the window is discriminative on its own. To address this issue, we propose a novel loss layer for CNNs, named grid loss, which minimizes the error rate on sub-blocks of a convolution layer independently rather than over the whole feature map. This results in parts being more discriminative on their own, enabling the detector to recover if the detection window is partially occluded. By mapping our loss layer back to a regular fully connected layer, no additional computational cost is incurred at runtime compared to standard CNNs. We demonstrate our method for face detection on several public face detection benchmarks and show that our method outperforms regular CNNs, is suitable for realtime applications and achieves state-of-the-art performance. Comment: accepted to ECCV 201
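The idea of penalising each sub-block separately can be illustrated with a small sketch. It assumes a hinge loss, one linear classifier per block, an extra term on the summed score weighted by a hypothetical factor `lam`, and a per-block margin equal to the whole-window margin divided by the number of blocks. None of these choices is stated in the abstract, so the code shows the general idea and is not the paper's exact formulation.

```python
def hinge(margin, label, score):
    """Hinge loss for a label in {-1, +1}."""
    return max(0.0, margin - label * score)


def block_scores(blocks, weights, biases):
    """Score every sub-block with its own linear classifier."""
    return [
        sum(w * f for w, f in zip(ws, fs)) + b
        for fs, ws, b in zip(blocks, weights, biases)
    ]


def grid_loss(blocks, weights, biases, label,
              whole_margin=1.0, block_margin=None, lam=1.0):
    """Hinge loss on the whole window plus a hinge loss on each block.

    whole_margin, block_margin and lam are illustrative assumptions.
    """
    scores = block_scores(blocks, weights, biases)
    if block_margin is None:
        block_margin = whole_margin / len(blocks)
    # The whole-window score is the sum of the block scores, which is what a
    # single fully connected layer with concatenated weights would compute.
    whole_score = sum(scores)
    per_block = sum(hinge(block_margin, label, s) for s in scores)
    return hinge(whole_margin, label, whole_score) + lam * per_block


# Two blocks of a positive example; the second block votes the wrong way,
# as an occluded part might.
loss_occluded = grid_loss(
    blocks=[[1.0, 0.0], [0.0, 1.0]],
    weights=[[1.0, 0.0], [0.0, -1.0]],
    biases=[0.0, 0.0],
    label=+1,
)

# Both blocks meet their own margin, so nothing is penalised.
loss_clean = grid_loss(
    blocks=[[1.0], [1.0]],
    weights=[[0.5], [0.5]],
    biases=[0.0, 0.0],
    label=+1,
)
```

Because the whole-window score is just the sum of the block scores, the per-block classifiers can be concatenated into one linear layer at test time, which is consistent with the abstract's claim of no extra runtime cost.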

arXiv.org e-Print Archive

Crossref

Asymmetric Pruning for Learning Cascade Detectors

Author: Hengel Anton van den
Paisitkriangkrai Sakrapee
Shen Chunhua
Publication date: 01/01/2014

Cascade classifiers are one of the most important contributions to real-time object detection. Nonetheless, many challenging problems arise in training cascade detectors. One common issue is that the node classifier is trained with a symmetric classifier. Having a low misclassification error rate does not guarantee an optimal node learning goal in cascade classifiers, i.e., an extremely high detection rate with a moderate false positive rate. In this work, we present a new approach to train an effective node classifier in a cascade detector. The algorithm is based on two key observations: 1) redundant weak classifiers can be safely discarded; 2) the final detector should satisfy the asymmetric learning objective of the cascade architecture. To achieve this, we separate the classifier training into two steps: finding a pool of discriminative weak classifiers/features, and training the final classifier by pruning weak classifiers which contribute little to the asymmetric learning criterion (asymmetric classifier construction). Our model reduction approach helps accelerate the learning time while achieving the pre-determined learning objective. Experimental results on both face and car data sets verify the effectiveness of the proposed algorithm. On the FDDB face data set, our approach achieves state-of-the-art performance, which demonstrates the advantage of our approach. Comment: 14 page
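The asymmetric node goal described above (an extremely high detection rate at a moderate false positive rate) can be made concrete with a small sketch. It only shows how a node threshold might be chosen to meet a target detection rate and what false positive rate results; it does not implement the paper's pruning procedure. The function names and the toy scores are assumptions.

```python
import math


def node_threshold(pos_scores, min_detection_rate):
    """Largest threshold that still accepts the required share of positives."""
    ranked = sorted(pos_scores, reverse=True)
    needed = max(1, math.ceil(min_detection_rate * len(ranked)))
    return ranked[needed - 1]


def node_rates(pos_scores, neg_scores, threshold):
    """Detection rate and false positive rate when accepting score >= threshold."""
    detection = sum(s >= threshold for s in pos_scores) / len(pos_scores)
    false_pos = sum(s >= threshold for s in neg_scores) / len(neg_scores)
    return detection, false_pos


# Hypothetical node classifier outputs for faces and non-faces.
pos = [5, 4, 4, 3, 3, 2, 2, 1, 1, -3]
neg = [2, 0, -1, -2, 1, -4, -5, -1, 0, -2]

thr = node_threshold(pos, min_detection_rate=0.9)
detection_rate, false_positive_rate = node_rates(pos, neg, thr)
```

A node tuned this way deliberately tolerates some false positives, since later nodes in the cascade can still reject them, whereas a missed face cannot be recovered.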

arXiv.org e-Print Archive

Adelaide Research & Scholarship

"'Who are you?' - Learning person specific classifiers from video"

Author: Everingham M.
Sivic J.
Zisserman A.
Publication date: 01/06/2009

We investigate the problem of automatically labelling faces of characters in TV or movie material with their names, using only weak supervision from automatically aligned subtitle and script text. Our previous work (Everingham et al. [8]) demonstrated promising results on the task, but the coverage of the method (proportion of video labelled) and generalization was limited by a restriction to frontal faces and nearest neighbour classification. In this paper we build on that method, extending the coverage greatly by the detection and recognition of characters in profile views. In addition, we make the following contributions: (i) seamless tracking, integration and recognition of profile and frontal detections, and (ii) a character specific multiple kernel classifier which is able to learn the features best able to discriminate between the characters. We report results on seven episodes of the TV series “Buffy the Vampire Slayer”, demonstrating significantly increased coverage and performance with respect to previous methods on this material.

CiteSeerX

White Rose Research Online

Automatic nesting seabird detection based on boosted HOG-LBP descriptors

Author: Dickinson Patrick
Freeman Robin
Lawson Shaun
Qing Chunmei
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/09/2011

Seabird populations are considered an important and accessible indicator of the health of marine environments: variations have been linked with climate change and pollution. However, manual monitoring of large populations is labour-intensive and requires a significant investment of time and effort. In this paper, we propose a novel detection system for monitoring a specific population of Common Guillemots on Skomer Island, West Wales (UK). We incorporate two types of features, Histograms of Oriented Gradients (HOG) and Local Binary Pattern (LBP), to capture the edge/local shape information and the texture information of nesting seabirds. Optimal features are selected from a large HOG-LBP feature pool by boosting techniques, to calculate a compact representation suitable for the SVM classifier. A comparative study of two kinds of detectors, i.e., a whole-body detector and a head-beak detector, and of their fusion is presented. When the proposed method is applied to seabird detection, consistent and promising results are achieved. © 2011 IEEE
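For readers unfamiliar with the texture feature named above, the basic 8-neighbour LBP code can be sketched as follows. This is the textbook operator, not necessarily the variant used in the paper. The bit ordering (clockwise from the top-left neighbour) and the `>=` comparison are conventions assumed here, and the function names are illustrative.

```python
# Neighbour offsets as (dy, dx), clockwise starting at the top-left pixel.
OFFSETS = [(-1, -1), (-1, 0), (-1, 1), (0, 1),
           (1, 1), (1, 0), (1, -1), (0, -1)]


def lbp_code(img, y, x):
    """8-bit LBP code of the pixel at (y, x); the pixel must not lie on the border."""
    center = img[y][x]
    code = 0
    for bit, (dy, dx) in enumerate(OFFSETS):
        if img[y + dy][x + dx] >= center:
            code |= 1 << bit  # neighbour at least as bright as the centre
    return code


def lbp_histogram(img):
    """256-bin histogram of LBP codes over all interior pixels."""
    hist = [0] * 256
    for y in range(1, len(img) - 1):
        for x in range(1, len(img[0]) - 1):
            hist[lbp_code(img, y, x)] += 1
    return hist


# Bright corners, dark edges around a mid-grey centre.
patch = [
    [9, 1, 9],
    [1, 5, 1],
    [9, 1, 9],
]
code = lbp_code(patch, 1, 1)
hist = lbp_histogram(patch)
```

Histograms like this one, computed per image cell, are the kind of texture descriptor that could be concatenated with HOG features before feature selection and classification.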

University of Lincoln Institutional Repository

Crossref

UCL Discovery