
1,383 research outputs found


Efficient smile detection by Extreme Learning Machine

Author: An L
Bhanu B
Yang S
Publication venue: eScholarship, University of California
Publication date: 01/02/2015

Smile detection is a specialized task in facial expression analysis with applications such as photo selection, user experience analysis, and patient monitoring. As one of the most important and informative expressions, a smile conveys an underlying emotional state such as joy, happiness, or satisfaction. In this paper, an efficient smile detection approach based on the Extreme Learning Machine (ELM) is proposed. Faces are first detected, and a holistic flow-based face registration is applied that needs no manual labeling or key point detection. An ELM is then used to train the classifier. The proposed smile detector is tested with different feature descriptors on publicly available databases that include real-world face images. Comparisons against benchmark classifiers, including the Support Vector Machine (SVM) and Linear Discriminant Analysis (LDA), suggest that the proposed ELM-based smile detector generally performs better and is very efficient. Compared to state-of-the-art smile detectors, the proposed method achieves competitive results without preprocessing or manual registration.
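As a rough illustration of the classifier named in this abstract, the sketch below shows a generic single-hidden-layer ELM: the hidden weights are drawn at random and left fixed, and only the output weights are solved in closed form. It is not the authors' implementation. The function names `train_elm` and `predict_elm`, the `tanh` activation, the ridge term `reg`, and the 0.5 decision threshold are all assumptions made for this example, and face detection, registration, and feature extraction are left out.

```python
import numpy as np

def train_elm(X, y, n_hidden=50, reg=1e-3, seed=0):
    """Fit a basic ELM for binary labels y in {0, 1} (illustrative sketch)."""
    rng = np.random.default_rng(seed)
    # Hidden-layer weights and biases are random and never updated.
    W = rng.normal(size=(X.shape[1], n_hidden))
    b = rng.normal(size=n_hidden)
    H = np.tanh(X @ W + b)
    # Output weights: ridge-regularised least squares in closed form.
    beta = np.linalg.solve(H.T @ H + reg * np.eye(n_hidden), H.T @ y)
    return W, b, beta

def predict_elm(model, X):
    """Return 0/1 predictions by thresholding the ELM output at 0.5."""
    W, b, beta = model
    scores = np.tanh(X @ W + b) @ beta
    return (scores > 0.5).astype(int)
```

Because training reduces to one linear solve, this kind of model is typically fast to fit, which is consistent with the efficiency claim in the abstract.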

eScholarship - University of California

Provably scale-covariant networks from oriented quasi quadrature measures in cascade

Author: DH Hubel
DJ Heeger
DLK Yamins
E Adelson
EN Johnson
H Bay
J Bruna
J Touryan
J Westö
JJ Koenderink
K Fukushima
L Liu
M Carandini
M Varma
RL De Valois
RLT Goris
T Lindeberg
T Ojala
T Serre
TH Chan
Z Cai
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2019

This article presents a continuous model for hierarchical networks based on a combination of mathematically derived models of receptive fields and biologically inspired computations. Based on a functional model of complex cells in terms of an oriented quasi quadrature combination of first- and second-order directional Gaussian derivatives, we couple such primitive computations in cascade over combinatorial expansions over image orientations. Scale-space properties of the computational primitives are analysed, and it is shown that the resulting representation allows for provable scale and rotation covariance. A prototype application to texture analysis is developed, and it is demonstrated that a simplified mean-reduced representation of the resulting QuasiQuadNet leads to promising experimental results on three texture datasets. Comment: 12 pages, 3 figures, 1 table

arXiv.org e-Print Archive

Publikationer från KTH

Crossref

Digitala Vetenskapliga Arkivet - Academic Archive On-line

Oriented Response Networks

Author: Jiao Jianbin
Qiu Qiang
Ye Qixiang
Zhou Yanzhao
Publication date: 12/07/2017

Deep Convolutional Neural Networks (DCNNs) are capable of learning unprecedentedly effective image representations. However, their ability to handle significant local and global image rotations remains limited. In this paper, we propose Active Rotating Filters (ARFs) that actively rotate during convolution and produce feature maps with location and orientation explicitly encoded. An ARF acts as a virtual filter bank containing the filter itself and its multiple unmaterialised rotated versions. During back-propagation, an ARF is collectively updated using errors from all its rotated versions. DCNNs using ARFs, referred to as Oriented Response Networks (ORNs), can produce within-class rotation-invariant deep features while maintaining inter-class discrimination for classification tasks. The oriented response produced by ORNs can also be used for image and object orientation estimation tasks. Over multiple state-of-the-art DCNN architectures, such as VGG, ResNet, and STN, we consistently observe that replacing regular filters with the proposed ARFs leads to a significant reduction in the number of network parameters and an improvement in classification performance. We report the best results on several commonly used benchmarks. Comment: Accepted in CVPR 2017. Source code available at http://yzhou.work/OR
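The idea of one filter acting as a bank of its own rotated copies can be sketched in a few lines. The example below is a simplification and not the ORN code: it restricts rotations to multiples of 90 degrees, so `np.rot90` is exact and no interpolation is needed, and it materialises the rotated copies for clarity. The names `rotated_filter_bank` and `oriented_responses` are hypothetical.

```python
import numpy as np
from numpy.lib.stride_tricks import sliding_window_view

def rotated_filter_bank(filt):
    """Stack a square filter with its 90, 180 and 270 degree rotations."""
    return np.stack([np.rot90(filt, k) for k in range(4)])

def oriented_responses(image, filt):
    """Correlate the image with each rotated copy ('valid' region only).

    Returns an array of shape (4, H - k + 1, W - k + 1): one response
    map per orientation, so orientation is an explicit output axis.
    """
    bank = rotated_filter_bank(filt)
    windows = sliding_window_view(image, filt.shape)  # (H', W', k, k)
    return np.einsum("ijkl,okl->oij", windows, bank)
```

In this simplified setting, rotating the input image by 90 degrees rotates each response map and shifts it to the next orientation channel, which is the kind of explicit orientation encoding the abstract describes.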

arXiv.org e-Print Archive

Crossref

Spatial Statistics of Visual Keypoints for Texture Recognition

Author: D. Lowe
D. Stoyan
F. Breiman
F. Goreaud
G. Csurka
G.L. Chenadec
H. Bay
I. Karoui
J. Møller
J. Zhang
K. Mikolajczyk
L. Linnett
M. Cummins
M. Heikkilä
R. Haralick
S. Kotsiantis
S. Lazebnik
T. Randen
Y. Xu
Y. Zhao
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2010

recognition

CiteSeerX

Crossref

On the Design and Analysis of Multiple View Descriptors

Author: Balzer Jonathan
Davis Damek
Dong Jingming
Hernandez Joshua
Soatto Stefano
Publication date: 23/11/2013

We propose an extension of popular descriptors based on gradient orientation histograms (HOG, computed in a single image) to multiple views. It hinges on interpreting HOG as a conditional density in the space of sampled images, where the effects of nuisance factors such as viewpoint and illumination are marginalized. However, such marginalization is performed with respect to a very coarse approximation of the underlying distribution. Our extension leverages the fact that multiple views of the same scene allow separating intrinsic from nuisance variability, and thus afford better marginalization of the latter. The result is a descriptor that has the same complexity as single-view HOG and can be compared in the same manner, but exploits multiple views to better trade off insensitivity to nuisance variability against specificity to intrinsic variability. We also introduce a novel multi-view wide-baseline matching dataset, consisting of a mixture of real and synthetic objects with ground-truthed camera motion and dense three-dimensional geometry.

arXiv.org e-Print Archive

CiteSeerX

Study of object recognition and identification based on shape and texture analysis

Author: Wang Guanqi
Publication venue: Electrical and Electronic Engineering, Imperial College London
Publication date: 01/03/2012

The objective of object recognition is to enable computers to recognize image patterns without human intervention. According to its applications, it is mainly divided into two parts: recognition of object categories and detection/identification of objects. My thesis studies techniques of object feature analysis and identification strategies, which solve the object recognition problem by employing effective and perceptually important object features. Shape information is of particular interest, and a review of shape representation and description is presented, as well as the latest research work on object recognition. In the second chapter of the thesis, a novel content-based approach is proposed for efficient shape classification and retrieval of 2D objects. Two object detection approaches, designed according to the characteristics of the shape context and SIFT descriptors, respectively, are analyzed and compared. It is found that an identification strategy built on a single type of object feature can only recognize the target object under the specific conditions to which the identifier is adapted. These identifiers are usually designed to detect target objects that are rich in the feature type captured by the identifier. In addition, this type of feature often distinguishes the target object from the complex scene. To overcome this constraint, a novel prototype-based object identification method is presented that detects the target object in a complex scene by employing different types of descriptors to capture heterogeneous features. All types of descriptors are modified to meet the requirements of the detection strategy’s framework. Thus the new method is able to describe and identify various kinds of objects whose dominant features are quite different. The identification system employs cosine similarity to evaluate the resemblance between the prototype image and image windows in the complex scene. A ‘resemblance map’ is then established, with the value at each patch representing the likelihood of the target object’s presence. The simulations confirm that this novel object detection strategy is efficient, robust, and invariant to scale and rotation.
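The resemblance-map step described in this abstract can be illustrated with a minimal sketch. It slides a window over the scene and scores each position by cosine similarity against the prototype. As a loud assumption, raw pixel values stand in for the descriptors here, whereas the thesis uses modified shape and texture descriptors that are not specified in the abstract. The function name `resemblance_map` and the `stride` parameter are hypothetical.

```python
import numpy as np

def resemblance_map(scene, prototype, stride=1):
    """Cosine similarity between a prototype and each scene window.

    Both inputs are 2D arrays. Raw pixels act as the feature vector
    (an assumption for illustration, not the thesis's descriptors).
    """
    ph, pw = prototype.shape
    p = prototype.ravel().astype(float)
    p_norm = np.linalg.norm(p)
    rows = (scene.shape[0] - ph) // stride + 1
    cols = (scene.shape[1] - pw) // stride + 1
    out = np.zeros((rows, cols))
    for i in range(rows):
        for j in range(cols):
            r, c = i * stride, j * stride
            w = scene[r:r + ph, c:c + pw].ravel().astype(float)
            denom = p_norm * np.linalg.norm(w)
            # All-zero windows have no direction; score them as 0.
            out[i, j] = (p @ w) / denom if denom > 0 else 0.0
    return out
```

The location of the largest value in the returned map is then the most likely position of the target, under the stated pixel-feature assumption.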

Spiral - Imperial College Digital Repository