5,132 research outputs found

Automatic human face detection for content-based image annotation

Author: Jiang M
Sadka A H
Zhou H
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2008

In this paper, an automatic human face detection approach using colour analysis is applied to content-based image annotation. Probable face regions are first detected by an adaptive boosting algorithm and then passed through a colour filtering classifier to improve detection accuracy. An initial experimental benchmark shows that the proposed scheme can be applied efficiently to image annotation with higher fidelity.
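The two-stage idea in this abstract (candidate regions from a boosted detector, then a colour check) can be sketched as follows. This is only an illustration under stated assumptions: the abstract does not give the colour classifier, so the RGB skin rule, its thresholds, the `min_fraction` parameter and all function names are hypothetical, and the candidate boxes are assumed to come from some upstream detector that is not shown.

```python
from typing import List, Tuple

Pixel = Tuple[int, int, int]      # (R, G, B), each in 0..255
Box = Tuple[int, int, int, int]   # (x, y, width, height)
Image = List[List[Pixel]]         # image[row][col]


def is_skin(pixel: Pixel) -> bool:
    """Hypothetical RGB skin-tone rule; thresholds are illustrative only."""
    r, g, b = pixel
    return (
        r > 95 and g > 40 and b > 20
        and max(pixel) - min(pixel) > 15
        and abs(r - g) > 15
        and r > g and r > b
    )


def skin_fraction(image: Image, box: Box) -> float:
    """Fraction of pixels inside the box that pass the skin rule."""
    x, y, w, h = box
    pixels = [image[row][col]
              for row in range(y, y + h)
              for col in range(x, x + w)]
    if not pixels:
        return 0.0
    return sum(is_skin(p) for p in pixels) / len(pixels)


def filter_candidates(image: Image, boxes: List[Box],
                      min_fraction: float = 0.4) -> List[Box]:
    """Keep only candidate boxes with enough skin-coloured pixels.

    The boxes are assumed to be produced by a boosted detector upstream;
    min_fraction is an assumed, untuned threshold.
    """
    return [b for b in boxes if skin_fraction(image, b) >= min_fraction]


# Usage: a 4x4 image whose left half is skin-toned and right half is blue.
skin, blue = (200, 140, 120), (20, 40, 200)
demo_image = [[skin, skin, blue, blue] for _ in range(4)]
demo_boxes = [(0, 0, 2, 4), (2, 0, 2, 4)]
kept = filter_candidates(demo_image, demo_boxes)
```

In this sketch the colour stage only rejects detector candidates; it never proposes new regions, which is one plausible reading of "combined with" in the abstract.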

Queen's University Belfast Research Portal

Crossref

Lancaster E-Prints

Brunel University Research Archive

Improving Landmark Localization with Semi-Supervised Learning

Author: Honari Sina
Kautz Jan
Molchanov Pavlo
Pal Christopher
Tyree Stephen
Vincent Pascal
Publication date: 01/01/2018

We present two techniques to improve landmark localization in images from partially annotated datasets. Our primary goal is to leverage the common situation where precise landmark locations are only provided for a small data subset, but where class labels for classification or regression tasks related to the landmarks are more abundantly available. First, we propose the framework of sequential multitasking and explore it here through an architecture for landmark localization where training with class labels acts as an auxiliary signal to guide the landmark localization on unlabeled data. A key aspect of our approach is that errors can be backpropagated through a complete landmark localization model. Second, we propose and explore an unsupervised learning technique for landmark localization based on having a model predict equivariant landmarks with respect to transformations applied to the image. We show that these techniques improve landmark prediction considerably and can learn effective detectors even when only a small fraction of the dataset has landmark labels. We present results on two toy datasets and four real datasets, with hands and faces, and report new state-of-the-art results on two datasets in the wild, e.g. with only 5% of labeled images we outperform the previous state of the art trained on the AFLW dataset. Comment: Published as a conference paper in CVPR 201
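The equivariance idea in the second technique can be illustrated with a small sketch: if T is a known transformation and L(.) is the landmark predictor, the landmarks predicted on the transformed image, L(T(I)), should match the transformed landmarks T(L(I)). The code below is a minimal illustration under assumptions, not the paper's loss: the choice of a rotation as T, the mean squared distance, and all function names are hypothetical, and the predictor outputs are passed in as plain coordinate lists.

```python
import math
from typing import Callable, List, Tuple

Point = Tuple[float, float]


def rotate(points: List[Point], angle_rad: float,
           center: Point = (0.0, 0.0)) -> List[Point]:
    """Rotate points counter-clockwise about `center` (the example T)."""
    cx, cy = center
    c, s = math.cos(angle_rad), math.sin(angle_rad)
    return [(cx + (x - cx) * c - (y - cy) * s,
             cy + (x - cx) * s + (y - cy) * c)
            for x, y in points]


def equivariance_loss(landmarks_original: List[Point],
                      landmarks_transformed: List[Point],
                      transform: Callable[[List[Point]], List[Point]]) -> float:
    """Mean squared distance between T(L(I)) and L(T(I)).

    landmarks_original:    predictions on the original image, L(I)
    landmarks_transformed: predictions on the transformed image, L(T(I))
    """
    if not landmarks_original or \
            len(landmarks_original) != len(landmarks_transformed):
        raise ValueError("landmark lists must be non-empty and equal length")
    expected = transform(landmarks_original)
    total = sum((ex - px) ** 2 + (ey - py) ** 2
                for (ex, ey), (px, py) in zip(expected, landmarks_transformed))
    return total / len(expected)


# Usage: a quarter-turn rotation as the known transformation.
quarter_turn = lambda pts: rotate(pts, math.pi / 2)
pred_original = [(1.0, 0.0), (0.0, 1.0)]
consistent = equivariance_loss(pred_original, quarter_turn(pred_original), quarter_turn)
inconsistent = equivariance_loss(pred_original, pred_original, quarter_turn)
```

No landmark labels appear anywhere in this loss, which is why a term of this kind can be applied to unlabeled images.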

arXiv.org e-Print Archive

Crossref

PolyPublie

Dynamics of trimming the content of face representations for categorization in the brain

Author: A Akselrod-Ballin
A Chauvin
A Holmes
AV Flevaris
BJ Liddell
DG Pelli
DH Brainard
DJ Field
E Halgren
F Gosselin
FW Campbell
GA Rousselet
GB Henning
H Kirchner
HC Hughes
HM Morgan
JL McClelland
JS Morris
JS Winston
K Grill-Spector
Karl J. Friston
M Bar
M Eimer
M Livingstone
MG Philiastides
ML Smith
N Sigala
Nicola J. van Rijsbergen
P Ekman
P Rotshtein
P Vuilleumier
PG Schyns
PH Schiller
Philippe G. Schyns
PT Sowden
R Adolphs
S Bentin
S Campanella
S Thorpe
S Ullman
SR Schweinberger
V Goffaux
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2009

To understand visual cognition, it is imperative to determine when, how and with what information the human brain categorizes the visual input. Visual categorization consistently involves at least an early and a late stage: the occipito-temporal N170 event-related potential, related to stimulus encoding, and the parietal P300, involved in perceptual decisions. Here we sought to understand how the brain globally transforms its representations of face categories from their early encoding to the later decision stage over the 400 ms time window encompassing the N170 and P300 brain events. We applied classification image techniques to the behavioral and electroencephalographic data of three observers who categorized seven facial expressions of emotion and report two main findings: (1) Over the 400 ms time course, processing of facial features initially spreads bilaterally across the left and right occipito-temporal regions and then dynamically converges onto the centro-parietal region; (2) Concurrently, information processing gradually shifts from encoding common face features across all spatial scales (e.g. the eyes) to representing only the finer scales of the diagnostic features that are richer in useful information for behavior (e.g. the wide-opened eyes in 'fear'; the detailed mouth in 'happy'). Our findings suggest that the brain refines its diagnostic representations of visual categories over the first 400 ms of processing by trimming a thorough encoding of features over the N170, to leave only the detailed information important for perceptual decisions over the P300.

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Edge Hill University Research Information Repository

Enlighten

Synthesizing Normalized Faces from Facial Identity Features

Author: Belanger David
Cole Forrester
Freeman William T.
Krishnan Dilip
Mosseri Inbar
Sarna Aaron
Publication date: 17/10/2017

We present a method for synthesizing a frontal, neutral-expression image of a person's face given an input face photograph. This is achieved by learning to generate facial landmarks and textures from features extracted from a facial-recognition network. Unlike previous approaches, our encoding feature vector is largely invariant to lighting, pose, and facial expression. Exploiting this invariance, we train our decoder network using only frontal, neutral-expression photographs. Since these photographs are well aligned, we can decompose them into a sparse set of landmark points and aligned texture maps. The decoder then predicts landmarks and textures independently and combines them using a differentiable image warping operation. The resulting images can be used for a number of applications, such as analyzing facial attributes, exposure and white balance adjustment, or creating a 3-D avatar.
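The abstract does not say how the differentiable warping is implemented. A common building block for such operations is bilinear sampling, because the sampled value varies smoothly with the sampling coordinates. The sketch below is an assumption-laden illustration of that building block only: the per-pixel offset field `flow`, the border clamping, and the function names are hypothetical, and no gradient computation is shown.

```python
import math
from typing import List, Tuple

Grid = List[List[float]]   # grid[row][col], single-channel texture


def bilinear_sample(texture: Grid, x: float, y: float) -> float:
    """Sample the texture at fractional (x, y), clamping to the border."""
    height, width = len(texture), len(texture[0])
    x = min(max(x, 0.0), width - 1.0)
    y = min(max(y, 0.0), height - 1.0)
    x0, y0 = int(math.floor(x)), int(math.floor(y))
    x1, y1 = min(x0 + 1, width - 1), min(y0 + 1, height - 1)
    fx, fy = x - x0, y - y0
    top = texture[y0][x0] * (1 - fx) + texture[y0][x1] * fx
    bottom = texture[y1][x0] * (1 - fx) + texture[y1][x1] * fx
    return top * (1 - fy) + bottom * fy


def warp(texture: Grid, flow: List[List[Tuple[float, float]]]) -> Grid:
    """Backward warp: output[r][c] samples texture at (c + dx, r + dy).

    `flow` is a hypothetical per-pixel offset field; in a landmark-driven
    setting it would be interpolated from landmark displacements.
    """
    return [[bilinear_sample(texture, c + dx, r + dy)
             for c, (dx, dy) in enumerate(row)]
            for r, row in enumerate(flow)]


# Usage: shift a 2x2 texture by half a pixel in x.
demo_texture = [[0.0, 10.0], [20.0, 30.0]]
half_shift = [[(0.5, 0.0), (0.5, 0.0)] for _ in range(2)]
shifted = warp(demo_texture, half_shift)
```

Because each output value is a weighted average of neighbouring texels with weights that depend linearly on the offsets, an automatic-differentiation framework could propagate gradients to both the texture and the offsets.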

arXiv.org e-Print Archive

Crossref

Extreme 3D Face Reconstruction: Seeing Through Occlusions

Author: Hassner Tal
Masi Iacopo
Medioni Gerard
Nirkin Yuval
Paz Eran
Tran Anh Tuan
Publication date: 29/03/2018

Existing single-view 3D face reconstruction methods can produce beautifully detailed 3D results, but typically only for near-frontal, unobstructed viewpoints. We describe a system designed to provide detailed 3D reconstructions of faces viewed under extreme conditions, out-of-plane rotations, and occlusions. Motivated by the concept of bump mapping, we propose a layered approach which decouples estimation of a global shape from its mid-level details (e.g., wrinkles). We estimate a coarse 3D face shape which acts as a foundation and then separately layer this foundation with details represented by a bump map. We show how a deep convolutional encoder-decoder can be used to estimate such bump maps. We further show how this approach naturally extends to generate plausible details for occluded facial regions. We test our approach and its components extensively, quantitatively demonstrating the invariance of our estimated facial details. We further provide numerous qualitative examples showing that our method produces detailed 3D face shapes in viewing conditions where existing state-of-the-art methods often break down. Comment: Accepted to CVPR'18. Previously titled: "Extreme 3D Face Reconstruction: Looking Past Occlusions"
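The layering described here (a coarse shape as foundation, with mid-level detail added from a bump map) can be pictured with a very small sketch. It assumes, purely for illustration, that both layers are stored as per-pixel depth grids of the same size and that detail is added as a scaled depth offset; the abstract does not specify the representation, and the `scale` parameter and function name are hypothetical.

```python
from typing import List

DepthMap = List[List[float]]   # depth[row][col]


def apply_bump_map(coarse_depth: DepthMap, bump: DepthMap,
                   scale: float = 1.0) -> DepthMap:
    """Layer mid-level detail over a coarse shape.

    Each output depth is the coarse depth plus a scaled bump offset.
    Both grids must have identical dimensions.
    """
    if len(coarse_depth) != len(bump) or any(
            len(c_row) != len(b_row)
            for c_row, b_row in zip(coarse_depth, bump)):
        raise ValueError("coarse depth and bump map must have the same shape")
    return [[c + scale * b for c, b in zip(c_row, b_row)]
            for c_row, b_row in zip(coarse_depth, bump)]


# Usage: a smooth 2x2 foundation with small wrinkle-like offsets.
coarse = [[1.0, 1.0], [2.0, 2.0]]
wrinkles = [[0.1, -0.1], [0.0, 0.2]]
detailed = apply_bump_map(coarse, wrinkles)
```

Keeping the two layers separate means the coarse shape and the detail can be estimated by different models, which matches the decoupling the abstract describes.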

arXiv.org e-Print Archive

Crossref