Search CORE

183 research outputs found

Content-Based Video Retrieval in Historical Collections of the German Broadcasting Archive

Author: B Zhou
D Albertson
G Marchionini
J Matas
M Mühling
PN Belhumeur
R Salakhutdinov
T Ahonen
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 13/02/2017
Field of study

The German Broadcasting Archive (DRA) maintains the cultural heritage of radio and television broadcasts of the former German Democratic Republic (GDR). The uniqueness and importance of the video material stimulates a large scientific interest in the video content. In this paper, we present an automatic video analysis and retrieval system for searching in historical collections of GDR television recordings. It consists of video analysis algorithms for shot boundary detection, concept classification, person recognition, text recognition and similarity search. The performance of the system is evaluated from a technical and an archival perspective on 2,500 hours of GDR television recordings.Comment: TPDL 2016, Hannover, Germany. Final version is available at Springer via DO

arXiv.org e-Print Archive

Crossref

DeepEthnic: Multi-Label Ethnic Classification from Face Images

Author: A Ahmed
C Xie
J Kittler
N Kumar
PJ Phillips
PN Belhumeur
W Wang
Z Yang
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 06/12/2019
Field of study

Ethnic group classification is a well-researched problem, which has been pursued mainly during the past two decades via traditional approaches of image processing and machine learning. In this paper, we propose a method of classifying an image face into an ethnic group by applying transfer learning from a previously trained classification network for large-scale data recognition. Our proposed method yields state-of-the-art success rates of 99.02%, 99.76%, 99.2%, and 96.7%, respectively, for the four ethnic groups: African, Asian, Caucasian, and Indian

arXiv.org e-Print Archive

Crossref

A Family of Maximum Margin Criterion for Adaptive Learning

Author: CM Bishop
G-B Huang
H Li
H Xiong
H Yu
J Liu
J Xiao
J Yang
JM Geusebroek
K Labusch
M Cheng
M Li
MA Turk
PN Belhumeur
S Yan
T Ojala
T-H Chan
Y Lecun
Publication venue
Publication date: 01/01/2018
Field of study

In recent years, pattern analysis plays an important role in data mining and recognition, and many variants have been proposed to handle complicated scenarios. In the literature, it has been quite familiar with high dimensionality of data samples, but either such characteristics or large data have become usual sense in real-world applications. In this work, an improved maximum margin criterion (MMC) method is introduced firstly. With the new definition of MMC, several variants of MMC, including random MMC, layered MMC, 2D^2 MMC, are designed to make adaptive learning applicable. Particularly, the MMC network is developed to learn deep features of images in light of simple deep networks. Experimental results on a diversity of data sets demonstrate the discriminant ability of proposed MMC methods are compenent to be adopted in complicated application scenarios.Comment: 14 page

arXiv.org e-Print Archive

Crossref

Research Online

3D Face Reconstruction from Light Field Images: A Model-free Approach

Author: A Jourabloo
A Mian
A Savran
AF Abate
AJ Whitehouse
C Cao
C Queirolo
DW Tan
H Sheng
I Kemelmacher-Shlizerman
K Honauer
K Marwah
O Aldrian
P Hammond
P Hammond
PN Belhumeur
S Wanner
S Zhang
SZ Gilani
SZ Gilani
SZ Gilani
T Fang
T-C Wang
Publication venue
Publication date: 05/07/2018
Field of study

Reconstructing 3D facial geometry from a single RGB image has recently instigated wide research interest. However, it is still an ill-posed problem and most methods rely on prior models hence undermining the accuracy of the recovered 3D faces. In this paper, we exploit the Epipolar Plane Images (EPI) obtained from light field cameras and learn CNN models that recover horizontal and vertical 3D facial curves from the respective horizontal and vertical EPIs. Our 3D face reconstruction network (FaceLFnet) comprises a densely connected architecture to learn accurate 3D facial curves from low resolution EPIs. To train the proposed FaceLFnets from scratch, we synthesize photo-realistic light field images from 3D facial scans. The curve by curve 3D face estimation approach allows the networks to learn from only 14K images of 80 identities, which still comprises over 11 Million EPIs/curves. The estimated facial curves are merged into a single pointcloud to which a surface is fitted to get the final 3D face. Our method is model-free, requires only a few training samples to learn FaceLFnet and can reconstruct 3D faces with high accuracy from single light field images under varying poses, expressions and lighting conditions. Comparison on the BU-3DFE and BU-4DFE datasets show that our method reduces reconstruction errors by over 20% compared to recent state of the art

arXiv.org e-Print Archive

Crossref

Photometric stereo for 3D face reconstruction using non-linear illumination models

Author: A Robles-Kelly
G Der-Maaten van
IA Kakadiaris
PN Belhumeur
R Basri
R Woodham
SN Kautkar
ST Roweis
T Papadhimitri
V Argyriou
V Silva
X He
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2016
Field of study

Face recognition in presence of illumination changes, variant pose and different facial expressions is a challenging problem. In this paper, a method for 3D face reconstruction using photometric stereo and without knowing the illumination directions and facial expression is proposed in order to achieve improvement in face recognition. A dimensionality reduction method was introduced to represent the face deformations due to illumination variations and self shadows in a lower space. The obtained mapping function was used to determine the illumination direction of each input image and that direction was used to apply photometric stereo. Experiments with faces were performed in order to evaluate the performance of the proposed scheme. From the experiments it was shown that the proposed approach results very accurate 3D surfaces without knowing the light directions and with a very small differences compared to the case of known directions. As a result the proposed approach is more general and requires less restrictions enabling 3D face recognition methods to operate with less data

Crossref

Kingston University Research Repository

WestminsterResearch

Discriminative sparse representation for face recognition

Author: AS Georghiades
B Quanz
C Zhang
Edwin R. Hancock
J Wright
J Wright
Lu Bai
M Yang
M Zheng
PN Belhumeur
S Ji
X He
Y Peng
Yuanheng Liang
Zhihong Zhang
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2016
Field of study

Crossref

White Rose Research Online

Weighted Fisher Discriminant Analysis in the Input and Feature Spaces

Author: BN Parlett
H Mohammadzade
J Ah-Pine
J Friedman
J Nocedal
JL Alperin
JS Hamid
M Loog
P Jain
PN Belhumeur
R Lotlikar
RA Fisher
S Boyd
T Hastie
V Perlibakas
XY Zhang
YQ Wang
Z Zhang
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 04/04/2020
Field of study

Fisher Discriminant Analysis (FDA) is a subspace learning method which minimizes and maximizes the intra- and inter-class scatters of data, respectively. Although, in FDA, all the pairs of classes are treated the same way, some classes are closer than the others. Weighted FDA assigns weights to the pairs of classes to address this shortcoming of FDA. In this paper, we propose a cosine-weighted FDA as well as an automatically weighted FDA in which weights are found automatically. We also propose a weighted FDA in the feature space to establish a weighted kernel FDA for both existing and newly proposed weights. Our experiments on the ORL face recognition dataset show the effectiveness of the proposed weighting schemes.Comment: Accepted (to appear) in International Conference on Image Analysis and Recognition (ICIAR) 2020, Springe

arXiv.org e-Print Archive

Crossref

Biometrically linking document leakage to the individuals responsible

Author: D Bouslimi
D Quick
ES Pilli
G Magklaras
I Homem
J Kornblum
J Oliver
PN Belhumeur
S Khan
S Rahayu Selamat
V Roussev
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2018
Field of study

Insider threats are a significant security issue. The last decade has witnessed countless instances of data loss and exposure in which data has become publicly available and easily accessible. Losing or disclosing sensitive data or confidential information may cause substantial financial and reputational damage to a company. Whilst more recent research has specifically focused on the insider misuse problem, it has tended to focus on the information itself – either through its protection or approaches to detect leakage. In contrast, this paper presents a proactive approach to the attribution of misuse via information leakage using biometrics and a locality-sensitive hashing scheme. The hash digest of the object (e.g. a document) is mapped with the given biometric information of the person who interacted with it and generates a digital imprint file that represents the correlation between the two parties. The proposed approach does not directly store or preserve any explicit biometric information nor document copy in a repository. It is only the established correlation (imprint) is kept for the purpose of reconstructing the mapped information once an incident occurred. Comprehensive experiments for the proposed approach have shown that it is highly possible to establish this correlation even when the original version has undergone significant file modification. In many scenarios, such as changing the file format r removing parts of the document, including words and sentences, it was possible to extract and reconstruct the correlated biometric information out of a modified document (e.g. 100 words were deleted) with an average success rate of 89.31%

Crossref

Portsmouth University Research Portal (Pure)

Research Online @ ECU

Robust eye center localization based on an improved SVR method

Author: A George
A Villanueva
Alex J. Smola
C Gou
Haibin Cai
Marco Leo
N Markuš
O Jesorsky
PN Belhumeur
R. Valenti
V Le
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 18/11/2018
Field of study

Crossref

Portsmouth University Research Portal (Pure)

Manifold Elastic Net: A Unified Framework for Sparse Dimension Reduction

Author: A D’aspremont
B Efron
C Fyfe
CHQ Ding
CM Bishop
D Cai
D Tao
D Tao
D Tao
Dacheng Tao
DB Graham
DL Donoho
E Candes
EJ Candes
GH Golub
GM James
H Hotelling
H Zou
H Zou
H Zou
H Zou
HP Kriegel
J Fan
J Fan
J Lv
JB Tenenbaum
M Belkin
M Belkin
PJ Phillips
PN Belhumeur
R Tibshirani
RA Fisher
S Yan
ST Roweis
T Li
T Zhang
Tianyi Zhou
X He
X Li
Xindong Wu
Z Zhang
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 26/07/2010
Field of study

It is difficult to find the optimal sparse solution of a manifold learning based dimensionality reduction algorithm. The lasso or the elastic net penalized manifold learning based dimensionality reduction is not directly a lasso penalized least square problem and thus the least angle regression (LARS) (Efron et al. \cite{LARS}), one of the most popular algorithms in sparse learning, cannot be applied. Therefore, most current approaches take indirect ways or have strict settings, which can be inconvenient for applications. In this paper, we proposed the manifold elastic net or MEN for short. MEN incorporates the merits of both the manifold learning based dimensionality reduction and the sparse learning based dimensionality reduction. By using a series of equivalent transformations, we show MEN is equivalent to the lasso penalized least square problem and thus LARS is adopted to obtain the optimal sparse solution of MEN. In particular, MEN has the following advantages for subsequent classification: 1) the local geometry of samples is well preserved for low dimensional data representation, 2) both the margin maximization and the classification error minimization are considered for sparse projection calculation, 3) the projection matrix of MEN improves the parsimony in computation, 4) the elastic net penalty reduces the over-fitting problem, and 5) the projection matrix of MEN can be interpreted psychologically and physiologically. Experimental evidence on face recognition over various popular datasets suggests that MEN is superior to top level dimensionality reduction algorithms.Comment: 33 pages, 12 figure

arXiv.org e-Print Archive

CiteSeerX

Crossref

OPUS - University of Technology Sydney