372 research outputs found

3D Human Face Reconstruction and 2D Appearance Synthesis

Author: Zhao Yajie
Publication venue: UKnowledge
Publication date: 01/01/2018

3D human face reconstruction has been extensively researched for decades due to its wide applications, such as animation, recognition, and 3D-driven appearance synthesis. Although commodity depth sensors have become widely available in recent years, image-based face reconstruction remains significantly valuable, as images are much easier to access and store. In this dissertation, we first propose three image-based face reconstruction approaches according to different assumptions about the inputs. In the first approach, face geometry is extracted from multiple key frames of a video sequence with different head poses. The camera should be calibrated under this assumption. As the first approach is limited to videos, we propose a second approach that focuses on a single image. This approach also improves the geometry by adding fine-grained detail using shading cues. We propose a novel albedo estimation and linear optimization algorithm in this approach. In the third approach, we further loosen the constraint on the input image to arbitrary in-the-wild images. Our proposed approach can robustly reconstruct high-quality models even with extreme expressions and large poses. We then explore the applicability of our face reconstructions to four interesting applications: video face beautification, generating personalized facial blendshapes from image sequences, face video stylizing, and video face replacement. We demonstrate the great potential of our reconstruction approaches in these real-world applications. In particular, with the recent surge of interest in VR/AR, it is increasingly common to see people wearing head-mounted displays (HMDs). However, the large occlusion of the face is a big obstacle to communicating in a face-to-face manner. As another application, we explore hardware/software solutions for synthesizing the face image in the presence of HMDs. We design two setups (experimental and mobile) that integrate two near-IR cameras and one color camera to solve this problem. With our algorithm and prototype, we can achieve photo-realistic results. We further propose a deep neural network to solve the HMD removal problem by treating it as a face inpainting problem. This approach doesn't need special hardware and runs in real time with satisfying results

University of Kentucky

Joint optimization of manifold learning and sparse representations for face and gesture analysis

Author: Ptucha Raymond
Publication venue: RIT Scholar Works
Publication date: 16/04/2013

Face and gesture understanding algorithms are powerful enablers in intelligent vision systems for surveillance, security, entertainment, and smart spaces. In the future, complex networks of sensors and cameras may disperse directions to lost tourists, perform directory lookups in the office lobby, or contact the proper authorities in case of an emergency. To be effective, these systems will need to embrace human subtleties while interacting with people in their natural conditions. Computer vision and machine learning techniques have recently become adept at solving face and gesture tasks using posed datasets in controlled conditions. However, spontaneous human behavior under unconstrained conditions, or in the wild, is more complex and is subject to considerable variability from one person to the next. Uncontrolled conditions such as lighting, resolution, noise, occlusions, pose, and temporal variations complicate the matter further. This thesis advances the field of face and gesture analysis by introducing a new machine learning framework based upon dimensionality reduction and sparse representations that is shown to be robust in posed as well as natural conditions. Dimensionality reduction methods take complex objects, such as facial images, and attempt to learn lower dimensional representations embedded in the higher dimensional data. These alternate feature spaces are computationally more efficient and often more discriminative. The performance of various dimensionality reduction methods on geometric and appearance-based facial attributes is studied, leading to robust facial pose and expression recognition models. The parsimonious nature of sparse representations (SR) has successfully been exploited for the development of highly accurate classifiers for various applications. Despite the successes of SR techniques, large dictionaries and high dimensional data can make these classifiers computationally demanding. Further, sparse classifiers are subject to the adverse effects of a phenomenon known as coefficient contamination, where, for example, variations in pose may affect identity and expression recognition. This thesis analyzes the interaction between dimensionality reduction and sparse representations to present a unified sparse representation classification framework that addresses both issues of computational complexity and coefficient contamination. Semi-supervised dimensionality reduction is shown to mitigate the coefficient contamination problems associated with SR classifiers. The combination of semi-supervised dimensionality reduction with SR systems forms the cornerstone for a new face and gesture framework called Manifold based Sparse Representations (MSR). MSR is shown to deliver state-of-the-art facial understanding capabilities. To demonstrate the applicability of MSR to new domains, MSR is expanded to include temporal dynamics. The joint optimization of dimensionality reduction and SRs for classification purposes is a relatively new field. The combination of both concepts into a single objective function produces a relation that is neither convex nor directly solvable. This thesis studies this problem to introduce a new jointly optimized framework. This framework, termed LGE-KSVD, utilizes variants of Linear extension of Graph Embedding (LGE) along with modified K-SVD dictionary learning to jointly learn the dimensionality reduction matrix, sparse representation dictionary, sparse coefficients, and sparsity-based classifier. By injecting LGE concepts directly into the K-SVD learning procedure, this research removes the support constraints K-SVD imparts on dictionary element discovery. Results are shown for facial recognition, facial expression recognition, and human activity analysis; with the addition of a concept called active difference signatures, the framework also delivers robust gesture recognition from Kinect or similar depth cameras

RIT Scholar Works

SIFT Flow: Dense Correspondence across Scenes and its Applications

Author: Freeman William T., Liu Ce, Torralba Antonio, Yuen Jenny
Publication date: 08/05/2010

While image alignment has been studied in different areas of computer vision for decades, aligning images depicting different scenes remains a challenging problem. Analogous to optical flow where an image is aligned to its temporally adjacent frame, we propose SIFT flow, a method to align an image to its nearest neighbors in a large image corpus containing a variety of scenes. The SIFT flow algorithm consists of matching densely sampled, pixel-wise SIFT features between two images, while preserving spatial discontinuities. The SIFT features allow robust matching across different scene/object appearances, whereas the discontinuity-preserving spatial model allows matching of objects located at different parts of the scene. Experiments show that the proposed approach robustly aligns complex scene pairs containing significant spatial differences. Based on SIFT flow, we propose an alignment-based large database framework for image analysis and synthesis, where image information is transferred from the nearest neighbors to a query image according to the dense scene correspondence. This framework is demonstrated through concrete applications, such as motion field prediction from a single image, motion synthesis via object transfer, satellite image registration and face recognition

DSpace@MIT

Super-resolution: A comprehensive survey

Author: Kamal Nasrollahi, Thomas B. Moeslund
Publication venue: Springer Science and Business Media LLC
Publication date: 14/06/2014

Crossref

VBN

State of the Art in Face Recognition

Publication venue: IntechOpen
Publication date: 20/04/2021

Notwithstanding the tremendous effort to solve the face recognition problem, it is not yet possible to design a face recognition system with a potential close to human performance. New computer vision and pattern recognition approaches need to be investigated. New knowledge and perspectives from different fields, such as psychology and neuroscience, must also be incorporated into the current field of face recognition to design a robust face recognition system. Indeed, many more efforts are required to end up with a human-like face recognition system. This book tries to reduce the gap between the previous state of face recognition research and its future state

Directory of Open Access Books (DOAB)

Automatic Emotion Recognition: Quantifying Dynamics and Structure in Human Behavior.

Author: Kim Yelin
Publication date: 01/01/2016

Emotion is a central part of human interaction, one that has a huge influence on its overall tone and outcome. Today's human-centered interactive technology can greatly benefit from automatic emotion recognition, as the extracted affective information can be used to measure, transmit, and respond to user needs. However, developing such systems is challenging due to the complexity of emotional expressions and their dynamics in terms of the inherent multimodality between audio and visual expressions, as well as the mixed factors of modulation that arise when a person speaks. To overcome these challenges, this thesis presents data-driven approaches that can quantify the underlying dynamics in audio-visual affective behavior. The first set of studies lays the foundation and central motivation of this thesis. We discover that it is crucial to model complex non-linear interactions between audio and visual emotion expressions, and that dynamic emotion patterns can be used in emotion recognition. Next, the understanding of the complex characteristics of emotion from the first set of studies leads us to examine multiple sources of modulation in audio-visual affective behavior. Specifically, we focus on how speech modulates facial displays of emotion. We develop a framework that uses speech signals, which alter the temporal dynamics of individual facial regions, to temporally segment and classify facial displays of emotion. Finally, we present methods to discover regions of emotionally salient events in given audio-visual data. We demonstrate that different modalities, such as the upper face, lower face, and speech, express emotion with different timings and time scales, varying for each emotion type. We further extend this idea into another aspect of human behavior: human action events in videos. We show how transition patterns between events can be used for automatically segmenting and classifying action events. Our experimental results on audio-visual datasets show that the proposed systems not only improve performance, but also provide descriptions of how affective behaviors change over time. We conclude this dissertation with future directions that will advance three main research topics: machine adaptation for personalized technology, human-human interaction assistant systems, and human-centered multimedia content analysis.
PhD, Electrical Engineering: Systems, University of Michigan, Horace H. Rackham School of Graduate Studies
http://deepblue.lib.umich.edu/bitstream/2027.42/133459/1/yelinkim_1.pd

Deep Blue Documents at the University of Michigan

Improved Two-Dimensional Warping

Author: Mardziel Piotr
Publication venue: Digital WPI
Publication date: 24/08/2005

The well-known dynamic programming time warping algorithm (DTW) provides an optimal matching between 1-dimensional sequences in polynomial time. Finding an optimal 2-dimensional warping is an NP-complete problem. Hence, only approximate non-exponential time 2-dimensional warping algorithms currently exist. A polynomial time 2-dimensional approximation algorithm was proposed recently. This project provides a thorough analytical and experimental study of this algorithm. Its time complexity is improved from O(N^6) to O(N^4). An extension of the algorithm to 3D and potential higher-dimensional applications are described
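For readers unfamiliar with the 1-dimensional case mentioned in the abstract above, the following is a minimal sketch of the classic dynamic-programming DTW recurrence. It is not taken from the project itself: the function name `dtw_distance`, the absolute-difference local cost, and the unconstrained warping window are all assumptions made for illustration, and the 2-dimensional approximation algorithm the project studies is not shown.

```python
def dtw_distance(a, b):
    """Return the DTW cost between two 1-D numeric sequences.

    Illustrative sketch only: uses |x - y| as the local cost and no
    warping-window constraint. Runs in O(len(a) * len(b)) time.
    """
    n, m = len(a), len(b)
    inf = float("inf")
    # cost[i][j] = best cumulative cost of aligning a[:i] with b[:j]
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            local = abs(a[i - 1] - b[j - 1])
            # Extend the cheapest of the three allowed predecessor alignments:
            # step in a only, step in b only, or step in both.
            cost[i][j] = local + min(
                cost[i - 1][j],
                cost[i][j - 1],
                cost[i - 1][j - 1],
            )
    return cost[n][m]
```

The table has (n + 1) by (m + 1) cells and each is filled in constant time, which is where the polynomial bound for the 1-dimensional case comes from.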

DigitalCommons@WPI

Classifying Human Leg Motions with Uniaxial Piezoelectric Gyroscopes

Author: Orkun Tunçel, Kerem Altun, Billur Barshan
Publication venue: Molecular Diversity Preservation International (MDPI)
Publication date: 01/01/2009

This paper provides a comparative study on the different techniques of classifying human leg motions that are performed using two low-cost uniaxial piezoelectric gyroscopes worn on the leg. A number of feature sets, extracted from the raw inertial sensor data in different ways, are used in the classification process. The classification techniques implemented and compared in this study are: Bayesian decision making (BDM), a rule-based algorithm (RBA) or decision tree, least-squares method (LSM), k-nearest neighbor algorithm (k-NN), dynamic time warping (DTW), support vector machines (SVM), and artificial neural networks (ANN). A performance comparison of these classification techniques is provided in terms of their correct differentiation rates, confusion matrices, computational cost, and training and storage requirements. Three different cross-validation techniques are employed to validate the classifiers. The results indicate that BDM, in general, results in the highest correct classification rate with relatively small computational cost

Multidisciplinary Digital Publishing Institute

CiteSeerX

Crossref

Bilkent University Institutional Repository

Directory of Open Access Journals

PubMed Central