Search CORE

1,802 research outputs found

Recognition of nonmanual markers in American Sign Language (ASL) using non-parametric adaptive 2D-3D face tracking

Author: Liu Bo
Metaxas Dimitris
Michael Nicholas
Neidle Carol
Yang Fei
Yang Peng
Publication venue: EUROPEAN LANGUAGE RESOURCES ASSOC-ELRA
Publication date: 01/01/2012
Field of study

This paper addresses the problem of automatically recognizing linguistically significant nonmanual expressions in American Sign Language from video. We develop a fully automatic system that is able to track facial expressions and head movements, and detect and recognize facial events continuously from video. The main contributions of the proposed framework are the following: (1) We have built a stochastic and adaptive ensemble of face trackers to address factors resulting in lost face track; (2) We combine 2D and 3D deformable face models to warp input frames, thus correcting for any variation in facial appearance resulting from changes in 3D head pose; (3) We use a combination of geometric features and texture features extracted from a canonical frontal representation. The proposed new framework makes it possible to detect grammatically significant nonmanual expressions from continuous signing and to differentiate successfully among linguistically significant expressions that involve subtle differences in appearance. We present results that are based on the use of a dataset containing 330 sentences from videos that were collected and linguistically annotated at Boston University

Boston University Institutional Repository (OpenBU)

Variational Autoencoders for Deforming 3D Mesh Models

Author: Gao Lin
Lai Yu-Kun
Tan Qingyang
Xia Shihong
Publication venue
Publication date: 28/03/2018
Field of study

3D geometric contents are becoming increasingly popular. In this paper, we study the problem of analyzing deforming 3D meshes using deep neural networks. Deforming 3D meshes are flexible to represent 3D animation sequences as well as collections of objects of the same category, allowing diverse shapes with large-scale non-linear deformations. We propose a novel framework which we call mesh variational autoencoders (mesh VAE), to explore the probabilistic latent space of 3D surfaces. The framework is easy to train, and requires very few training examples. We also propose an extended model which allows flexibly adjusting the significance of different latent variables by altering the prior distribution. Extensive experiments demonstrate that our general framework is able to learn a reasonable representation for a collection of deformable shapes, and produce competitive results for a variety of applications, including shape generation, shape interpolation, shape space embedding and shape exploration, outperforming state-of-the-art methods.Comment: CVPR 201

arXiv.org e-Print Archive

Crossref

A Survey on Deep Learning in Medical Image Analysis

Author: Bejnordi Babak Ehteshami
Ciompi Francesco
Ghafoorian Mohsen
Kooi Thijs
Litjens Geert
Setio Arnaud Arindra Adiyoso
Sánchez Clara I.
van der Laak Jeroen A. W. M.
van Ginneken Bram
Publication venue: 'Elsevier BV'
Publication date: 01/01/2017
Field of study

Deep learning algorithms, in particular convolutional networks, have rapidly become a methodology of choice for analyzing medical images. This paper reviews the major deep learning concepts pertinent to medical image analysis and summarizes over 300 contributions to the field, most of which appeared in the last year. We survey the use of deep learning for image classification, object detection, segmentation, registration, and other tasks and provide concise overviews of studies per application area. Open challenges and directions for future research are discussed.Comment: Revised survey includes expanded discussion section and reworked introductory section on common deep architectures. Added missed papers from before Feb 1st 201

arXiv.org e-Print Archive

Radboud Repository

Imaging Informatics and the Human Brain Project: the Role of Structure, Review

Author: Brinkley James F
Rosse Cornelius
Publication venue
Publication date: 01/01/2002
Field of study

(no abstract

University of Washington Structural Informatics Group Publications

A survey on deep geometry learning: from a representation perspective

Author: Gao Lin
Lai Yu-Kun
Li Chunpeng
Xiao Yun-Peng
Zhang Fang-Lue
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 15/04/2020
Field of study

Researchers have achieved great success in dealing with 2D images using deep learning. In recent years, 3D computer vision and geometry deep learning have gained ever more attention. Many advanced techniques for 3D shapes have been proposed for different applications. Unlike 2D images, which can be uniformly represented by a regular grid of pixels, 3D shapes have various representations, such as depth images, multi-view images, voxels, point clouds, meshes, implicit surfaces, etc. The performance achieved in different applications largely depends on the representation used, and there is no unique representation that works well for all applications. Therefore, in this survey, we review recent developments in deep learning for 3D geometry from a representation perspective, summarizing the advantages and disadvantages of different representations for different applications. We also present existing datasets in these representations and further discuss future research directions

arXiv.org e-Print Archive

Online Research @ Cardiff

Shape Retrieval of Non-rigid 3D Human Models

Author: A Elad
A Giachetti
A Vedaldi
A. Ben Hamza
A. Bronstein
A. Giachetti
A. Godil
A. Tatsuma
AM Bronstein
B Li
B Li
B. Li
C Ionescu
C Li
C Li
C Li
C. Li
D. Pickup
F Heijden Van Der
G. Tam
GE Hinton
GK Tam
H. Johan
H. Li
J Sun
J. Han
J. Ye
KQ Weinberger
L. Isaia
L. Lai
L. Sun
M Kac
M Ovsjanikov
M Reuter
M. Aono
M. Bronstein
N Hasler
P. L. Rosin
R Gal
R Litman
R Osada
R. Litman
R. R. Martin
RO Duda
S Bu
S Bu
S Valette
S. Bu
S. Cheng
U. Castellani
V. Garro
X. Liu
X. Sun
Y Lipman
Y Rubner
Y. Lu
Z Lian
Z Lian
Z. Cheng
Z. Lian
Z. Liu
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2016
Field of study

3D models of humans are commonly used within computer graphics and vision, and so the ability to distinguish between body shapes is an important shape retrieval problem. We extend our recent paper which provided a benchmark for testing non-rigid 3D shape retrieval algorithms on 3D human models. This benchmark provided a far stricter challenge than previous shape benchmarks. We have added 145 new models for use as a separate training set, in order to standardise the training data used and provide a fairer comparison. We have also included experiments with the FAUST dataset of human scans. All participants of the previous benchmark study have taken part in the new tests reported here, many providing updated results using the new data. In addition, further participants have also taken part, and we provide extra analysis of the retrieval results. A total of 25 different shape retrieval methods are compared

Crossref

Online Research @ Cardiff

Springer - Publisher Connector

Fraunhofer-ePrints

Catalogo dei prodotti della ricerca

Cronfa at Swansea University