Design and implementation of an affect-responsive interactive photo frame

Author: Dibeklioğlu H.
Gevers T.
Kosunen I.
Ortega Hortas M.
Salah A.A.
Zuzánek P.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2011

International Migration, Integration and Social Cohesion online publications

The role of HG in the analysis of temporal iteration and interaural correlation

Author: Barrett DJK
Hall DA
Publication date: 01/01/2004

Nottingham Trent Institutional Repository (IRep)

Vision-Based 2D and 3D Human Activity Recognition

Author: Holte Michael Boelstoft
Publication venue: Department of Architecture, Design & Media Technology, Aalborg University
Publication date: 01/01/2012

VBN

RE@CT - Immersive Production and Delivery of Interactive 3D Content

Author: Boyer Edmond
Grau Oliver
Huang Pen
Knossow David
Maggio Emilio
Schneider David
Publication venue: HAL CCSD
Publication date: 16/10/2012

This paper describes the aims and concepts of the FP7 RE@CT project. Building upon the latest advances in 3D capture and free-viewpoint video, RE@CT aims to revolutionise the production of realistic characters and significantly reduce costs by developing an automated process to extract and represent animated characters from actor performance capture in a multiple-camera studio. The key innovation is the development of methods for analysis and representation of 3D video to allow reuse for real-time interactive animation. This will enable efficient authoring of interactive characters with video-quality appearance and motion.

Hal - Université Grenoble Alpes

Sign Language Recognition

Author: A. Corradini
A. Farhadi
A. Micilotta
A. Rezaei
A. Roussos
B. Bauer
B. Stenger
British Deaf Association
C. Valli
C. Vogler
C. Wang
C.-L. Huang
C.-S. Lee
D. Stein
E. Efthimiou
E. Murphy-Chutorian
E.-J. Ong
E.J. Holden
F. Gaolin
H. Cooper
H. Ershaed
H. Fillbrandt
H. Hienz
H.-D. Yang
I. Oikonomidis
J. Bungeroth
J. Han
J. Isaacs
J. Segen
J. Zieren
J.-S. Kim
J.B. Kim
J.L. Hernandez-Rebollar
J.W. Han
K. Bailly
K. Grobel
K. Lyons
K. Murakami
K.W. Ming
L.G. Zhang
M. Krinidis
M. Ouhyoung
M. Pahlevanzadeh
M. Zahedi
M.-H. Yang
M.B. Waldron
M.W. Kadous
N. Pugeault
O. Aran
P. Doliotis
P. Ekman
P. Goh
P. Heracleous
P. Yin
R. Bowden
R. Elliott
R. Feris
R. Grzeszczuk
R. Munoz-Salinas
R. Sutton-Spence
S. Akyol
S. Hadfield
S. Hong
S. Koelstra
S. Liwicki
S. Mitra
S.-F. Wong
S.C.W. Ong
S.K. Liddell
S.O. Ba
T. Sheerman-Chase
T. Starner
T. Yamaguchi
T.D. Nguyen
T.E. Jerde
U. von Agris
V. Athitsos
W. Gao
W.C. Stokoe
Y. Lan
Y. Yacoob
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2011

This chapter covers the key aspects of sign-language recognition (SLR), starting with a brief introduction to the motivations and requirements, followed by a précis of sign linguistics and their impact on the field. The types of data available and their relative merits are explored, allowing examination of the features that can be extracted. Classifying the manual aspects of sign (similar to gestures) is then discussed from a tracking and non-tracking viewpoint, before summarising some of the approaches to the non-manual aspects of sign languages. Methods for combining the sign classification results into full SLR are given, showing the progression towards speech recognition techniques and the further adaptations required for the sign-specific case. Finally, the current frontiers are discussed and recent research is presented. This covers the task of continuous sign recognition, the work towards true signer independence, how to effectively combine the different modalities of sign, making use of current linguistic research, and adapting to larger, noisier data sets.

Crossref

Surrey Research Insight

Computer vision methods for unconstrained gesture recognition in the context of sign language annotation

Author: Gonzalez Preciado Matilde
Publication date: 24/09/2012

This PhD thesis studies computer vision methods for the automatic recognition of unconstrained gestures in the context of sign language annotation. Sign language (SL) is a visual-gestural language developed by deaf communities. Continuous SL consists of a sequence of signs performed one after another, involving manual and non-manual features that convey information simultaneously. Even though standard signs are defined in dictionaries, their realisation varies greatly with context. In addition, signs are often linked by movement epenthesis, the meaningless gesture between signs. This variability and the co-articulation effect pose a challenging problem for automatic SL processing. Numerous annotated video corpora are needed in order to train statistical machine translators and to study the language. SL video corpora are generally annotated manually by linguists or computer scientists experienced in SL. However, manual annotation is error-prone, unreproducible and time-consuming, and the quality of the results depends on the annotator's knowledge of SL. Combining annotator knowledge with image processing techniques facilitates the annotation task, increasing robustness and reducing the time required.

The goal of this research is to study and develop image processing techniques that assist the annotation of SL video corpora: body tracking, hand segmentation, temporal segmentation and gloss recognition. Throughout the thesis we address the problem of gloss annotation of SL video corpora. We first detect the limits corresponding to the beginning and end of each sign. This annotation method requires several low-level approaches for performing temporal segmentation and for extracting motion and hand-shape features. First, we propose a particle-filter-based approach for tracking the hands and face that is robust to occlusions. Then, a segmentation method is developed for extracting the hand when it is in front of the face. Motion is used to segment signs, and hand shape is later used to improve the results: hand shape allows limits detected in the middle of a sign to be deleted. Once signs have been segmented, we proceed to gloss recognition using lexical descriptions of signs. We evaluated our algorithms on international corpora in order to show their advantages and limitations. The evaluation shows the robustness of the proposed methods with respect to high dynamics and numerous occlusions between body parts. The resulting annotation is independent of the annotator and represents a gain in annotation consistency.

Thèses en ligne de l'Université Toulouse III - Paul Sabatier

Robust density modelling using the student's t-distribution for human action recognition

Author: Moghaddam Z
Piccardi M
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/12/2011

The extraction of human features from videos is often inaccurate and prone to outliers. Such outliers can severely affect density modelling when the Gaussian distribution is used as the model, since it is highly sensitive to outliers. The Gaussian distribution is also often used as the base component of graphical models for recognising human actions in videos (hidden Markov models and others), and the presence of outliers can significantly affect recognition accuracy. In contrast, the Student's t-distribution is more robust to outliers and can be exploited to improve the recognition rate in the presence of abnormal data. In this paper, we present an HMM which uses mixtures of t-distributions as observation probabilities and show that experiments on two well-known datasets (Weizmann, MuHAVi) yielded a remarkable improvement in classification accuracy. © 2011 IEEE
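The robustness claim in this abstract can be illustrated with a minimal sketch. It is not the paper's HMM or mixture model: it fits only a single one-dimensional density, the data are made-up toy values, and the degrees of freedom `nu=3.0` and the function names are assumptions chosen for illustration. The Gaussian maximum-likelihood mean is pulled towards a single outlier, while the standard EM update for a Student's t-distribution with fixed `nu` down-weights it.

```python
def gaussian_mean(xs):
    """Maximum-likelihood mean under a Gaussian model (the plain average)."""
    return sum(xs) / len(xs)


def student_t_location(xs, nu=3.0, iters=50):
    """EM estimate of the location of a 1-D Student's t with fixed nu.

    nu (degrees of freedom) is held fixed here as a simplifying assumption.
    """
    mu = sorted(xs)[len(xs) // 2]                    # start from the median
    var = sum((x - mu) ** 2 for x in xs) / len(xs)   # initial scale estimate
    for _ in range(iters):
        # E-step: points far from mu (relative to var) get small weights.
        w = [(nu + 1.0) / (nu + (x - mu) ** 2 / var) for x in xs]
        # M-step: weighted location, then weighted scale.
        mu = sum(wi * x for wi, x in zip(w, xs)) / sum(w)
        var = sum(wi * (x - mu) ** 2 for wi, x in zip(w, xs)) / len(xs)
    return mu


# Toy data: eight measurements near 1.0 and one gross outlier (hypothetical).
data = [0.9, 1.0, 1.1, 0.95, 1.05, 1.0, 0.98, 1.02, 25.0]
g = gaussian_mean(data)
t = student_t_location(data)
print(f"Gaussian mean: {g:.3f}, Student's t location: {t:.3f}")
```

On this toy data the Gaussian mean lands well above the cluster of inliers, whereas the t-based location stays close to 1.0.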

OPUS - University of Technology Sydney