Search CORE

1,614 research outputs found

Human behavior understanding for worker-centered intelligent manufacturing

Author: Tao Wenjin
Publication venue: Scholars\u27 Mine
Publication date: 01/01/2020
Field of study

“In a worker-centered intelligent manufacturing system, sensing and understanding of the worker’s behavior are the primary tasks, which are essential for automatic performance evaluation & optimization, intelligent training & assistance, and human-robot collaboration. In this study, a worker-centered training & assistant system is proposed for intelligent manufacturing, which is featured with self-awareness and active-guidance. To understand the hand behavior, a method is proposed for complex hand gesture recognition using Convolutional Neural Networks (CNN) with multiview augmentation and inference fusion, from depth images captured by Microsoft Kinect. To sense and understand the worker in a more comprehensive way, a multi-modal approach is proposed for worker activity recognition using Inertial Measurement Unit (IMU) signals obtained from a Myo armband and videos from a visual camera. To automatically learn the importance of different sensors, a novel attention-based approach is proposed to human activity recognition using multiple IMU sensors worn at different body locations. To deploy the developed algorithms to the factory floor, a real-time assembly operation recognition system is proposed with fog computing and transfer learning. The proposed worker-centered training & assistant system has been validated and demonstrated the feasibility and great potential for applying to the manufacturing industry for frontline workers. Our developed approaches have been evaluated: 1) the multi-view approach outperforms the state-of-the-arts on two public benchmark datasets, 2) the multi-modal approach achieves an accuracy of 97% on a worker activity dataset including 6 activities and achieves the best performance on a public dataset, 3) the attention-based method outperforms the state-of-the-art methods on five publicly available datasets, and 4) the developed transfer learning model achieves a real-time recognition accuracy of 95% on a dataset including 10 worker operations”--Abstract, page iv

Missouri University of Science and Technology (Missouri S&T): Scholars' Mine

Chapter From the Lab to the Real World: Affect Recognition Using Multiple Cues and Modalities

Author: Gunes Hatice
Or Jimmy
Pantic Maja
Piccardi Massimo
Publication venue: 'IntechOpen'
Publication date: 02/06/2021
Field of study

Interdisciplinary concept of dissipative soliton is unfolded in connection with ultrafast fibre lasers. The different mode-locking techniques as well as experimental realizations of dissipative soliton fibre lasers are surveyed briefly with an emphasis on their energy scalability. Basic topics of the dissipative soliton theory are elucidated in connection with concepts of energy scalability and stability. It is shown that the parametric space of dissipative soliton has reduced dimension and comparatively simple structure that simplifies the analysis and optimization of ultrafast fibre lasers. The main destabilization scenarios are described and the limits of energy scalability are connected with impact of optical turbulence and stimulated Raman scattering. The fast and slow dynamics of vector dissipative solitons are exposed

Directory of Open Access Books (DOAB)

Sign Language Recognition

Author: A. Corradini
A. Farhadi
A. Micilotta
A. Rezaei
A. Roussos
B. Bauer
B. Bauer
B. Stenger
B. Stenger
British Deaf Association
C. Valli
C. Vogler
C. Vogler
C. Vogler
C. Vogler
C. Wang
C.-L. Huang
C.-S. Lee
D. Stein
E. Efthimiou
E. Murphy-Chutorian
E.-J. Ong
E.-J. Ong
E.J. Holden
E.J. Holden
F. Gaolin
H. Cooper
H. Cooper
H. Cooper
H. Ershaed
H. Fillbrandt
H. Hienz
H.-D. Yang
I. Oikonomidis
J. Bungeroth
J. Han
J. Isaacs
J. Segen
J. Zieren
J.-S. Kim
J.B. Kim
J.L. Hernandez-Rebollar
J.W. Han
K. Bailly
K. Grobel
K. Lyons
K. Murakami
K.W. Ming
L.G. Zhang
M. Krinidis
M. Ouhyoung
M. Pahlevanzadeh
M. Zahedi
M. Zahedi
M.-H. Yang
M.B. Waldron
M.W. Kadous
N. Pugeault
O. Aran
P. Doliotis
P. Ekman
P. Goh
P. Heracleous
P. Yin
R. Bowden
R. Elliott
R. Feris
R. Grzeszcuk
R. Munoz-Salinas
R. Sutton-Spence
S. Akyol
S. Hadfield
S. Hong
S. Koelstra
S. Liwicki
S. Mitra
S.-F. Wong
S.C.W. Ong
S.K. Liddell
S.O. Ba
T. Sheerman-Chase
T. Starner
T. Starner
T. Yamaguchi
T.D. Nguyen
T.E. Jerde
U. Agris von
U. Agris von
V. Athitsos
W. Gao
W.C. Stokoe
Y. Lan
Y. Yacoob
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2011
Field of study

This chapter covers the key aspects of sign-language recognition (SLR), starting with a brief introduction to the motivations and requirements, followed by a précis of sign linguistics and their impact on the field. The types of data available and the relative merits are explored allowing examination of the features which can be extracted. Classifying the manual aspects of sign (similar to gestures) is then discussed from a tracking and non-tracking viewpoint before summarising some of the approaches to the non-manual aspects of sign languages. Methods for combining the sign classification results into full SLR are given showing the progression towards speech recognition techniques and the further adaptations required for the sign specific case. Finally the current frontiers are discussed and the recent research presented. This covers the task of continuous sign recognition, the work towards true signer independence, how to effectively combine the different modalities of sign, making use of the current linguistic research and adapting to larger more noisy data set

Crossref

Surrey Research Insight

Gesture and sign language recognition with deep learning

Author: Pigou Lionel
Publication venue: Ghent University. Faculty of Engineering and Architecture
Publication date: 01/01/2018
Field of study

Ghent University Academic Bibliography

Face and Body gesture recognition for a vision-based multimodal analyser

Author: Gunes H
Jan T
Piccardi M
Publication venue: 'American Chemical Society (ACS)'
Publication date: 01/01/2004
Field of study

users, computers should be able to recognize emotions, by analyzing the human's affective state, physiology and behavior. In this paper, we present a survey of research conducted on face and body gesture and recognition. In order to make human-computer interfaces truly natural, we need to develop technology that tracks human movement, body behavior and facial expression, and interprets these movements in an affective way. Accordingly in this paper, we present a framework for a vision-based multimodal analyzer that combines face and body gesture and further discuss relevant issues

CiteSeerX

OPUS - University of Technology Sydney

Automatic Sign Language Recognition from Image Data

Author: Campr Pavel
Publication venue: Západočeská univerzita v Plzni
Publication date: 12/02/2013
Field of study

Tato práce se zabývá problematikou automatického rozpoznávání znakového jazyka z obrazových dat. Práce představuje pět hlavních přínosů v oblasti tvorby systému pro rozpoznávání, tvorby korpusů, extrakci příznaků z rukou a obličeje s využitím metod pro sledování pozice a pohybu rukou (tracking) a modelování znaků s využitím menších fonetických jednotek (sub-units). Metody využité v rozpoznávacím systému byly využity i k tvorbě vyhledávacího nástroje "search by example", který dokáže vyhledávat ve videozáznamech podle obrázku ruky. Navržený systém pro automatické rozpoznávání znakového jazyka je založen na statistickém přístupu s využitím skrytých Markovových modelů, obsahuje moduly pro analýzu video dat, modelování znaků a dekódování. Systém je schopen rozpoznávat jak izolované, tak spojité promluvy. Veškeré experimenty a vyhodnocení byly provedeny s vlastními korpusy UWB-06-SLR-A a UWB-07-SLR-P, první z nich obsahuje 25 znaků, druhý 378. Základní extrakce příznaků z video dat byla provedena na nízkoúrovňových popisech obrazu. Lepších výsledků bylo dosaženo s příznaky získaných z popisů vyšší úrovně porozumění obsahu v obraze, které využívají sledování pozice rukou a metodu pro segmentaci rukou v době překryvu s obličejem. Navíc, využitá metoda dokáže interpolovat obrazy s obličejem v době překryvu a umožňuje tak využít metody pro extrakci příznaků z obličeje, které by během překryvu nefungovaly, jako např. metoda active appearance models (AAM). Bylo porovnáno několik různých metod pro extrakci příznaků z rukou, jako např. local binary patterns (LBP), histogram of oriented gradients (HOG), vysokoúrovnové lingvistické příznaky a nové navržená metoda hand shape radial distance function (hRDF). Bylo také zkoumáno využití menších fonetických jednotek, než jsou celé znaky, tzv. sub-units. Pro první krok tvorby těchto jednotek byl navržen iterativní algoritmus, který tyto jednotky automaticky vytváří analýzou existujících dat. Bylo ukázáno, že tento koncept je vhodný pro modelování a rozpoznávání znaků. Kromě systému pro rozpoznávání je v práci navržen a představen systém "search by example", který funguje jako vyhledávací systém pro videa se záznamy znakového jazyka a může být využit například v online slovnících znakového jazyka, kde je v současné době složité či nemožné v takovýchto datech vyhledávat. Tento nástroj využívá metody, které byly použity v rozpoznávacím systému. Výstupem tohoto vyhledávacího nástroje je seřazený seznam videí, které obsahují stejný nebo podobný tvar ruky, které zadal uživatel, např. přes webkameru.Katedra kybernetikyObhájenoThis thesis addresses several issues of automatic sign language recognition, namely the creation of vision based sign language recognition framework, sign language corpora creation, feature extraction, making use of novel hand tracking with face occlusion handling, data-driven creation of sub-units and "search by example" tool for searching in sign language corpora using hand images as a search query. The proposed sign language recognition framework, based on statistical approach incorporating hidden Markov models (HMM), consists of video analysis, sign modeling and decoding modules. The framework is able to recognize both isolated signs and continuous utterances from video data. All experiments and evaluations were performed on two own corpora, UWB-06-SLR-A and UWB-07-SLR-P, the first containing 25 signs and second 378. As a baseline feature descriptors, low level image features are used. It is shown that better performance is gained by higher level features that employ hand tracking, which resolve occlusions of hands and face. As a side effect, the occlusion handling method interpolates face area in the frames during the occlusion and allows to use face feature descriptors that fail in such a case, for instance features extracted from active appearance models (AAM) tracker. Several state-of-the-art appearance-based feature descriptors were compared for tracked hands, such as local binary patterns (LBP), histogram of oriented gradients (HOG), high-level linguistic features or newly proposed hand shape radial distance function (denoted as hRDF) that enhances the feature description of hand-shape like concave regions. The concept of sub-units, that uses HMM models based on linguistic units smaller than whole sign and covers inner structures of the signs, was investigated in the proposed iterative method that is a first required step for data-driven construction of sub-units, and shows that such a concept is suitable for sign modeling and recognition tasks. Except of experiments in the sign language recognition, additional tool \textit{search by example} was created and evaluated. This tool is a search engine for sign language videos. Such a system can be incorporated into an online sign language dictionary where it is difficult to search in the sign language data. This proposed tool employs several methods which were examined in the sign language recognition task and allows to search in the video corpora based on an user-given query that consists of one or multiple images of hands. As a result, an ordered list of videos that contain the same or similar hand configurations is returned

University of West Bohemia Digital Library

DSpace at University of West Bohemia

Towards Subject Independent Sign Language Recognition : A Segment-Based Probabilistic Approach

Author: KONG WEI WEON
Publication venue
Publication date: 25/07/2011
Field of study

Ph.DDOCTOR OF PHILOSOPH

ScholarBank@NUS

Sensor Signal Processing for Human Gait Deterioration Analysis by Machine Learning

Author: Alharthi Abdullah
Publication venue
Publication date: 01/08/2022
Field of study

The University of Manchester - Institutional Repository

A Multi-class Classification Strategy for Fisher Scores: Application to Signer Independent Sign Language Recognition

Author: Akarun Lale
Aran Oya
Publication venue: 'Elsevier BV'
Publication date: 26/08/2010
Field of study

Fisher kernels combine the powers of discriminative and generative classifiers by mapping the variable-length sequences to a new fixed length feature space, called the Fisher score space. The mapping is based on a single generative model and the classifier is intrinsically binary. We propose a multi-class classification strategy that applies a multi-class classification on each Fisher score space and combines the decisions of multi-class classifiers. We experimentally show that the Fisher scores of one class provide discriminative information for the other classes as well. We compare several multi-class classification strategies for Fisher scores generated from the hidden Markov models of sign sequences. The proposed multi-class classification strategy increases the classification accuracy in comparison with the state of the art strategies based on combining binary classifiers. To reduce the computational complexity of the Fisher score extraction and the training phases, we also propose a score space selection method and show that, similar or even higher accuracies can be obtained by using only a subset of the score spaces. Based on the proposed score space selection method, a signer adaptation technique is also presented that does not require any re-training

Infoscience - École polytechnique fédérale de Lausanne