Search CORE

2,063 research outputs found

Detecting Social Signals with Spatiotemporal Gabor Filters

Author: Joosten Bart
Publication venue: [s.n.]
Publication date: 01/01/2018
Field of study

Анализ перспектив применения высокоскоростных камер для распознавания динамической видеоинформации

Author: Иванько Денис Викторович
Карпов Алексей Анатольевич
Publication venue: СПб ФИЦ РАН
Publication date: 15/02/2016
Field of study

In this paper, we review the actual and perspective areas of use of high-speed video cameras. We discuss the possibility of applying high-speed cameras in the field of human-computer interaction to detect dynamic video information (including visual speech). We also describe main tasks, which can be solved with high-speed cameras, such as: automatic lip-reading, eye blink detection, facial micro-expression recognition, etc. We identify potential challenges associated with the introduction of high-speed video cameras and analyze the conditions of research area. Besides, we analyze state-of-the-art in the field at the moment and prove that there is an urgent need for further scientific and technical developments in this area. We propose some advanced applications and tasks in the human-computer interaction domain, where high-speed video capturing can be useful, such as audio-visual continuous speech recognition and automatic reading speech by lips. In further research, we will implement such a multimodal system for audio-visual Russian speech recognition using a microphone and a high-speed video camera JAI Pulnix.Рассматриваются актуальные и перспективные направления по использованию высокоскоростных видеокамер. Обсуждается возможность применения высокоскоростных камер в области человеко-машинного взаимодействия для автоматического распознавания динамической видеоинформации (в том числе визуальной речи диктора). Выделяются основные задачи взаимодействия, решаемые с помощью высокоскоростных камер, такие как: автоматическое чтение речи по губам диктора, обнаружение моргания, распознавание микровыражений. Обозначаются возможные проблемы, связанные с внедрением высокоскоростных видеокамер. Анализируется состояние области исследований на настоящий момент и доказывается, что имеется высокая актуальность развития данного научно-технического направления. Предлагаются многообещающие области применения и задачи организации человеко-машинного взаимодействия с применением высокоскоростной видеосъемки. Основными направлениями являются аудиовизуальное распознавание слитной речи и чтение речи по губам диктора. В ходе дальнейших исследований планируется реализация подобной многомодальной системы аудиовизуального распознавания речи для русского языка с использованием микрофона и высокоскоростной видеокамеры JAI Pulnix

Machine Analysis of Facial Expressions

Author: Bartlett M.S.
Pantic M.
Publication venue: I-Tech Education and Publishing
Publication date: 01/01/2007
Field of study

No abstract

CiteSeerX

University of Twente Research Information

Individual Differences in Speech Production and Perception

Author
Publication venue: 'Peter Lang, International Academic Publishers'
Publication date
Field of study

Inter-individual variation in speech is a topic of increasing interest both in human sciences and speech technology. It can yield important insights into biological, cognitive, communicative, and social aspects of language. Written by specialists in psycholinguistics, phonetics, speech development, speech perception and speech technology, this volume presents experimental and modeling studies that provide the reader with a deep understanding of interspeaker variability and its role in speech processing, speech development, and interspeaker interactions. It discusses how theoretical models take into account individual behavior, explains why interspeaker variability enriches speech communication, and summarizes the limitations of the use of speaker information in forensics

OAPEN Library

Machine Analysis of Facial Expressions

Author: Bartlett M.S.
Pantic Maja
Publication venue: I-Tech Education and Publishing
Publication date: 01/07/2007
Field of study

University of Twente Research Information

Articulatory features for robust visual speech recognition

Author
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2004
Field of study

Models and Analysis of Vocal Emissions for Biomedical Applications

Author
Publication venue: 'Firenze University Press'
Publication date: 31/05/2022
Field of study

The International Workshop on Models and Analysis of Vocal Emissions for Biomedical Applications (MAVEBA) came into being in 1999 from the particularly felt need of sharing know-how, objectives and results between areas that until then seemed quite distinct such as bioengineering, medicine and singing. MAVEBA deals with all aspects concerning the study of the human voice with applications ranging from the neonate to the adult and elderly. Over the years the initial issues have grown and spread also in other aspects of research such as occupational voice disorders, neurology, rehabilitation, image and video analysis. MAVEBA takes place every two years always in Firenze, Italy. This edition celebrates twenty years of uninterrupted and succesfully research in the field of voice analysis

My English sounds better than yours: Second-language learners perceive their own accent as better than that of their peers

Author: Eger Nikola Anna
Mitterer Holger
Reinisch Eva
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2020
Field of study

Second language (L2) learners are often aware of the typical pronunciation errors that speakers of their native language make, yet often persist in making these errors themselves. We hypothesised that L2 learners may perceive their own accent as closer to the target language than the accent of other learners, due to frequent exposure to their own productions. This was tested by recording 24 female native speakers of German producing 60 sentences. The same participants later rated these recordings for accentedness. Importantly, the recordings had been altered to sound male so that participants were unaware of their own productions in the to-be-rated samples. We found evidence supporting our hypothesis: participants rated their own altered voice, which they did not recognize as their own, as being closer to a native speaker than that of other learners. This finding suggests that objective feedback may be crucial in fostering L2 acquisition and reduce fossilization of erroneous patterns

Directory of Open Access Journals

Models and Analysis of Vocal Emissions for Biomedical Applications

Author
Publication venue: 'Firenze University Press'
Publication date: 31/05/2022
Field of study

The Models and Analysis of Vocal Emissions with Biomedical Applications (MAVEBA) workshop came into being in 1999 from the particularly felt need of sharing know-how, objectives and results between areas that until then seemed quite distinct such as bioengineering, medicine and singing. MAVEBA deals with all aspects concerning the study of the human voice with applications ranging from the neonate to the adult and elderly. Over the years the initial issues have grown and spread also in other aspects of research such as occupational voice disorders, neurology, rehabilitation, image and video analysis. MAVEBA takes place every two years always in Firenze, Italy