2,063 research outputs found

    Analysis of the prospects for applying high-speed cameras to the recognition of dynamic video information

    In this paper, we review the current and prospective areas of use of high-speed video cameras. We discuss the possibility of applying high-speed cameras in the field of human-computer interaction to recognize dynamic video information (including a speaker's visual speech). We also describe the main tasks that can be solved with high-speed cameras, such as automatic lip-reading, eye blink detection, and facial micro-expression recognition. We identify potential challenges associated with the introduction of high-speed video cameras and analyze the current state of the research area, showing that there is an urgent need for further scientific and technical development in this field. We propose some promising applications and tasks in the human-computer interaction domain where high-speed video capture can be useful, chiefly audio-visual continuous speech recognition and automatic lip-reading. In further research, we plan to implement such a multimodal system for audio-visual Russian speech recognition using a microphone and a JAI Pulnix high-speed video camera.

    Machine Analysis of Facial Expressions

    No abstract

    Individual Differences in Speech Production and Perception

    Inter-individual variation in speech is a topic of increasing interest both in human sciences and speech technology. It can yield important insights into biological, cognitive, communicative, and social aspects of language. Written by specialists in psycholinguistics, phonetics, speech development, speech perception and speech technology, this volume presents experimental and modeling studies that provide the reader with a deep understanding of interspeaker variability and its role in speech processing, speech development, and interspeaker interactions. It discusses how theoretical models take into account individual behavior, explains why interspeaker variability enriches speech communication, and summarizes the limitations of the use of speaker information in forensics.

    Machine Analysis of Facial Expressions


    Articulatory features for robust visual speech recognition


    Models and Analysis of Vocal Emissions for Biomedical Applications

    The International Workshop on Models and Analysis of Vocal Emissions for Biomedical Applications (MAVEBA) came into being in 1999 from the strongly felt need to share know-how, objectives and results between areas that until then had seemed quite distinct, such as bioengineering, medicine and singing. MAVEBA deals with all aspects of the study of the human voice, with applications ranging from the neonate to the adult and the elderly. Over the years the initial topics have grown and spread into other areas of research, such as occupational voice disorders, neurology, rehabilitation, and image and video analysis. MAVEBA takes place every two years, always in Firenze, Italy. This edition celebrates twenty years of uninterrupted and successful research in the field of voice analysis.

    My English sounds better than yours: Second-language learners perceive their own accent as better than that of their peers

    Second language (L2) learners are often aware of the typical pronunciation errors that speakers of their native language make, yet often persist in making these errors themselves. We hypothesised that L2 learners may perceive their own accent as closer to the target language than the accent of other learners, due to frequent exposure to their own productions. This was tested by recording 24 female native speakers of German producing 60 sentences. The same participants later rated these recordings for accentedness. Importantly, the recordings had been altered to sound male so that participants were unaware of their own productions in the to-be-rated samples. We found evidence supporting our hypothesis: participants rated their own altered voice, which they did not recognize as their own, as being closer to a native speaker than that of other learners. This finding suggests that objective feedback may be crucial in fostering L2 acquisition and reducing fossilization of erroneous patterns.

    Models and Analysis of Vocal Emissions for Biomedical Applications

    The Models and Analysis of Vocal Emissions with Biomedical Applications (MAVEBA) workshop came into being in 1999 from the strongly felt need to share know-how, objectives and results between areas that until then had seemed quite distinct, such as bioengineering, medicine and singing. MAVEBA deals with all aspects of the study of the human voice, with applications ranging from the neonate to the adult and the elderly. Over the years the initial topics have grown and spread into other areas of research, such as occupational voice disorders, neurology, rehabilitation, and image and video analysis. MAVEBA takes place every two years, always in Firenze, Italy.