VISUAL FEATURES FOR MULTIMODAL SPEECH RECOGNITION

Petr Motlíček

VISUAL FEATURES FOR MULTIMODAL SPEECH RECOGNITION

Authors: Petr Motlíček
Publication date
Publisher

Abstract

This paper demonstrates the use of visual parameters extracted from video for automatic recognition of phoneme strings. Encouraged by previous works utilizing ”visually clean” data we investigate their efficiency in non-ideal conditions which are introduced by meeting audio-visual data employed in our experiments.

Similar works

Full text

Available Versions

CiteSeerX

oai:CiteSeerX.psu:10.1.1.88.30...

Last time updated on 22/10/2014