Classifying visemes for automatic lipreading

Authors: Matousek Vaclav; Mautner Pavel; Nijholt Antinus; Ocelikovi Jana; Poel Mannes; Sojka Petr; Visser Michiel
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/09/1999

Automatic lipreading is automatic speech recognition that uses only visual information. The relevant data in a video signal is isolated and features are extracted from it. From a sequence of feature vectors, where every vector represents one video image, a sequence of higher-level semantic elements is formed. These semantic elements are "visemes", the visual equivalent of "phonemes". The developed prototype uses a Time Delayed Neural Network to classify the visemes.
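
The classification step described in the abstract can be illustrated with a minimal sketch of a time-delay layer: each output unit looks at a short window of consecutive feature vectors, and the same weights are reused at every time shift. This is not the prototype's actual network. The layer sizes, the viseme labels, the tanh activation, the summation over time, and the random (untrained) weights are all assumptions made for illustration, and the function names are hypothetical.

```python
import math
import random


def time_delay_layer(frames, weights, biases):
    """One time-delay layer over a sequence of per-frame feature vectors.

    weights[unit][delay][feature]: every unit sees `window` consecutive
    frames, and the same weights are applied at every time shift.
    """
    window = len(weights[0])
    outputs = []
    for t in range(len(frames) - window + 1):
        activations = []
        for unit_w, bias in zip(weights, biases):
            total = bias
            for d in range(window):
                total += sum(w * x for w, x in zip(unit_w[d], frames[t + d]))
            activations.append(math.tanh(total))
        outputs.append(activations)
    return outputs


def classify_viseme(frames, weights, biases, labels):
    """Sum each class unit's activation over time and return the best label."""
    per_step = time_delay_layer(frames, weights, biases)
    scores = [sum(step[k] for step in per_step) for k in range(len(labels))]
    best = max(range(len(labels)), key=scores.__getitem__)
    return labels[best], scores


def random_weights(n_units, window, n_features, rng):
    """Untrained placeholder weights; a real system would learn these."""
    return [[[rng.uniform(-1, 1) for _ in range(n_features)]
             for _ in range(window)]
            for _ in range(n_units)]


# Hypothetical setup: 3 made-up viseme classes, 4 features per video frame,
# a window of 3 frames, and a 10-frame input sequence of random features.
rng = random.Random(0)
LABELS = ["closed", "open", "rounded"]
N_FEATURES, WINDOW, N_FRAMES = 4, 3, 10
weights = random_weights(len(LABELS), WINDOW, N_FEATURES, rng)
biases = [0.0] * len(LABELS)
frames = [[rng.uniform(-1, 1) for _ in range(N_FEATURES)]
          for _ in range(N_FRAMES)]

label, scores = classify_viseme(frames, weights, biases, LABELS)
print(label, scores)
```

Because the weights are shared across time shifts, a mouth movement produces the same unit response wherever it occurs in the sequence, which is the property that makes this kind of network a natural fit for classifying short visual speech segments. With untrained weights the predicted label is arbitrary; the sketch only shows the data flow from feature vectors to a viseme decision.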