Talking faces for MPEG-4 compliant scalable face-to-face telecommunication

Abstract

We present here a system that captures, encodes and renders speaker-specific speech gestures in a MPEG-4 compliant framework. The process is eased by two original options: (a) the use of a specific video capture via a head-mounted camera, (b).the a priori construction of speaker-specific shape and appearance models. We will show that speaker-specific articulatory movements can be straightforward encoded into the normalized MPEG-4 Facial Animation Parameters. We will comment the perquisite to possible extensions of this work towards every day life face-to-face communications and multimodal virtual teleconferencing

    Similar works

    Full text

    thumbnail-image

    Available Versions