Can audio-visual speech recognition outperform acoustically enhanced speech recognition in automotive environment?

Navarathna, Rajitha; Kleinschmidt, Tristan; Dean, David; Sridharan, Sridha; Lucey, Patrick

research

oai:eprints.qut.edu.au:45770

Authors: Rajitha Navarathna, Tristan Kleinschmidt, David Dean, Sridha Sridharan, Patrick Lucey
Publication date: 1 January 2011
Publisher: International Speech Communication Association

Abstract

The use of visual features in the form of lip movements to improve the performance of acoustic speech recognition has been shown to work well, particularly in noisy acoustic conditions. However, it is not known whether this technique can outperform speech recognition that incorporates well-known acoustic enhancement techniques, such as spectral subtraction or multi-channel beamforming. This is an important question, especially in an automotive environment, for the design of an efficient human-vehicle computer interface. We perform a variety of speech recognition experiments on a challenging automotive speech dataset, and the results show that synchronous HMM-based audio-visual fusion can outperform traditional single-channel as well as multi-channel acoustic speech enhancement techniques. We also show that further improvement in recognition performance can be obtained by fusing speech-enhanced audio with the visual modality, demonstrating the complementary nature of the two robust speech recognition approaches.

Chapter in Book, Report or Conference volume

Queensland University of Technology ePrints Archive

Last time updated on 02/07/2013

This paper was published in Queensland University of Technology ePrints Archive.
