Computer classification of stop consonants in a speaker independent continuous speech environment

Campanelli, Michael R.

Computer classification of stop consonants in a speaker independent continuous speech environment

Authors: Michael R. Campanelli
Publication date: 1 January 1991
Publisher: RIT Scholar Works

Abstract

In the English language there are six stop consonants, /b,d,g,p,t,k/. They account for over 17% of all phonemic occurrences. In continuous speech, phonetic recognition of stop consonants requires the ability to explicitly characterize the acoustic signal. Prior work has shown that high classification accuracy of discrete syllables and words can be achieved by characterizing the shape of the spectrally transformed acoustic signal. This thesis extends this concept to include a multispeaker continuous speech database and statistical moments of a distribution to characterize shape. A multivariate maximum likelihood classifier was used to discriminate classes. To reduce the number of features used by the discriminant model a dynamic programming scheme was employed to optimize subset combinations. The top six moments were the mean, variance, and skewness in both frequency and energy. Results showed 85% classification on the full database of 952 utterances. Performance improved to 97% when the discriminant model was trained separately for male and female talkers

Similar works

Full text

Open in the Core reader

Download PDF

Available Versions

Name not available

oai:scholarworks.rit.edu:these...

Last time updated on 06/01/2018

RIT Scholar Works

oai:repository.rit.edu:theses-...

Last time updated on 12/01/2024