Improving the recognition of pathological voice using the discriminant HLDA transformation

Di Martino, Joseph; Hammouch, Ahmed; Ibn Elhaj, El Hassane; Lachhab, Othman

Authors: Joseph Di Martino
Ahmed Hammouch
El Hassane Ibn Elhaj
Othman Lachhab
Publication date: 20 October 2014
Publisher: HAL CCSD

Abstract

In this paper, we propose a simple and fast method for evaluating pathological (esophageal) voice by applying continuous speech recognition in speaker-dependent mode to our own database of pathological voice, which we call FPSD (French Pathological Speech Database). The recognition system is implemented on the HTK platform and is based on HMM/GMM monophone models. The acoustic vectors are linearly transformed with HLDA (Heteroscedastic Linear Discriminant Analysis), which projects them into a lower-dimensional space with good discriminative properties. The phone recognition rate obtained (63.59%) is very promising given that esophageal voice contains unnatural sounds that are difficult to understand.
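To make the dimensionality-reduction step concrete, here is a minimal sketch of how an already-estimated linear transform could be applied to one acoustic frame. It covers only the projection; estimating the HLDA matrix itself is not shown. The function name `project`, the toy 4-dimensional frame, and the matrix values are all hypothetical and are not taken from the paper.

```python
def project(frame, transform, p):
    """Apply the first p rows of a square transform to one feature frame.

    Assumption: `transform` stands in for an HLDA matrix whose first p rows
    span the retained (discriminative) dimensions; the remaining rows are
    simply dropped here.
    """
    if p > len(transform):
        raise ValueError("p cannot exceed the number of rows in the transform")
    # Each output component is the dot product of one retained row with the frame.
    return [sum(a * x for a, x in zip(row, frame)) for row in transform[:p]]


# Toy data for illustration only (not values from the paper).
frame = [1.0, 2.0, 3.0, 4.0]
transform = [
    [1.0, 0.0, 0.0, 0.0],
    [0.0, 0.5, 0.5, 0.0],
    [0.0, 0.0, 1.0, -1.0],
    [1.0, 1.0, 1.0, 1.0],
]

reduced = project(frame, transform, p=2)
print(reduced)  # 4-dimensional frame reduced to 2 dimensions
```

In a real system the same projection would be applied to every frame before training and decoding, with the matrix and the target dimension chosen by the authors' setup, which the abstract does not specify.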

Available Versions

Archive Ouverte en Sciences de l'Information et de la Communication

oai:HAL:hal-01093309v1

Last time updated on 29/04/2016

INRIA a CCSD electronic archive server

oai:HAL:hal-01093309v1

Last time updated on 09/11/2016