    Acoustic-to-articulatory inversion in speech based on statistical models

    Two speech inversion methods are implemented and compared. In the first, multistream Hidden Markov Models (HMMs) of phonemes are jointly trained on synchronous streams of articulatory data acquired by electromagnetic articulography (EMA) and speech spectral parameters; an acoustic recognition system uses the acoustic part of the HMMs to deliver a phoneme chain and the state durations; this information is then used by a trajectory formation procedure based on the articulatory part of the HMMs to resynthesise the articulatory movements. In the second, Gaussian Mixture Models (GMMs) are trained on these streams to directly associate articulatory frames with acoustic frames in context, using Maximum Likelihood Estimation. Over a corpus of 17 minutes uttered by a French speaker, the RMS error was 1.62 mm with the HMMs and 2.25 mm with the GMMs.
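
    The sketch below illustrates the general idea behind the second (GMM-based) method, not the authors' actual implementation: a joint GMM is fit by maximum likelihood (EM) on stacked acoustic+articulatory frames, and new acoustic frames are mapped to articulatory frames via the conditional expectation E[y | x] under that joint model. This conditional-expectation mapping is a common simplification; the paper's MLE-based mapping in context is more elaborate. The feature dimensions, the synthetic data, and the use of scikit-learn are assumptions for illustration only.

        import numpy as np
        from sklearn.mixture import GaussianMixture

        rng = np.random.default_rng(0)

        # Toy stand-ins for synchronous streams: x = acoustic frames (e.g.
        # spectral parameters), y = articulatory frames (e.g. EMA coil
        # coordinates in mm). Dimensions are arbitrary for this sketch.
        n_frames, dx, dy = 2000, 12, 6
        x_train = rng.normal(size=(n_frames, dx))
        y_train = 0.5 * x_train[:, :dy] + 0.1 * rng.normal(size=(n_frames, dy))

        # 1) Train a joint GMM on concatenated (x, y) frames by maximum
        #    likelihood (EM under the hood).
        gmm = GaussianMixture(n_components=8, covariance_type="full",
                              random_state=0)
        gmm.fit(np.hstack([x_train, y_train]))

        def gmm_regress(x):
            """Map acoustic frames x to articulatory frames: E[y | x]."""
            means, covs, weights = gmm.means_, gmm.covariances_, gmm.weights_
            mx, my = means[:, :dx], means[:, dx:]
            cxx, cxy = covs[:, :dx, :dx], covs[:, :dx, dx:]
            # Responsibilities p(k | x) from the marginal GMM over x,
            # computed in log space for numerical stability.
            log_resp = np.zeros((len(x), gmm.n_components))
            for k in range(gmm.n_components):
                diff = x - mx[k]
                L = np.linalg.cholesky(cxx[k])
                z = np.linalg.solve(L, diff.T).T
                log_resp[:, k] = (np.log(weights[k])
                                  - np.sum(np.log(np.diag(L)))
                                  - 0.5 * np.sum(z**2, axis=1))
            resp = np.exp(log_resp - log_resp.max(axis=1, keepdims=True))
            resp /= resp.sum(axis=1, keepdims=True)
            # 2) Mix the per-component conditional means:
            #    E[y | x, k] = my_k + cov_yx cov_xx^{-1} (x - mx_k)
            y_hat = np.zeros((len(x), dy))
            for k in range(gmm.n_components):
                cond = my[k] + (x - mx[k]) @ np.linalg.solve(cxx[k], cxy[k])
                y_hat += resp[:, [k]] * cond
            return y_hat

        x_test = rng.normal(size=(100, dx))
        y_pred = gmm_regress(x_test)
        print(y_pred.shape)  # (100, 6)

    On real data, an RMS error such as the reported 1.62 mm (HMM) vs. 2.25 mm (GMM) would be computed per articulatory coordinate between y_pred and measured EMA trajectories.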
