Search CORE

15 research outputs found

How speaker tongue and name source language affect the automatic recognition of spoken names

Author: D'Hoore Bart
Martens Jean-Pierre
Réveil Bert
Publication venue: International Speech Communication Association (ISCA)
Publication date: 01/01/2009
Field of study

In this paper the automatic recognition of person names and geographical names uttered by native and non-native speakers is examined in an experimental set-up. The major aim was to raise our understanding of how well and under which circumstances previously proposed methods of multilingual pronunciation modeling and multilingual acoustic modeling contribute to a better name recognition in a cross-lingual context. To come to a meaningful interpretation of results we have categorized each language according to the amount of exposure a native speaker is expected to have had to this language. After having interpreted our results we have also tried to find an answer to the question of how much further improvement one might be able to attain with a more advanced pronunciation modeling technique which we plan to develop

Ghent University Academic Bibliography

Recognition of foreign names spoken by native speakers

Author: Martens Jean-Pierre
Stouten Frederik
Publication venue: International Speech Communication Association (ISCA)
Publication date: 01/01/2007
Field of study

It is a challenge to develop a speech recognizer that can handle the kind of lexicons encountered in an automatic attendant or car navigation application. Such lexicons can contain several 100K entries, mainly proper names. Many of these names are of a foreign origin, and native speakers can pronounce them in different ways, ranging from a completely nativized to a completely foreignized pronunciation. In this paper we propose a method that tries to deal with the observed pronunciation variability by introducing the concept of a foreignizable phoneme, and by combining standard acoustic models with a phonologically inspired back-off acoustic model. The main advantage of the approach is that it does not require any foreign phoneme models nor foreign speech data. For the recognition of English names by means of Dutch acoustic models, we obtained a reduction of the word error rate by more than 10% relative

Ghent University Academic Bibliography

Learning Pronunciation and Formulation Variants in Continuous Speech Applications

Author: COLIBRO D.
FISSORE L.
LAFACE P.
POPOVIVI C.
VAIR C.
Publication venue: IEEE
Publication date
Field of study

PORTO@iris (Publications Open Repository TOrino - Politecnico di Torino)

Fully Automated Non-Native Speech Recognition Using Confusion-Based Acoustic Model Integration

Author: Bouselmi Ghazi
Fohr Dominique
Haton Jean-Paul
Illina Irina
Publication venue: 'The International Fiscal Association of Korea'
Publication date: 04/09/2005
Field of study

This paper presents a fully automated approach for the recognition of non-native speech based on acoustic model modification. For a native language (L1) and a spoken language (L2), pronunciation variants of the phones of L2 are automatically extracted from an existing non-native database as a confusion matrix with sequences of phones of L1. This is done using L1's and L2's ASR systems. This confusion concept deals with the problem of non existence of match between some L2 and L1 phones. The confusion matrix is then used to modify the acoustic models (HMMs) of L2 phones by integrating corresponding L1 phone models as alternative HMM paths. In this way, no lexicon modification is carried. The modified ASR system achieved an improvement between 32% and 40% (relative, L1=French and L2=English) in WER on the French non-native database used for testing

INRIA a CCSD electronic archive server

HAL-Rennes 1

Fully Automated Non-Native Speech Recognition Using Confusion-Based Acoustic Model Integration And Graphemic Constraints

Author: Bouselmi Ghazi
Fohr Dominique
Haton Jean-Paul
Illina Irina
Publication venue: HAL CCSD
Publication date: 15/05/2006
Field of study

This paper presents a fully automated approach for the recognition of non-native speech based on acoustic model modification. For a native language (L1) and a spoken language (L2), pronunciation variants of the phones of L2 are automatically extracted from an existing non-native database as a confusion matrix with sequences of phones of L1. This is done using L1's and L2's ASR systems. This confusion concept deals with the problem of non existence of match between some L2 and L1 phones. The confusion matrix is then used to modify the acoustic models (HMMs) of L2 phones by integrating corresponding L1 phone models as alternative HMM paths. We introduce graphemic contraints in the confusion extraction process: the phonetic confusion is established for each couple of `L2-phone' and the grapheme(s) correspondig to that phone. We claim that prononciation errors may depend on the graphemes related to each phone. The modified ASR system achieved an improvement between 32% and 40% (relative, L1=French and L2=English) in WER on the French non-native database used for testing. The introduction of graphemic contraints in the phonetic confusion allowed further improvements

INRIA a CCSD electronic archive server

HAL Descartes

Hal-Diderot

Reconnaissance de parole non native fondée sur l'utilisation de confusion phonétique et de contraintes graphèmiques

Author: Bouselmi Ghazi
Fohr Dominique
Haton Jean-Paul
Illina Irina
Publication venue: HAL CCSD
Publication date: 12/06/2006
Field of study

This paper presents a fully automated approach for the recognition of non native speech based on acoustic model modification. For a native language (LM) and a spoken language (LP), pronunciation variants of the phones of LP are automatically extracted from an existing non native database. These variants are stored in a confusion matrix between phones of LP and sequences of phones of LM. This confusion concept deals with the problem of non existence of match between some LM and LP phones. The confusion matrix is then used to modify the acoustic models (HMMs) of LP phones by integrating corresponding LM phone models as alternative HMM paths. We introduce graphemic contraints in the confusion extraction process. We claim that prononciation errors may depend on the graphemes related to each phone. The modified ASR system achieved a significant improvement varying between 20.3% and 43.2% (relative) in ``sentence error rate'' and between 26.6% and 50.0% (relative) in ``word error rate''. The introduction of graphemic contraints in the phonetic confusion allowed improvements while using the word-loop grammar

INRIA a CCSD electronic archive server

HAL Descartes

Hal-Diderot

Reconnaissance automatique de la parole : génération des prononciations non natives pour l'enrichissement du lexique

Author: Bada Ismael
Fohr Dominique
Illina Irina
Publication venue: AFCP
Publication date: 01/01/2020
Field of study

Dans cet article nous proposons une méthode d’adaptation du lexique, destinée à améliorer les systèmes de la reconnaissance automatique de la parole (SRAP) des locuteurs non natifs. En effet, la reconnaissance automatique souffre d’une chute significative de ses performances quand elle est utilisée pour reconnaître la parole des locuteurs non natifs, car les phonèmes de la langue étrangère sont fréquemment mal prononcés par ces locuteurs. Pour prendre en compte ce problème de prononciations erronées, notre approche propose d’intégrer les prononciations non natives dans le lexique et par la suite d’utiliser ce lexique enrichi pour la reconnaissance. Pour réaliser notre approche nous avons besoin d’un petit corpus de parole non native et de sa transcription. Pour générer les prononciations non natives, nous proposons de tenir compte des correspondances graphèmes-phonèmes en vue de générer de manière automatique des règles de création de nouvelles prononciations. Ces nouvelles prononciations seront ajoutées au lexique. Nous présentons une évaluation de notre méthode sur un corpus de locuteurs non natifs français s’exprimant en anglais

INRIA a CCSD electronic archive server

Non-Native Pronunciation Variation Modeling for Automatic Speech Recognition

Author: Hong Kook Kim
Mina Kim
Yoo Rhee Oh
Publication venue: 'IntechOpen'
Publication date: 16/08/2010
Field of study

IntechOpen

Optimizing Automatic Speech Recognition for Low-Proficient Non-Native Speakers

Author
Publication venue: Springer
Publication date
Field of study

Springer - Publisher Connector

Generating non-native pronunciation variants for lexicon adaptation

Author: Ralf Kompe
Silke Goronzy
Publication venue
Publication date: 01/01/2004
Field of study

Traditional approaches to model pronunciation variations either require expert knowledge or extensive speech databases. In the cases where non-native speech is considered they are too costly, especially if a flexible modelling of various accents is desired. We propose to exclusively use native speech databases to derive non-native pronunciation variants. We use a phoneme recognizer to generate English pronunciations for German words and use these to train decision trees that are able to predict the respective English-accented variant from the German canonical transcription. In first experiments we achieved promising results using the enhanced dictionary for decoding accented-data. 1

CiteSeerX