Multilingual Training and Cross-lingual Adaptation on CTC-based Acoustic Model
Multilingual models for Automatic Speech Recognition (ASR) are attractive as
they have been shown to benefit from more training data, and better lend
themselves to adaptation to under-resourced languages. However, initialisation
from monolingual context-dependent models leads to an explosion of
context-dependent states. Connectionist Temporal Classification (CTC) is a
potential solution to this as it performs well with monophone labels.
We investigate multilingual CTC in the context of adaptation and
regularisation techniques that have been shown to be beneficial in more
conventional contexts. The multilingual model is trained to model a universal
International Phonetic Alphabet (IPA)-based phone set using the CTC loss
function. Learning Hidden Unit Contribution (LHUC) is investigated to perform
language adaptive training. In addition, dropout during cross-lingual
adaptation is also studied and tested in order to mitigate the overfitting
problem.
Experiments show that the performance of the universal phoneme-based CTC
system can be improved by applying LHUC and it is extensible to new phonemes
during cross-lingual adaptation. Updating all the parameters shows consistent
improvement on limited data. Applying dropout during adaptation can further
improve the system and achieve competitive performance with Deep Neural Network
/ Hidden Markov Model (DNN/HMM) systems on limited data.
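LHUC, as used in the abstract above, adapts a trained network by rescaling each hidden unit with a small vector of language-specific amplitude parameters while the shared weights stay fixed. A minimal plain-Python sketch (the function names and toy values are illustrative, not from the paper):

```python
import math

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def lhuc_scale(hidden, r):
    # LHUC rescales each hidden activation h_k by an amplitude
    # a(r_k) = 2 * sigmoid(r_k), bounded in (0, 2); only the small
    # per-language vector r is learned during adaptive training.
    return [h * 2.0 * sigmoid(rk) for h, rk in zip(hidden, r)]

hidden = [0.5, -1.0, 2.0]
# r = 0 gives amplitude 1.0, so adaptation starts from the unadapted network
assert lhuc_scale(hidden, [0.0, 0.0, 0.0]) == hidden
```

Because the amplitude equals 1 at r = 0 and never exceeds 2, per-language adaptation starts from the multilingual network and can only attenuate or moderately boost each unit.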
Adaptation of Hybrid ANN/HMM Models using Linear Hidden Transformations and Conservative Training
A technique is proposed for the adaptation of automatic speech recognition systems using Hybrid models combining Artificial Neural Networks with Hidden Markov Models. The application of linear transformations not only to the input features, but also to the outputs of the internal layers, is investigated. The motivation is that the outputs of an internal layer represent a projection of the input pattern into a space where it should be easier to learn the classification or transformation expected at the output of the network. A new solution, called Conservative Training, is proposed that compensates for the lack of adaptation samples in certain classes. Supervised adaptation experiments with different corpora and for different adaptation types are described. The results show that the proposed approach always outperforms the use of transformations in the feature space and yields even better results when combined with linear input transformations.
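The linear hidden transformation described above inserts an adaptation matrix on the output of an internal layer, initialised to the identity so that training starts from the unadapted network and only this layer is updated. A minimal plain-Python sketch (names and dimensions are illustrative, not the authors' code):

```python
def identity_init(n):
    # Identity weight matrix and zero bias: before any adaptation
    # step, the inserted layer leaves the hidden activations unchanged.
    W = [[1.0 if j == k else 0.0 for k in range(n)] for j in range(n)]
    b = [0.0] * n
    return W, b

def apply_linear_transform(hidden, W, b):
    # Adaptation layer on a hidden layer's output:
    # y_j = sum_k W[j][k] * h[k] + b[j]
    n = len(hidden)
    return [sum(W[j][k] * hidden[k] for k in range(n)) + b[j]
            for j in range(n)]

W, b = identity_init(3)
h = [0.2, -0.7, 1.5]
assert apply_linear_transform(h, W, b) == h
```

During supervised adaptation only W and b would be trained, leaving the original network weights frozen; with identity initialisation the adapted model can never start worse than the baseline.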
Adapting Hybrid ANN/HMM to Speech Variations
A technique is proposed for the adaptation of automatic speech recognition systems using Hybrid models combining Artificial Neural Networks with Hidden Markov Models. In this paper we investigate extending the classical approach of applying linear transformations to the input features by also applying them to the outputs of the internal layers. The motivation is that the outputs of an internal layer represent a projection of the input pattern into a space where it should be easier to learn the classification or transformation expected at the output of the network. To reduce the risk that the network focuses on new data only, losing its generalization capability (catastrophic forgetting), an original solution, Conservative Training, is proposed. We illustrate the problem of catastrophic forgetting using an artificial test-bed, and apply our techniques to a set of adaptation tasks in the domain of Automatic Speech Recognition (ASR) based on Artificial Neural Networks. We report on the adaptation potential of different techniques, and on the generalization capability of the adapted networks. The results show that the combination of the proposed approaches mitigates the catastrophic forgetting effects, and always outperforms the use of the classical linear transformation in the feature space.
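One common formulation of Conservative Training replaces the zero targets of output classes that have no adaptation samples with the posteriors of the original, unadapted network, so those outputs are not pushed toward zero during adaptation. A sketch under that assumption (the function name and example values are ours, not from the paper):

```python
def conservative_targets(correct_class, original_posteriors, seen_classes):
    # Classes absent from the adaptation data keep the original network's
    # posterior as their target (instead of zero), which is what protects
    # them from catastrophic forgetting; seen classes other than the
    # correct one get the usual zero target.
    targets = [p if c not in seen_classes else 0.0
               for c, p in enumerate(original_posteriors)]
    # The correct class takes the remaining mass so targets sum to one.
    targets[correct_class] = 1.0 - sum(targets)
    return targets

# Class 1 is unseen in the adaptation set, so it keeps its posterior 0.2.
t = conservative_targets(2, [0.1, 0.2, 0.7], seen_classes={0, 2})
```

Compared with plain one-hot targets, the adapted network is still told that unseen classes carry some probability mass, which is where the "conservative" behaviour comes from.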