12 research outputs found

    Hybrid HMM/ANN and GMM Combination for User-Customized Password Speaker Verification

    Get PDF
    Recently we have proposed an approach for user-customized password speaker verification; in this approach, we combined a hybrid HMM/ANN model (used for utterance verification) and a GMM model (used for speaker verification). In this paper, we extend our investigations. First, we propose a new similarity measure that uses confidence measures developed in the HMM/ANN framework. Secondly, we analyze the contribution of each model using a weighted sum combination technique. Experiments conducted on a subset of the PolyVar database show that for a short password the performance of the combined system did not improve significantly compared to the performance using the GMM model alone, and that the HMM/ANN did not contribute much in the combined system. We discuss possible reasons for this

    Adaptation of Hybrid ANN/HMM Models using Linear Hidden Transformations and Conservative Training

    Get PDF
    International audienceA technique is proposed for the adaptation of automatic speech recognition systems using Hybrid models combining Artificial Neural Networks with Hidden Markov Models. The application of linear transformations not only to the input features, but also to the outputs of the internal layers is investigated. The motivation is that the outputs of an internal layer represent a projection of the input pattern into a space where it should be easier to learn the classification or transformation expected at the output of the network. A new solution, called Conservative Training, is proposed that compensates for the lack of adaptation samples in certain classes. Supervised adaptation experiments with different corpora and for different adaptation types are described. The results show that the proposed approach always outperforms the use of transformations in the feature space and yields even better results when combined with linear input transformations

    Adapting Hybrid ANN/HMM to Speech Variations

    Get PDF
    A technique is proposed for the adaptation of automatic speech recognition systems using Hybrid models combining Artificial Neural Networks with Hidden Markov Models. We investigated in this paper the extension of the classical approach consisting in applying linear transformations not only to the input features, but also to the outputs of the internal layers. The motivation is that the outputs of an internal layer represent a projection of the input pattern into a space where it should be easier to learn the classification or transformation expected at the output of the network. To reduce the risk that the network focuses on new data only, loosing its generalization capability (catastrophic forgetting), an original solution, Conservative Training is proposed. We illustrate the problem of catastrophic forgetting using an artificial test-bed, and apply our techniques to a set of adaptation tasks in the domain of Automatic Speech Recognition (ASR) based on Artificial Neural Networks. We report on the adaptation potential of different techniques, and on the generalization capability of the adapted networks. The results show that the combination of the proposed approaches mitigates the catastrophic forgetting effects, and always outperforms the use of the classical linear transformation in the feature space. 1

    Activity Report 2004

    Get PDF

    Activity Report 2002

    Get PDF
    corecore