Combining joint factor analysis and iVectors for robust language recognition

Author: Demuynck Kris
Desplanques Brecht
Martens Jean-Pierre
Publication date: 01/01/2014

Glottal Source Cepstrum Coefficients Applied to NIST SRE 2010

Author: Gómez Vilda Pedro
Martínez Olalla Rafael
Mazaira Fernández Luis Miguel
Muñoz Cristina
Álvarez Marquina Agustin
Publication venue: Facultad de Informática (UPM)
Publication date: 01/01/2010

This paper presents a novel feature set for speaker recognition based on glottal estimate information. An iterative algorithm is used to derive the vocal tract and glottal source estimates from the speech signal. To test the importance of glottal source information in speaker characterization, the novel feature set has been tested in the 2010 NIST Speaker Recognition Evaluation (NIST SRE10). The proposed system uses glottal estimate parameter templates and classical cepstral information to build a model for each speaker involved in the recognition process. The ALIZE [1] open-source software has been used to create the GMM models for both background and target speakers. Compared to using mel-frequency cepstrum coefficients (MFCC), the misclassification rate for the NIST SRE 2010 was reduced from 29.43% to 27.15% when glottal source features are used ...

Archivo Digital UPM

Histogram equalization for robust text-independent speaker verification in telephone environments

Author: Skosan Marshalleno
Publication venue: Department of Electrical Engineering
Publication date: 01/01/2005

Word processed copy. Includes bibliographical references

Cape Town University OpenUCT

Band-pass filtering of the time sequences of spectral parameters for robust wireless speech recognition

Author: Díaz de María Fernando
Gallardo Antolín Ascensión
Peláez Moreno Carmen
Vicente Peña Jesús de
Publication venue: Elsevier BV
Publication date: 01/01/2006

In this paper we address the problem of automatic speech recognition (ASR) when wireless speech communication systems are involved. In this context, three main sources of distortion should be considered: acoustic environment, speech coding and transmission errors. Whilst the first one has already received a lot of attention, the last two deserve further investigation in our opinion. We have found that band-pass filtering of the recognition features improves ASR performance when distortions due to these particular communication systems are present. Furthermore, we have evaluated two alternative configurations at different bit error rates (BER) typical of these channels: band-pass filtering the LP-MFCC parameters, and a modification of RASTA-PLP using a sharper low-pass section. These perform consistently better than LP-MFCC and RASTA-PLP, respectively.

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Crossref

Universidad Carlos III de Madrid e-Archivo

Histogram Equalization for Robust Speech Recognition

Author: Ángel de la Torre
Antonio J. Rubio
Carmen Benítez
Jose Carlos Segura
Luz García
Publication venue: IntechOpen
Publication date: 01/11/2008

IntechOpen

Speech Recognition Under Noise Conditions: Compensation Methods

Author: Angel de la Torre
Antonio J. Rubio
Carmen Benitez
Javier Ramirez
Luz Garcia
Jose C. Segura
Publication venue: IntechOpen
Publication date: 01/06/2007

IntechOpen