Search CORE

1,034 research outputs found

An Empirical Evaluation of Zero Resource Acoustic Unit Discovery

Author: Burget Lukas
Dehak Najim
Ghahremani Pegah
Kesiraju Santosh
Khudanpur Sanjeev
Liu Chunxi
Ondel Lucas
Rott Alena
Sun Ming
Yang Jinyi
Publication venue
Publication date: 04/02/2017
Field of study

Acoustic unit discovery (AUD) is a process of automatically identifying a categorical acoustic unit inventory from speech and producing corresponding acoustic unit tokenizations. AUD provides an important avenue for unsupervised acoustic model training in a zero resource setting where expert-provided linguistic knowledge and transcribed speech are unavailable. Therefore, to further facilitate zero-resource AUD process, in this paper, we demonstrate acoustic feature representations can be significantly improved by (i) performing linear discriminant analysis (LDA) in an unsupervised self-trained fashion, and (ii) leveraging resources of other languages through building a multilingual bottleneck (BN) feature extractor to give effective cross-lingual generalization. Moreover, we perform comprehensive evaluations of AUD efficacy on multiple downstream speech applications, and their correlated performance suggests that AUD evaluations are feasible using different alternative language resources when only a subset of these evaluation resources can be available in typical zero resource applications.Comment: 5 pages, 1 figure; Accepted for publication at ICASSP 201

arXiv.org e-Print Archive

Crossref

Histogram equalization for robust text-independent speaker verification in telephone environments

Author: Skosan Marshalleno
Publication venue: Department of Electrical Engineering
Publication date: 01/01/2005
Field of study

Word processed copy. Includes bibliographical references

Cape Town University OpenUCT

Speech Synthesis Based on Hidden Markov Models

Author: Nankaku Y.
Oura K.
Toda T.
Tokuda K.
Yamagishi J.
Zen H.
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/05/2013
Field of study

Edinburgh Research Explorer

Multi-channel Feature Enhancement for Robust Speech Recognition

Author: Emanuele Principi
Francesco Piazza
Rudy Rotili
Simone Cifani
Stefano Squartini
Publication venue: 'IntechOpen'
Publication date: 01/01/2011
Field of study

IntechOpen

IRIS UniversitÃ Politecnica delle Marche

A study of prior sensitivity for Bayesian predictive classificationbased robust speech recognition

Author: Huo Q
Lee CH
Publication venue: IEEE.
Publication date: 01/01/1998
Field of study

We previously introduced a new Bayesian predictive classification (BPC) approach to robust speech recognition and showed that the BPC is capable of coping with many types of distortions. We also learned that the efficacy of the BPC algorithm is influenced by the appropriateness of the prior distribution for the mismatch being compensated. If the prior distribution fails to characterize the variability reflected in the model parameters, then the BPC will not help much. We show how the knowledge and/or experience of the interaction between the speech signal and the possible mismatch guide us to obtain a better prior distribution which improves the performance of the BPC approach.published_or_final_versio

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Diposit Digital de Documents de la UAB

Secretaría de Estado de Cultura

Portal de Revistas Científicas Complutenses

HKU Scholars Hub

A study of prior sensitivity for Bayesian predictive classificationbased robust speech recognition

Author: Huo Q
Lee CH
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/1998
Field of study

HKU Scholars Hub