Search CORE

67,794 research outputs found

Homogenous Ensemble Phonotactic Language Recognition Based on SVM Supervector Reconstruction

Author: Johnson Michael T
Liu Jia
Liu Wei-Wei
Zhang Wei-Qiang
Publication venue: e-Publications@Marquette
Publication date: 01/01/2014
Field of study

Currently, acoustic spoken language recognition (SLR) and phonotactic SLR systems are widely used language recognition systems. To achieve better performance, researchers combine multiple subsystems with the results often much better than a single SLR system. Phonotactic SLR subsystems may vary in the acoustic features vectors or include multiple language-specific phone recognizers and different acoustic models. These methods achieve good performance but usually compute at high computational cost. In this paper, a new diversification for phonotactic language recognition systems is proposed using vector space models by support vector machine (SVM) supervector reconstruction (SSR). In this architecture, the subsystems share the same feature extraction, decoding, and N-gram counting preprocessing steps, but model in a different vector space by using the SSR algorithm without significant additional computation. We term this a homogeneous ensemble phonotactic language recognition (HEPLR) system. The system integrates three different SVM supervector reconstruction algorithms, including relative SVM supervector reconstruction, functional SVM supervector reconstruction, and perturbing SVM supervector reconstruction. All of the algorithms are incorporated using a linear discriminant analysis-maximum mutual information (LDA-MMI) backend for improving language recognition evaluation (LRE) accuracy. Evaluated on the National Institute of Standards and Technology (NIST) LRE 2009 task, the proposed HEPLR system achieves better performance than a baseline phone recognition-vector space modeling (PR-VSM) system with minimal extra computational cost. The performance of the HEPLR system yields 1.39%, 3.63%, and 14.79% equal error rate (EER), representing 6.06%, 10.15%, and 10.53% relative improvements over the baseline system, respectively, for the 30-, 10-, and 3-s test conditions

epublications@Marquette

Springer - Publisher Connector

Why a diagnosis of neurofibromatosis calls for the attention of a deaf educator

Author: López Lydia Marie
Publication venue: Digital Commons@Becker
Publication date: 01/01/2016
Field of study

This paper will seek to describe neurofibromatosis (NF), the scope of its impact, how NF relates to hearing loss, and why someone with a teacher of the deaf’s expertise may have information to offer the intervention team for a child diagnosed with NF

Digital Commons@Becker

Improving Source Separation via Multi-Speaker Representations

Author: Van hamme Hugo
Zegers Jeroen
Publication venue: 'International Speech Communication Association'
Publication date: 01/01/2017
Field of study

Lately there have been novel developments in deep learning towards solving the cocktail party problem. Initial results are very promising and allow for more research in the domain. One technique that has not yet been explored in the neural network approach to this task is speaker adaptation. Intuitively, information on the speakers that we are trying to separate seems fundamentally important for the speaker separation task. However, retrieving this speaker information is challenging since the speaker identities are not known a priori and multiple speakers are simultaneously active. There is thus some sort of chicken and egg problem. To tackle this, source signals and i-vectors are estimated alternately. We show that blind multi-speaker adaptation improves the results of the network and that (in our case) the network is not capable of adequately retrieving this useful speaker information itself

arXiv.org e-Print Archive

Lirias

Crossref