Search CORE

25 research outputs found

Time–Frequency Cepstral Features and Heteroscedastic Linear Discriminant Analysis for Language Recognition

Author: Deng Yan
He Liang
Johnson Michael T
Liu Jia
Zhang Wei-Qiang
Publication venue: e-Publications@Marquette
Publication date: 01/02/2011
Field of study

The shifted delta cepstrum (SDC) is a widely used feature extraction for language recognition (LRE). With a high context width due to incorporation of multiple frames, SDC outperforms traditional delta and acceleration feature vectors. However, it also introduces correlation into the concatenated feature vector, which increases redundancy and may degrade the performance of backend classifiers. In this paper, we first propose a time-frequency cepstral (TFC) feature vector, which is obtained by performing a temporal discrete cosine transform (DCT) on the cepstrum matrix and selecting the transformed elements in a zigzag scan order. Beyond this, we increase discriminability through a heteroscedastic linear discriminant analysis (HLDA) on the full cepstrum matrix. By utilizing block diagonal matrix constraints, the large HLDA problem is then reduced to several smaller HLDA problems, creating a block diagonal HLDA (BDHLDA) algorithm which has much lower computational complexity. The BDHLDA method is finally extended to the GMM domain, using the simpler TFC features during re-estimation to provide significantly improved computation speed. Experiments on NIST 2003 and 2007 LRE evaluation corpora show that TFC is more effective than SDC, and that the GMM-based BDHLDA results in lower equal error rate (EER) and minimum average cost (Cavg) than either TFC or SDC approaches

epublications@Marquette

Linear classifier design under heteroscedasticity in Linear Discriminant Analysis

Author: Anderson
Andrew Hunt
Barber
Bishop
Buturovic
Cai
Chen
Coomans
Decell
Decell Jr
Duin
Elena Gaura
Fisher
Friedman
Fukunaga
Fukunaga
Hamsici
Hariharan
Hastie
Hsu
Izenman
James Brusey
Ju
Kojo Sarfo Gyamfi
Li
Liu
Loog
Lu
Mahmoudi
Malina
Marks
McLachlan
Mika
Mudholkar
Ng
Ojetola
Paliwal
Peterson
Polat
Sengur
Sharma
Sharma
Song
Xue
Yu
Yuan
Zhang
Zhao
Publication venue: 'Elsevier BV'
Publication date: 24/03/2017
Field of study

Under normality and homoscedasticity assumptions, Linear Discriminant Analysis (LDA) is known to be optimal in terms of minimising the Bayes error for binary classification. In the heteroscedastic case, LDA is not guaranteed to minimise this error. Assuming heteroscedasticity, we derive a linear classifier, the Gaussian Linear Discriminant (GLD), that directly minimises the Bayes error for binary classification. In addition, we also propose a local neighbourhood search (LNS) algorithm to obtain a more robust classifier if the data is known to have a non-normal distribution. We evaluate the proposed classifiers on two artificial and ten real-world datasets that cut across a wide range of application areas including handwriting recognition, medical diagnosis and remote sensing, and then compare our algorithm against existing LDA approaches and other linear classifiers. The GLD is shown to outperform the original LDA procedure in terms of the classification accuracy under heteroscedasticity. While it compares favourably with other existing heteroscedastic LDA approaches, the GLD requires as much as 60 times lower training time on some datasets. Our comparison with the support vector machine (SVM) also shows that, the GLD, together with the LNS, requires as much as 150 times lower training time to achieve an equivalent classification accuracy on some of the datasets. Thus, our algorithms can provide a cheap and reliable option for classification in a lot of expert systems

arXiv.org e-Print Archive

Crossref

Coventry University Pure Portal

Speech Recognition

Author
Publication venue: 'IntechOpen'
Publication date: 20/04/2021
Field of study

Chapters in the first part of the book cover all the essential speech processing techniques for building robust, automatic speech recognition systems: the representation for speech signals and the methods for speech-features extraction, acoustic and language modeling, efficient algorithms for searching the hypothesis space, and multimodal approaches to speech recognition. The last part of the book is devoted to other speech processing applications that can use the information from automatic speech recognition for speaker identification and tracking, for prosody modeling in emotion-detection systems and in other speech processing applications that are able to operate in real-world environments, like mobile communication services and smart homes

Directory of Open Access Books (DOAB)

Discriminant feature analysis for pattern recognition

Author: HUANG DONG
Publication venue
Publication date: 21/01/2010
Field of study

Ph.DDOCTOR OF PHILOSOPH

ScholarBank@NUS

LANDSAT-D investigations in snow hydrology

Author: Davis R. E.
Dozier J.
Dubayah R. O.
Frew J. E.
Li S.
Marks D.
Milliff R. F.
Rousseau D. D.
Wan Z. M.
Publication venue
Publication date
Field of study

Work undertaken during the contract and its results are described. Many of the results from this investigation are available in journal or conference proceedings literature - published, accepted for publication, or submitted for publication. For these the reference and the abstract are given. Those results that have not yet been submitted separately for publication are described in detail. Accomplishments during the contract period are summarized as follows: (1) analysis of the snow reflectance characteristics of the LANDSAT Thematic Mapper, including spectral suitability, dynamic range, and spectral resolution; (2) development of a variety of atmospheric models for use with LANDSAT Thematic Mapper data. These include a simple but fast two-stream approximation for inhomogeneous atmospheres over irregular surfaces, and a doubling model for calculation of the angular distribution of spectral radiance at any level in an plane-parallel atmosphere; (3) incorporation of digital elevation data into the atmospheric models and into the analysis of the satellite data; and (4) textural analysis of the spatial distribution of snow cover

NASA Technical Reports Server

Recommended from our members

The Prediction of Bank Certificates of Deposit Ratings

Author: Kim Mi-hyung
Publication venue: 'University of North Texas Libraries'
Publication date: 01/05/1991
Field of study

The purpose of the study was to find the best prediction models of short-term bank CD ratings using financial variables. This study used short-term bank CD ratings assigned by Moody's and Standard and Poor's

UNT Digital Library

Tracing the assembly history of NGC 1395 through its Globular Cluster System

Author: Escudero Carlos Gabriel
Faifer Favio Raúl
Forte Juan Carlos
González Nélida Mabel
Scalia María Cecilia
Sesto Leandro Alberto
Smith Castelli Analía Viviana
Publication venue
Publication date: 22/04/2020
Field of study

We used deep Gemini-South/GMOS g'r'i'z' images to study the globular cluster (GC) system of the massive elliptical galaxy NGC1395, located in the Eridanus supergroup. The photometric analysis of the GC candidates reveals a clear colour bimodality distribution, indicating the presence of 'blue' and 'red' GC subpopulations. While a negative radial colour gradient is detected in the projected spatial distribution of the red GCs, the blue GCs display a shallow colour gradient. The blue GCs also display a remarkable shallow and extended surface density profile, suggesting a significant accretion of low-mass satellites in the outer halo of the galaxy. In addition, the slope of the projected spatial distribution of the blue GCs in the outer regions of the galaxy, is similar to that of the X-ray halo emission. Integrating up to 165 kpc the profile of the projected spatial distribution of the GCs, we estimated a total GC population and specific frequency of 6000 ± 1100 and S N = 7.4 ± 1.4, respectively. Regarding NGC1395 itself, the analysis of the deep Gemini/GMOS images shows a low surface brightness umbrella-like structure indicating, at least, one recent merger event. Through relations recently published in the literature, we obtained global parameters, such as M stellar = 9.32 × 10 11 M⊙ and M h = 6.46 × 10 13 M⊙. Using public spectroscopic data, we derive stellar population parameters of the central region of the galaxy by the full spectral fitting technique. We have found that this region seems to be dominated for an old stellar population, in contrast to findings of young stellar populations from the literature.Instituto de Astrofísica de La PlataFacultad de Ciencias Astronómicas y Geofísica

Tracing the assembly history of NGC 1395 through its Globular Cluster System

Author: Escudero Carlos Gabriel
Faifer Favio Raúl
Forte Juan Carlos
González Nélida Mabel
Scalia María Cecilia
Sesto Leandro Alberto
Smith Castelli Analía Viviana
Publication venue
Publication date: 22/04/2020
Field of study

Servicio de Difusión de la Creación Intelectual

Recommended from our members

Adapting Quantitative Protein and Phosphorylation Analyses to a Proteome-Wide Scale

Author: Grady Joshua Terrence Wilson
Publication venue: 'Harvard University Botany Libraries'
Publication date: 30/09/2013
Field of study

Liquid chromatography coupled tandem mass spectrometry (LC-MS/MS) has become the preferred method for large-scale peptide and phosphopeptide identification and quantification. The dominance of LC-MS/MS is the result of improved chromatographic, mass spectrometry and bioinformatic technologies. The applications of these technological improvements drive biological innovation by expanding the realm of possible experimentation, facilitating the creation and evaluation of novel hypotheses. Such improvements are the focus of this dissertation. New technologies are presented and their proteome wide applications in biological systems are demonstrated. A comparison of common phosphopeptide enrichment methods is presented in chapter two, which demonstrates that a combination of methods provides non-overlapping data sets. This comparison was performed in mitotically arrested fission yeast, a previously unstudied system by phosphoproteomic methods. This chapter remarks upon phosphorylation site conservation between lower and higher eukaryotes, as a means of predicting potentially relevant phosphorylation events in mammals. A new protocol for tissue based peptide quantification is presented in chapter three. The large-scale application of this method is detailed in a system of mouse liver phosphorylation, between fasted and re-fed states. The effect of peptide and protein level false discovery rates on the accuracy of phosphorylation site quantification is highlighted. This method is a cost-effective alternative to available techniques, such as metabolic labeling, and expands the application of proteomics to include larger animals. Finally, an in depth analysis of quantitative LC-MS/MS based multiplexing is the subject of the last chapter. New techniques for peptide pre-fractionation and ion quantification are discussed, which improve proteome coverage and quantitative accuracy. This proteome-wide multiplexing is applied to an analysis of the budding yeast environmental stress response. Applicable methods of data processing and a means of obtaining biologically relevant information out of multidimensional proteomic data sets are discussed. In all chapters, the data presented represent the largest analyses of their kind. This dissertation provides a solid guide for future proteome-wide studies, focused on the identification and quantification of peptides and their posttranslational modifications

Harvard University - DASH