Search CORE

134 research outputs found

Fast speaker verification on mobile phone data using boosted slice classifiers

Author
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date
Field of study

Fast Speaker Verification on Mobile Phone data using Boosted Slice Classifiers

Author: Magimai.-Doss Mathew
Marcel Sébastien
Roy Anindya
Publication venue
Publication date: 19/12/2013
Field of study

In this work, we investigate a novel computationally efficient speaker verification (SV) system involving boosted ensembles of simple threshold-based classifiers. The system is based on a novel set of features called “slice features”. Both the system and the features were inspired by the recent success of pixel comparison-based ensemble approaches in the computer vision domain. The performance of the proposed system was evaluated through speaker verification experiments on the MOBIO corpus containing mobile phone speech, according to a challenging protocol. The system was found to perform reasonably well, compared to multiple state-of-the-art SV systems, with the benefit of significantly lower computational complexity. Its dual characteristics of good performance and computational efficiency could be important factors in the context of SV system implementation on portable devices like mobile phones

Infoscience - École polytechnique fédérale de Lausanne

A Fast Parts-Based Approach to Speaker Verification Using Boosted Slice Classifiers

Author: Anindya Roy
Mathew Magimai.-Doss
Sébastien Marcel
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date
Field of study

Crossref

A Fast Parts-based Approach to Speaker Verification using Boosted Slice Classifiers

Author: Magimai.-Doss Mathew
Marcel Sébastien
Roy Anindya
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 19/12/2013
Field of study

Speaker verification on portable devices like smartphones is gradually becoming popular. In this context, two issues need to be considered: 1) such devices have relatively limited computation resources, and 2) they are liable to be used everywhere, possibly in very noisy, uncontrolled environments. This work aims to address both these issues by proposing a computationally efficient yet robust speaker verification system. This novel parts-based system draws inspiration from face and object detection systems in the computer vision domain. The system involves boosted ensembles of simple threshold-based classifiers. It uses a novel set of features extracted from speech spectra, called “slice features”. The performance of the proposed system was evaluated through extensive studies involving a wide range of experimental conditions using the TIMIT, HTIMIT and MOBIO corpus, against standard cepstral features and Gaussian Mixture Model-based speaker verification systems

Infoscience - École polytechnique fédérale de Lausanne

Computer Graphics and Video Features for Speaker Recognition

Author: Fér Radek
Publication venue: Vysoké učení technické v Brně. Fakulta informačních technologií
Publication date: 01/01/2012
Field of study

Tato práce popisuje netradiční metodu rozpoznávání řečníka pomocí příznaků a alogoritmů používaných převážně v počítačovém vidění. V úvodu jsou shrnuty potřebné teoretické znalosti z oblasti počítačového rozpoznávání. Jako aplikace grafických příznaků v rozpoznávání řečníka jsou detailněji popsány již známé BBF příznaky. Tyto jsou vyhodnoceny nad standardními řečovými databázemi TIMIT a NIST SRE 2010. Experimentální výsledky jsou shrnuty a porovnány se standardními metodami. V závěru jsou jsou navrženy možné směry budoucí práce.We describe a non-traditional method for speaker recognition that uses features and algorithms used mainly for computer vision. Important theoretical knowledge of computer recognition is summarized first. The Boosted Binary Features are described and explored as an already proposed method, that has roots in computer vision. This method is evaluated on standard speaker recognition databases TIMIT and NIST SRE 2010. Experimental results are given and compared to standard methods. Possible directions for future work are proposed at the end.

Digital library of Brno University of Technology

National Repository of Grey Literature

Speaker Recognition: Advancements and Challenges

Author: Homayoon Beigi
Publication venue: 'IntechOpen'
Publication date: 28/11/2012
Field of study

IntechOpen

Biometrics

Author
Publication venue: 'IntechOpen'
Publication date: 20/04/2021
Field of study

Biometrics-Unique and Diverse Applications in Nature, Science, and Technology provides a unique sampling of the diverse ways in which biometrics is integrated into our lives and our technology. From time immemorial, we as humans have been intrigued by, perplexed by, and entertained by observing and analyzing ourselves and the natural world around us. Science and technology have evolved to a point where we can empirically record a measure of a biological or behavioral feature and use it for recognizing patterns, trends, and or discrete phenomena, such as individuals' and this is what biometrics is all about. Understanding some of the ways in which we use biometrics and for what specific purposes is what this book is all about

Directory of Open Access Books (DOAB)

Bi-modal biometric authentication on mobile phones in challenging conditions

Author: Adami
Anjos
Auckenthaler
Brümmer
Brümmer
Burget
Cardinaux
Cardinaux
Christopher McCool
Dasarathy
Dehak
Dehak
Dehak
Dempster
El Shafey
Elie Khoury
Fisher
Garcia-Romero
Glembek
Hasan
Hatch
Kenny
Kenny
Khoury
Lau
Laurent El Shafey
Li
Lucey
Manuel Günther
Marcel
Martin
Mau
McCool
McCool
McCool
McLaren
Minka
Motlicek
Perera
Pigeon
Prince
Reynolds
Reynolds
Roy
Sanderson
Scheirer
Shah
Shen
Sébastien Marcel
Tan
Vogt
Wallace
Wallace
Wallace
Publication venue: 'Elsevier BV'
Publication date
Field of study

Crossref

Efficient speaker recognition for mobile devices

Author: Karpov Evgeny
Publication venue: University of Eastern Finland
Publication date
Field of study

UEF Electronic Publications

Acoustic Approaches to Gender and Accent Identification

Author: DeMarco Andrea
Publication venue
Publication date: 01/06/2015
Field of study

There has been considerable research on the problems of speaker and language recognition from samples of speech. A less researched problem is that of accent recognition. Although this is a similar problem to language identification, di�erent accents of a language exhibit more fine-grained di�erences between classes than languages. This presents a tougher problem for traditional classification techniques. In this thesis, we propose and evaluate a number of techniques for gender and accent classification. These techniques are novel modifications and extensions to state of the art algorithms, and they result in enhanced performance on gender and accent recognition. The first part of the thesis focuses on the problem of gender identification, and presents a technique that gives improved performance in situations where training and test conditions are mismatched. The bulk of this thesis is concerned with the application of the i-Vector technique to accent identification, which is the most successful approach to acoustic classification to have emerged in recent years. We show that it is possible to achieve high accuracy accent identification without reliance on transcriptions and without utilising phoneme recognition algorithms. The thesis describes various stages in the development of i-Vector based accent classification that improve the standard approaches usually applied for speaker or language identification, which are insu�cient. We demonstrate that very good accent identification performance is possible with acoustic methods by considering di�erent i-Vector projections, frontend parameters, i-Vector configuration parameters, and an optimised fusion of the resulting i-Vector classifiers we can obtain from the same data. We claim to have achieved the best accent identification performance on the test corpus for acoustic methods, with up to 90% identification rate. This performance is even better than previously reported acoustic-phonotactic based systems on the same corpus, and is very close to performance obtained via transcription based accent identification. Finally, we demonstrate that the utilization of our techniques for speech recognition purposes leads to considerably lower word error rates. Keywords: Accent Identification, Gender Identification, Speaker Identification, Gaussian Mixture Model, Support Vector Machine, i-Vector, Factor Analysis, Feature Extraction, British English, Prosody, Speech Recognition

University of East Anglia digital repository