Search CORE

675 research outputs found

Robust audio indexing for Dutch spoken-word collections

Author: Huijbregts Marijn
Jong Franciska de
Leeuwen David van
Ordelman Roeland
Publication venue: KNAW
Publication date: 01/01/2005

Abstract: Whereas the growth of storage capacity is in accordance with widely acknowledged predictions, the possibilities to index and access the archives created are lagging behind. This is especially the case in the oral history domain, and much of the rich content in these collections runs the risk of remaining inaccessible for lack of robust search technologies. This paper addresses the history and development of robust audio indexing technology for searching Dutch spoken-word collections and compares Dutch audio indexing in the well-studied broadcast news domain with an oral-history case study. It is concluded that despite significant advances in Dutch audio indexing technology and demonstrated applicability in several domains, further research is indispensable for successful automatic disclosure of spoken-word collections.

University of Twente Research Information

The INTERSPEECH 2013 computational paralinguistics challenge: social signals, conflict, emotion, autism

Author: Batliner Anton
Chetouani Mohamed
Eyben Florian
Kim Samuel
Marchi Erik
Mortillaro Marcello
Polychroniou Anna
Ringeval Fabien
Salamin Hugues
Scherer Klaus
Schuller Björn
Steidl Stefan
Valente Fabio
Vinciarelli Alessandro
Weninger Felix
Publication date: 01/01/2013

The INTERSPEECH 2013 Computational Paralinguistics Challenge provides for the first time a unified test-bed for Social Signals such as laughter in speech. It further introduces conflict in group discussions as a new task and picks up on autism and its manifestations in speech. Finally, emotion is revisited as a task, albeit with a broader range of twelve emotional states overall. In this paper, we describe these four Sub-Challenges, the Challenge conditions, baselines, and a new feature set by the openSMILE toolkit, provided to the participants.

CiteSeerX

Hal - Université Grenoble Alpes

Enlighten

Hal-Diderot

Archive ouverte UNIGE

Unravelling the voice of Willem Frederik Hermans: an oral history indexing case study

Author: Huijbregts Marijn
Jong Franciska de
Ordelman Roeland
Publication venue: University of Twente, Centre for Telematics and Information Technology (CTIT)
Publication date: 01/01/2009

University of Twente Research Information

Combination of SVM and Large Margin GMM modeling for speaker identification

Author: Aboutajdine Driss
André-Obrecht Régine
Daoudi Khalid
Jourani Reda
Publication venue: HAL CCSD
Publication date: 09/09/2013

Most state-of-the-art speaker recognition systems are partially or completely based on Gaussian mixture models (GMM). GMM have been widely and successfully used in speaker recognition during the last decades. They are traditionally estimated from a world model using the generative criterion of Maximum A Posteriori. In an earlier work, we proposed an efficient algorithm for discriminative learning of GMM with diagonal covariances under a large margin criterion. In this paper, we evaluate the combination of the large margin GMM modeling approach with SVM in the setting of speaker identification. We carry out a full NIST speaker identification task using NIST-SRE'2006 data, in a Symmetrical Factor Analysis compensation scheme. The results show that the two modeling approaches are complementary and that their combination outperforms either approach used alone.

Scientific Publications of the University of Toulouse II Le Mirail

INRIA a CCSD electronic archive server

HAL-Rennes 1

Large Margin GMM for discriminative speaker verification

Author: Aboutajdine Driss
André-Obrecht Régine
Daoudi Khalid
Jourani Reda
Publication venue: Springer Verlag
Publication date: 01/01/2012

Gaussian mixture models (GMM), trained using the generative criterion of maximum likelihood estimation, have been the most popular approach in speaker recognition during the last decades. This approach is also widely used in many other classification tasks and applications. Generative learning is not, however, the optimal way to address classification problems. In this paper we first present a new algorithm for discriminative learning of diagonal GMM under a large margin criterion. This algorithm has the major advantage of being highly efficient, which allows fast discriminative GMM training using large scale databases. We then evaluate its performance on a full NIST speaker verification task using NIST-SRE'2006 data. In particular, we use the popular Symmetrical Factor Analysis (SFA) for session variability compensation. The results show that our system outperforms the state-of-the-art approaches of GMM-SFA and the SVM-based one, GSL-NAP. Relative reductions of the Equal Error Rate of about 9.33% and 14.88% are respectively achieved over these systems.

Scientific Publications of the University of Toulouse II Le Mirail

INRIA a CCSD electronic archive server

HAL-Rennes 1

Discriminative speaker recognition using Large Margin GMM

Author: BGB Fauve
CM Bishop
DA Reynolds
Driss Aboutajdine
J Keshet
J Louradour
J Nocedal
K Daoudi
Khalid Daoudi
O Viikki
P Kenny
Reda Jourani
Régine André-Obrecht
S Davis
WM Campbell
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/07/2012

Most state-of-the-art speaker recognition systems are based on discriminative learning approaches. On the other hand, generative Gaussian mixture models (GMM) have been widely used in speaker recognition during the last decades. In an earlier work, we proposed an algorithm for discriminative training of GMM with diagonal covariances under a large margin criterion. In this paper, we propose an improvement of this algorithm which has the major advantage of being computationally highly efficient, and thus well suited to handle large scale databases. We also develop a new strategy to detect and handle the outliers that occur in the training data. To evaluate the performance of our new algorithm, we carry out full NIST speaker identification and verification tasks using NIST-SRE'2006 data, in a Symmetrical Factor Analysis compensation scheme. The results show that our system significantly outperforms the traditional discriminative Support Vector Machines (SVM) based system of SVM-GMM supervectors, in the two speaker recognition tasks.

Crossref

Scientific Publications of the University of Toulouse II Le Mirail

INRIA a CCSD electronic archive server

HAL-Rennes 1

Apprentissage discriminant des GMM à grande marge pour la vérification automatique du locuteur

Author: Aboutajdine Driss
André-Obrecht Régine
Daoudi Khalid
Jourani Reda
Publication venue: HAL CCSD
Publication date: 05/09/2011

Gaussian mixture models (GMM) have been widely and successfully used in speaker recognition during the last decades. They are generally trained using the generative criterion of maximum likelihood estimation. In an earlier work, we proposed an algorithm for discriminative training of GMM with diagonal covariances under a large margin criterion. In this paper, we present a new version of this algorithm which has the major advantage of being computationally highly efficient. The resulting algorithm is thus well suited to handle large scale databases. To show the effectiveness of the new algorithm, we carry out a full NIST speaker verification task using NIST-SRE'2006 data. The results show that our system outperforms the baseline GMM, and does so with high computational efficiency.

Scientific Publications of the University of Toulouse II Le Mirail

INRIA a CCSD electronic archive server

HAL-Rennes 1