Neural Network Configurations Analysis for Multilevel Speech Pattern Recognition System with Mixture of Experts

Author: Barros Filho Allan Kardec Duailibe
Rocha Priscila Lima
Santos Silva Washington Luis
Publication venue: 'IntechOpen'
Publication date: 20/12/2017

This chapter analyzes two neural network configurations as candidates for the expert set of a multilevel speech pattern recognition system for 30 commands in Brazilian Portuguese. Multilayer perceptron (MLP) and learning vector quantization (LVQ) networks are evaluated during the training, validation and test stages of speech signal recognition. The patterns are two-dimensional time matrices obtained by coding mel-cepstral coefficients with the discrete cosine transform (DCT). To avoid the pattern separability problem, the patterns are mapped by a nonlinear transformation into a high-dimensional space through a suitable set of Gaussian radial basis functions (GRBF). The performance of the MLP and LVQ experts improves, and the configurations are trained with few examples of each modified pattern. Several combinations of the previously established network topologies and algorithms were tested to determine the network structures with the best hit and generalization results.
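
The two preprocessing steps named in this abstract can be sketched roughly as follows. This is a minimal illustration, not the chapter's implementation: the function names, the DCT-II scaling, the choice of centers and the width `sigma` are all assumptions, since the abstract does not give them. The first function compresses a sequence of per-frame coefficient vectors along the time axis into a fixed-size two-dimensional matrix; the second maps a flattened pattern to one Gaussian radial basis activation per center, which yields a higher-dimensional representation when there are more centers than pattern components.

```python
import math


def dct_time_matrix(frames, n_time_coeffs):
    """Hypothetical helper: DCT-II along the time axis of a frame sequence.

    frames is a list of per-frame coefficient vectors (e.g. mel-cepstral
    coefficients). Returns a matrix with one row per coefficient index and
    n_time_coeffs columns. The 1/n_frames scaling is an assumption.
    """
    n_frames = len(frames)
    n_coeffs = len(frames[0])
    matrix = []
    for k in range(n_coeffs):
        row = []
        for n in range(n_time_coeffs):
            total = sum(
                frames[t][k] * math.cos(math.pi * n * (2 * t + 1) / (2 * n_frames))
                for t in range(n_frames)
            )
            row.append(total / n_frames)
        matrix.append(row)
    return matrix


def grbf_features(pattern, centers, sigma):
    """Hypothetical helper: one Gaussian RBF activation per center.

    pattern and each center are flat vectors of equal length; sigma is a
    shared width (an assumption, real systems may use one width per center).
    """
    features = []
    for center in centers:
        sq_dist = sum((p - c) ** 2 for p, c in zip(pattern, center))
        features.append(math.exp(-sq_dist / (2.0 * sigma ** 2)))
    return features
```

In such a sketch, the rows of the time matrix would be flattened into a single vector before being passed to `grbf_features`, and the resulting activations would serve as the input to an MLP or LVQ expert.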

IntechOpen

Crossref

Janus - towards multilingual spoken language translation

Author: Geutner Petra
Kemp Thomas
Rogina Ivica
Schultz Tanja
Sloboda Tilo
Suhm Bernhard
Waibel Alexander
Woszczyna Monika
et al.
Publication venue: San Francisco
Publication date: 01/01/1995

KITopen

Recent advances in Janus: a speech translation system

Author: Coccaro Noah
Eisele Andreas
Mcnair A.
Rogina Ivica
Sloboda Tilo
Waibel Alex
Woszczyna Monika
et al.
Publication date: 02/08/2007

KITopen

Evaluation of preprocessors for neural network speaker verification

Author: Salleh Sheikh-Hussain
Publication venue: The University of Edinburgh
Publication date: 01/01/1997

Edinburgh Research Archive

Biologically inspired speaker verification

Author: Tashan T
Publication date: 01/01/2012

Speaker verification is an active research problem that has been addressed using a variety of classification techniques. In general, however, methods inspired by the human auditory system tend to show better verification performance than other methods. In this thesis, three biologically inspired speaker verification algorithms are presented.

Nottingham Trent Institutional Repository (IRep)

Gender voice classification with huge accuracy rate

Author: Abd Thulfiqar
Mezaal Yaqeen S.
Shareef Mustafa Sahib
Publication venue: 'Universitas Ahmad Dahlan'
Publication date: 01/10/2020

Gender recognition from voice is an important research field in acoustics and speech processing, as the human voice shows remarkable characteristics. This study analyzes speech signals to build a gender classifier that predicts the gender of the speaker from diverse parameters of the voice sample. The database contains 2270 voice samples of celebrities, both male and female. Using Mel frequency cepstrum coefficients (MFCC), vector quantization (VQ), and a machine learning algorithm (J48), the proposed classification technique, based on data mining and Java script, achieves an accuracy of about 100%.

Journal of Education and Learning (EduLearn)

TELKOMNIKA (Telecommunication Computing Electronics and Control)

UAD Journal Management System