Search CORE

900 research outputs found

The Performance Evaluation of Speech Recognition by Comparative Approach

Author: Rajeev R.
Raviteja R.
Venkateswarlu R.L.K.
Publication venue: 'IntechOpen'
Publication date: 12/09/2012
Field of study

IntechOpen

Evaluation of preprocessors for neural network speaker verification

Author: Salleh Sheikh-Hussain
Publication venue: The University of Edinburgh
Publication date: 01/01/1997
Field of study

Edinburgh Research Archive

Neural Network Configurations Analysis for Multilevel Speech Pattern Recognition System with Mixture of Experts

Author: Barros Filho Allan Kardec Duailibe
Rocha Priscila Lima
Santos Silva Washington Luis
Publication venue: 'IntechOpen'
Publication date: 20/12/2017
Field of study

This chapter proposes to analyze two configurations of neural networks to compose the expert set in the development of a multilevel speech signal pattern recognition system of 30 commands in the Brazilian Portuguese language. Then, multilayer perceptron (MLP) and learning vector quantization (LVQ) networks have their performances verified during the training, validation and test stages in the speech signal recognition, whose patterns are given by two-dimensional time matrices, result from mel-cepstral coefficients coding by the discrete cosine transform (DCT). In order to avoid the pattern separability problem, the patterns are modified by a nonlinear transformation to a high-dimensional space through a suitable set of Gaussian radial base functions (GRBF). The performance of MLP and LVQ experts is improved and configurations are trained with few examples of each modified pattern. Several combinations were performed for the neural network topologies and algorithms previously established to determine the network structures with the best hit and generalization results

IntechOpen

Crossref

Towards the use of sub-band processing in automatic speaker recognition

Author: Finan Robert
Publication venue
Publication date: 01/01/1998
Field of study

Abertay Research Portal

Southampton (e-Prints Soton)

Speaker verification using sequence discriminant support vector machines

Author: Renals S.
Wan V.
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2005
Field of study

This paper presents a text-independent speaker verification system using support vector machines (SVMs) with score-space kernels. Score-space kernels generalize Fisher kernels and are based on underlying generative models such as Gaussian mixture models (GMMs). This approach provides direct discrimination between whole sequences, in contrast with the frame-level approaches at the heart of most current systems. The resultant SVMs have a very high dimensionality since it is related to the number of parameters in the underlying generative model. To address problems that arise in the resultant optimization we introduce a technique called spherical normalization that preconditions the Hessian matrix. We have performed speaker verification experiments using the PolyVar database. The SVM system presented here reduces the relative error rates by 34% compared to a GMM likelihood ratio system

Crossref

Edinburgh Research Archive

Edinburgh Research Explorer

White Rose Research Online

Speaker Independent Speech Recognition Using Neural Network

Author: Tan Chin Luh
Publication venue
Publication date: 01/12/2004
Field of study

In spite of the advances accomplished throughout the last few decades, automatic speech recognition (ASR) is still a challenging and difficult task when the systems are applied in the real world. Different requirements for various applications drive the researchers to explore for more effective ways in the particular application. Attempts to apply artificial neural networks (ANN) as a classification tool are proposed to increase the reliability of the system. This project studies the approach of using neural network for speaker independent isolated word recognition on small vocabularies and proposes a method to have a simple MLP as speech recognizer. Our approach is able to overcome the current limitations of MLP in the selection of input buffers’ size by proposing a method on frames selection. Linear predictive coding (LPC) has been applied to represent speech signal in frames in early stage. Features from the selected frames are used to train the multilayer perceptrons (MLP) feedforward back-propagation (FFBP) neural network during the training stage. Same routine has been applied to the speech signal during the recognition stage and the unknown test pattern will be classified to one of the nearest pattern. In short, the selected frames represent the local features of the speech signal and all of them contribute to the global similarity for the whole speech signal. The analysis, design and the PC based voice dialling system is developed using MATLAB®

Universiti Putra Malaysia Institutional Repository

Analysis of back propagation and radial basis function neural networks for handover decisions in wireless communication

Author: Mahajan Payal
Zaheeruddin Zaheeruddin
Publication venue: 'Institute of Advanced Engineering and Science'
Publication date: 01/10/2020
Field of study

In mobile systems, handoff is a vital process, referring to a process of allocating an ongoing call from one BS to another BS. The handover technique is very important to maintain the Quality of service. Handover algorithms, based on neural networks, fuzzy logic etc. can be used for the same purpose to keep Quality of service as high as possible. In this paper, it is proposed that back propagation networks and radial basis functions may be used for taking handover decision in wireless communication networks. The performance of these classifiers is evaluated on the basis of neurons in hidden layer, training time and classification accuracy. The proposed approach shows that radial basis function neural network give better results for making handover decisions in wireless heterogeneous networks with classification accuracy of 90%

ZENODO

Institute of Advanced Engineering and Science

A Review on Speech Recognition Methods

Author: Ram Paul, Rajender Kr. Beniwal, Rinku Kumar, Rohit Saini
Publication venue: Auricle Global Society of Education and Research
Publication date: 26/02/2018
Field of study

Voice recognition is the identification of a speaker on the basis of the characteristics of voices. For this, features of speech patterns that differ between individuals are used to achieve the objective. In this paper speaker recognition system are discussed. Implementation of speaker's voice recognition system with MATLAB makes possible use of voice for real life applications. This paper provides a brief review of different DSP based techniques applied for speech recognition

International Journal on Future Revolution in Computer Science & Communication Engineering