7 research outputs found

Using Genetic Algorithm to Improve the Performance of Speech Recognition Based on Artificial Neural Network

Authors: Chih-Chin Lai; Shing-Tai Pan
Publication venue: 'IntechOpen'
Publication date: 01/06/2007

Isolated Word Recognition Using Ergodic Hidden Markov Models and Genetic Algorithm

Authors: Emillia Nyoman Rizkha; Maharani Warih; Suyanto Suyanto
Publication venue: 'Universitas Ahmad Dahlan'
Publication date: 01/03/2012

Speech-to-text is a speech recognition application in which a speech signal is processed, recognized, and converted into a textual representation. The hidden Markov model (HMM) is a widely used method in speech recognition. However, the accuracy of an HMM is strongly influenced by the optimization of the extraction process and the modelling methods. This research therefore tests the use of a genetic algorithm (GA) to optimize the ergodic HMM. In the hybrid HMM-GA, the GA is used to optimize the Baum-Welch method in the training process. This helps improve recognition accuracy when the HMM parameters produced by training yield low accuracy in testing. In the experiments, the level of accuracy increased by 20% to 41%. This shows that combining a GA with the HMM method can give better results than an HMM system that is not combined with any other method.

TELKOMNIKA (Telecommunication Computing Electronics and Control)

UAD Journal Management System

Genetic Algorithm for Combined Speaker and Speech Recognition using Deep Neural Networks, Journal of Telecommunications and Information Technology, 2018, nr 2

Authors: Kaur Gurpreet; Kumar Amod; Srivastava Mohit
Publication venue: 'National Institute of Telecommunications'

Huge growth is observed in the speech and speaker recognition field due to many artificial intelligence algorithms being applied. Speech is used to convey messages via the language being spoken, emotions, gender and speaker identity. Many real applications in healthcare are based upon speech and speaker recognition, e.g. a voice-controlled wheelchair helps control the chair. In this paper, we use a genetic algorithm (GA) for combined speaker and speech recognition, relying on optimized Mel Frequency Cepstral Coefficient (MFCC) speech features, and classification is performed using a Deep Neural Network (DNN). In the first phase, feature extraction using MFCC is executed. Then, feature optimization is performed using GA. In the second phase training is conducted using DNN. Evaluation and validation of the proposed work model is done by setting a real environment, and efficiency is calculated on the basis of such parameters as accuracy, precision rate, recall rate, sensitivity, and specificity. Also, this paper presents an evaluation of such feature extraction methods as linear predictive coding coefficient (LPCC), perceptual linear prediction (PLP), mel frequency cepstral coefficients (MFCC) and relative spectra filtering (RASTA), with all of them used for combined speaker and speech recognition systems. A comparison of different methods based on existing techniques for both clean and noisy environments is made as well.

Biblioteka Cyfrowa Instytutu Łączności / National Institute of Telecomunications: Digital Library

Using Genetic Algorithm to Improve the Performance of Speech Recognition Based on Artificial Neural Network

Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2006

Crossref

Using Genetic Algorithm to Improve the Performance of Speech Recognition Based on Artificial Neural Network


Speech recognition systems have been under development for some time. Recognition platforms can be divided into three types. Dynamic Time Warping (DTW) (Sakoe, 1978), th

CiteSeerX

Application of neural networks in whispered speech recognition.

Author: Grozdić Đorđe T.
Publication venue: Универзитет у Београду, Електротехнички факултет
Publication date: 25/10/2017

The recent success of deep neural networks (DNN) in various machine learning tasks has contributed significantly to the renewed popularity of artificial neural networks (ANN) and to their current role in automatic speech recognition (ASR). This thesis examines how artificial neural networks can be applied to automatic whispered speech recognition..

National Repository of Dissertations in Serbia (NaRDuS)

Journal of Telecommunications and Information Technology, 2018, nr 2

Publication venue: 'National Institute of Telecommunications'

kwartalni

Biblioteka Cyfrowa Instytutu Łączności / National Institute of Telecomunications: Digital Library