research

Tamil Speech Recognition using Semi Continuous Models

Abstract

Abstract- In this paper novel approach for implementing Tamil Language Semi continuous speech recognition based on Hidden Markov Models is discussed. Tamil and other Indian languages share phonological features which are rich in vowel and consonant realizations. The same phone in different words has different realizations. This can be overcome by employing phone-in-context. Therefore triphone models were chosen as suitable sub-word units for acoustic training. The system is trained with speech corpus of 37 Tamil phones. Speech corpus consisted of 0.35 hours of speech. Training was done using Carnegie Mellon University (CMU)’s SphinxTrain acoustic model Trainer. Accuracy of the training is measured by decoding using PocketSphinx

    Similar works