Support Vector Machines Study on English Isolated-Word-Error Classification and Regression

Abstract

A better understanding of word classification and regression could lead to better error detection and correction techniques. We use several features (attributes) to represent a machine-printed English word, and a support vector machine uses those features to assign each word to one of two classes: correct or wrong. Our proposed support vector model classifies words using fewer training words, because the training words are treated as personalized words. Wrong words can then be replaced by correct words predicted by the regression process. Our results are very encouraging when compared with neural networks, Hamming distance, and minimum edit distance techniques, with further improvement in sight.
