94 research outputs found

    Handwritten Character Recognition of South Indian Scripts: A Review

    Full text link
    Handwritten character recognition is always a frontier area of research in the field of pattern recognition and image processing and there is a large demand for OCR on hand written documents. Even though, sufficient studies have performed in foreign scripts like Chinese, Japanese and Arabic characters, only a very few work can be traced for handwritten character recognition of Indian scripts especially for the South Indian scripts. This paper provides an overview of offline handwritten character recognition in South Indian Scripts, namely Malayalam, Tamil, Kannada and Telungu.Comment: Paper presented on the "National Conference on Indian Language Computing", Kochi, February 19-20, 2011. 6 pages, 5 figure

    WRITER IDENTIFICATION BY TEXTURE ANALYSIS BASED ON KANNADA HANDWRITING

    Get PDF
    Writer identification problem is one of the important area of research due to its various applications and is a challenging task. The major research on writer identification is based on handwritten English documents with text independent and dependent. However, there is no significant work on identification of writers based on Kannada document. Hence, in this paper, we propose a text-independent method for off-line writer identification based on Kannada handwritten scripts. By observing each individual’s handwriting as a different texture image, a set of features based on Discrete Cosine Transform, Gabor filtering and gray level co-occurrence matrix, are extracted from preprocessed document image blocks. Experimental results demonstrate that the Gabor energy features are more potential than the DCTs and GLCMs based features for writer identification from 20 people

    Deep Learning Based Models for Offline Gurmukhi Handwritten Character and Numeral Recognition

    Get PDF
    Over the last few years, several researchers have worked on handwritten character recognition and have proposed various techniques to improve the performance of Indic and non-Indic scripts recognition. Here, a Deep Convolutional Neural Network has been proposed that learns deep features for offline Gurmukhi handwritten character and numeral recognition (HCNR). The proposed network works efficiently for training as well as testing and exhibits a good recognition performance. Two primary datasets comprising of offline handwritten Gurmukhi characters and Gurmukhi numerals have been employed in the present work. The testing accuracies achieved using the proposed network is 98.5% for characters and 98.6% for numerals

    A study on the use of Gabor features for Chinese OCR

    Get PDF
    The authors revisit the topic of Gabor feature extraction for Chinese OCR. We adopt a very simple discriminant function to construct a maximum discriminant function based character recognizer. We experiment with a simple way of forming a feature vector for each character image by extracting Gabor features using one wavelength at locations uniformly sampled with one spatial resolution. Extensive experiments on large vocabulary Chinese OCR for both machine-printed and handwritten characters are performed by using a large amount of training and testing data to demonstrate the effectiveness of the Gabor features for Chinese OCR. Using Gabor features as raw features, we have constructed several state-of-the-art Chinese OCR engines.published_or_final_versio

    Pattern detection and recognition using over-complete and sparse representations

    Get PDF
    Recent research in harmonic analysis and mammalian vision systems has revealed that over-complete and sparse representations play an important role in visual information processing. The research on applying such representations to pattern recognition and detection problems has become an interesting field of study. The main contribution of this thesis is to propose two feature extraction strategies - the global strategy and the local strategy - to make use of these representations. In the global strategy, over-complete and sparse transformations are applied to the input pattern as a whole and features are extracted in the transformed domain. This strategy has been applied to the problems of rotation invariant texture classification and script identification, using the Ridgelet transform. Experimental results have shown that better performance has been achieved when compared with Gabor multi-channel filtering method and Wavelet based methods. The local strategy is divided into two stages. The first one is to analyze the local over-complete and sparse structure, where the input 2-D patterns are divided into patches and the local over-complete and sparse structure is learned from these patches using sparse approximation techniques. The second stage concerns the application of the local over-complete and sparse structure. For an object detection problem, we propose a sparsity testing technique, where a local over-complete and sparse structure is built to give sparse representations to the text patterns and non-sparse representations to other patterns. Object detection is achieved by identifying patterns that can be sparsely represented by the learned. structure. This technique has been applied. to detect texts in scene images with a recall rate of 75.23% (about 6% improvement compared with other works) and a precision rate of 67.64% (about 12% improvement). For applications like character or shape recognition, the learned over-complete and sparse structure is combined. with a Convolutional Neural Network (CNN). A second text detection method is proposed based on such a combination to further improve (about 11% higher compared with our first method based on sparsity testing) the accuracy of text detection in scene images. Finally, this method has been applied to handwritten Farsi numeral recognition, which has obtained a 99.22% recognition rate on the CENPARMI Database and a 99.5% recognition rate on the HODA Database. Meanwhile, a SVM with gradient features achieves recognition rates of 98.98% and 99.22% on these databases respectivel

    Feature Extraction Methods for Character Recognition

    Get PDF
    Not Include
    • …
    corecore