72 research outputs found
Development of Comprehensive Devnagari Numeral and Character Database for Offline Handwritten Character Recognition
In handwritten character recognition, benchmark database plays an important
role in evaluating the performance of various algorithms and the results
obtained by various researchers. In Devnagari script, there is lack of such
official benchmark. This paper focuses on the generation of offline benchmark
database for Devnagari handwritten numerals and characters. The present work
generated 5137 and 20305 isolated samples for numeral and character database,
respectively, from 750 writers of all ages, sex, education, and profession. The
offline sample images are stored in TIFF image format as it occupies less
memory. Also, the data is presented in binary level so that memory requirement
is further reduced. It will facilitate research on handwriting recognition of
Devnagari script through free access to the researchers.Comment: 5 pages, 8 figures, journal pape
Handwritten Character Recognition of South Indian Scripts: A Review
Handwritten character recognition is always a frontier area of research in
the field of pattern recognition and image processing and there is a large
demand for OCR on hand written documents. Even though, sufficient studies have
performed in foreign scripts like Chinese, Japanese and Arabic characters, only
a very few work can be traced for handwritten character recognition of Indian
scripts especially for the South Indian scripts. This paper provides an
overview of offline handwritten character recognition in South Indian Scripts,
namely Malayalam, Tamil, Kannada and Telungu.Comment: Paper presented on the "National Conference on Indian Language
Computing", Kochi, February 19-20, 2011. 6 pages, 5 figure
Bangla handwritten numeral recognition using convolutional neural network
Recognition of handwritten numerals has gained much interest in recent years due to its various application
potentials. Although Bangla is a major language in Indian
subcontinent and is the first language of Bangladesh study
regarding Bangla handwritten numeral recognition (BHNR) is
very few with respect to other major languages such Roman.
The existing BHNR methods uses distinct feature extraction
techniques and various classification tools in their recognition schemes. Recently, convolutional neural network (CNN) is found efficient for image classification with its distinct features. It also automatically provides some degree of translation invariance. In this paper, a CNN based BHNR is investigated. The proposed BHNR-CNN normalizes the written numeral images and then employ CNN to classify individual numerals. It does not employ any feature extraction method like other related works. 17000 hand written numerals with different shapes, sizes and variations are used in this study. The proposed method is shown satisfactory recognition accuracy and outperformed other prominent exiting methods
Convolutional neural network training incorporating rotation-based generated patterns and handwritten numeral recognition of major Indian scripts
Handwritten numeral recognition has gained much interest in recent times because of its diverse application potentials. Bangla and Hindi are the two major languages in Indian subcontinent and a large number of population in vast land scape uses Bangla and Devnagari numeral scripts of these two languages. Well-performed handwritten numeral recognition system for Bangla and Devnagari is challenging because of similar shaped numerals in both scripts; few numerals differ from their similar ones with a very few variation even in printed form. In this study, convolutional neural network (CNN) based two different methods have been investigated for better recognition of Bangla and Devnagari handwritten numerals. Both the methods use rotation-based generated patterns along with ordinary patterns to train CNN but in two different modes. In multiple CNN case, three different training sets (one with ordinary patterns and two with clockwise and anti-clockwise rotation-based generated patterns) are prepared; three different CNNs are trained individually with each of these training sets; and their decisions are combined for final system decision. On the other hand, in the case of single CNN, combination of above three training sets is used to train one CNN. A moderated pre-processing is also employed while generating patterns from the scanned images. The proposed methods have been tested on prominent benchmark handwritten numeral datasets and have achieved remarkable recognition accuracies. The achieved recognition accuracies are found better than reported recognition accuracies of prominent existing methods; and such outperformance mounted proposed methods as better recognition systems. Moreover, CNN's performance improvement due to use of generated patterns has also been clearly identified from the presented experimental results
Multiple convolutional neural network training for Bangla handwritten numeral recognition
Recognition of handwritten numerals has gained much interest in recent years due to its various application potentials. The progress of handwritten Bangla numeral is well behind Roman, Chinese and Arabic scripts although it is a major language in Indian subcontinent and is the first language of Bangladesh. Handwritten numeral classification is a high-dimensional complex task and existing methods use distinct feature extraction techniques and various classification tools in their recognition schemes. Recently, convolutional neural network (CNN) is found efficient for image classification with its distinct features. In this study, three different CNNs with same architecture are trained with different training sets and combined their decisions for Bangla handwritten numeral recognition. One CNN is trained with ordinary training set prepared from handwritten scan images; and training sets for other two CNNs are prepared with fixed (positive and negative, respectively) rotational angles of original images. The proposed multiple CNN based approach is shown to outperform other existing methods while tested on a popular Bangla benchmark handwritten dataset
- โฆ