28 research outputs found
Multiple convolutional neural network training for Bangla handwritten numeral recognition
Recognition of handwritten numerals has gained much interest in recent years due to its various application potentials. The progress of handwritten Bangla numeral is well behind Roman, Chinese and Arabic scripts although it is a major language in Indian subcontinent and is the first language of Bangladesh. Handwritten numeral classification is a high-dimensional complex task and existing methods use distinct feature extraction techniques and various classification tools in their recognition schemes. Recently, convolutional neural network (CNN) is found efficient for image classification with its distinct features. In this study, three different CNNs with same architecture are trained with different training sets and combined their decisions for Bangla handwritten numeral recognition. One CNN is trained with ordinary training set prepared from handwritten scan images; and training sets for other two CNNs are prepared with fixed (positive and negative, respectively) rotational angles of original images. The proposed multiple CNN based approach is shown to outperform other existing methods while tested on a popular Bangla benchmark handwritten dataset
Development of Comprehensive Devnagari Numeral and Character Database for Offline Handwritten Character Recognition
In handwritten character recognition, benchmark database plays an important
role in evaluating the performance of various algorithms and the results
obtained by various researchers. In Devnagari script, there is lack of such
official benchmark. This paper focuses on the generation of offline benchmark
database for Devnagari handwritten numerals and characters. The present work
generated 5137 and 20305 isolated samples for numeral and character database,
respectively, from 750 writers of all ages, sex, education, and profession. The
offline sample images are stored in TIFF image format as it occupies less
memory. Also, the data is presented in binary level so that memory requirement
is further reduced. It will facilitate research on handwriting recognition of
Devnagari script through free access to the researchers.Comment: 5 pages, 8 figures, journal pape
Diagonal Based Feature Extraction for Handwritten Alphabets Recognition System using Neural Network
An off-line handwritten alphabetical character recognition system using
multilayer feed forward neural network is described in the paper. A new method,
called, diagonal based feature extraction is introduced for extracting the
features of the handwritten alphabets. Fifty data sets, each containing 26
alphabets written by various people, are used for training the neural network
and 570 different handwritten alphabetical characters are used for testing. The
proposed recognition system performs quite well yielding higher levels of
recognition accuracy compared to the systems employing the conventional
horizontal and vertical methods of feature extraction. This system will be
suitable for converting handwritten documents into structural text form and
recognizing handwritten names
Convolutional neural network training incorporating rotation-based generated patterns and handwritten numeral recognition of major Indian scripts
Handwritten numeral recognition has gained much interest in recent times because of its diverse application potentials. Bangla and Hindi are the two major languages in Indian subcontinent and a large number of population in vast land scape uses Bangla and Devnagari numeral scripts of these two languages. Well-performed handwritten numeral recognition system for Bangla and Devnagari is challenging because of similar shaped numerals in both scripts; few numerals differ from their similar ones with a very few variation even in printed form. In this study, convolutional neural network (CNN) based two different methods have been investigated for better recognition of Bangla and Devnagari handwritten numerals. Both the methods use rotation-based generated patterns along with ordinary patterns to train CNN but in two different modes. In multiple CNN case, three different training sets (one with ordinary patterns and two with clockwise and anti-clockwise rotation-based generated patterns) are prepared; three different CNNs are trained individually with each of these training sets; and their decisions are combined for final system decision. On the other hand, in the case of single CNN, combination of above three training sets is used to train one CNN. A moderated pre-processing is also employed while generating patterns from the scanned images. The proposed methods have been tested on prominent benchmark handwritten numeral datasets and have achieved remarkable recognition accuracies. The achieved recognition accuracies are found better than reported recognition accuracies of prominent existing methods; and such outperformance mounted proposed methods as better recognition systems. Moreover, CNN's performance improvement due to use of generated patterns has also been clearly identified from the presented experimental results
Exploiting Features From Triangle Geometry For Digit Recognition
Triangle is a basic geometry. There are six type of triangle, but scalene triangle was chosen to be used in this research that based on coordinates of corners generated by our proposed algorithm. In this paper, nine features are proposed where six features were derived from coordinates and sides of triangle. Another three features are angle of corners. After features are identified, image will be zoned into 25 zones. The zoning processes are based on Cartesian plan, Vertical and Horizontal zones. From the zoning, from nine features will become 225 features. The features proposed will be used to HODA, MNIST, IFHCDB and BANGLA datasets. Experiments will be conducted using supervised learning that are Support Vector Machine (SVM) and Multi-layer Perceptron (MLP). Results from the experiments will be evaluated with different Cost (c) for the SVM and Learning Rate (LR) for the MLP. Then, the result will be compared to state of the art by other researches
SEVERAL METHODS OF FEATURE EXTRACTION TO HELP IN OPTICAL CHARACTER RECOGNITION
An Optical Character Recognition (OCR) consists of three bold steps namely Preprocessing, Feature extraction, Classification. Methods of Feature extraction yield feature vectors based on which the classification of a testing pattern is executed. The paper aims at proposing some methods of feature extraction that may go a long way to recognize a Bengali numeral or character. Pixel Ex-OR Method presents a digital gating (Ex-OR) technique to extract the information in an image. Two successive elements of a row in image matrix have been Ex-ORed and the output is again Ex-ORed with the next element. Alphabetical coding codes a binary character image by means of letters of English alphabet. Directional features find gradient information using Sobel Masks to make position of stroke clear in an image. The features have been derived in eight standard directions and then these eight feature vectors are merged into four sets of features to reduce the system complexity and hence processing time is saved considerably. These features will help develop a Bengali numeral recognition system