28 research outputs found

    Multiple convolutional neural network training for Bangla handwritten numeral recognition

    Get PDF
    Recognition of handwritten numerals has gained much interest in recent years due to its various application potentials. The progress of handwritten Bangla numeral is well behind Roman, Chinese and Arabic scripts although it is a major language in Indian subcontinent and is the first language of Bangladesh. Handwritten numeral classification is a high-dimensional complex task and existing methods use distinct feature extraction techniques and various classification tools in their recognition schemes. Recently, convolutional neural network (CNN) is found efficient for image classification with its distinct features. In this study, three different CNNs with same architecture are trained with different training sets and combined their decisions for Bangla handwritten numeral recognition. One CNN is trained with ordinary training set prepared from handwritten scan images; and training sets for other two CNNs are prepared with fixed (positive and negative, respectively) rotational angles of original images. The proposed multiple CNN based approach is shown to outperform other existing methods while tested on a popular Bangla benchmark handwritten dataset

    Development of Comprehensive Devnagari Numeral and Character Database for Offline Handwritten Character Recognition

    Get PDF
    In handwritten character recognition, benchmark database plays an important role in evaluating the performance of various algorithms and the results obtained by various researchers. In Devnagari script, there is lack of such official benchmark. This paper focuses on the generation of offline benchmark database for Devnagari handwritten numerals and characters. The present work generated 5137 and 20305 isolated samples for numeral and character database, respectively, from 750 writers of all ages, sex, education, and profession. The offline sample images are stored in TIFF image format as it occupies less memory. Also, the data is presented in binary level so that memory requirement is further reduced. It will facilitate research on handwriting recognition of Devnagari script through free access to the researchers.Comment: 5 pages, 8 figures, journal pape

    Diagonal Based Feature Extraction for Handwritten Alphabets Recognition System using Neural Network

    Full text link
    An off-line handwritten alphabetical character recognition system using multilayer feed forward neural network is described in the paper. A new method, called, diagonal based feature extraction is introduced for extracting the features of the handwritten alphabets. Fifty data sets, each containing 26 alphabets written by various people, are used for training the neural network and 570 different handwritten alphabetical characters are used for testing. The proposed recognition system performs quite well yielding higher levels of recognition accuracy compared to the systems employing the conventional horizontal and vertical methods of feature extraction. This system will be suitable for converting handwritten documents into structural text form and recognizing handwritten names

    Convolutional neural network training incorporating rotation-based generated patterns and handwritten numeral recognition of major Indian scripts

    Get PDF
    Handwritten numeral recognition has gained much interest in recent times because of its diverse application potentials. Bangla and Hindi are the two major languages in Indian subcontinent and a large number of population in vast land scape uses Bangla and Devnagari numeral scripts of these two languages. Well-performed handwritten numeral recognition system for Bangla and Devnagari is challenging because of similar shaped numerals in both scripts; few numerals differ from their similar ones with a very few variation even in printed form. In this study, convolutional neural network (CNN) based two different methods have been investigated for better recognition of Bangla and Devnagari handwritten numerals. Both the methods use rotation-based generated patterns along with ordinary patterns to train CNN but in two different modes. In multiple CNN case, three different training sets (one with ordinary patterns and two with clockwise and anti-clockwise rotation-based generated patterns) are prepared; three different CNNs are trained individually with each of these training sets; and their decisions are combined for final system decision. On the other hand, in the case of single CNN, combination of above three training sets is used to train one CNN. A moderated pre-processing is also employed while generating patterns from the scanned images. The proposed methods have been tested on prominent benchmark handwritten numeral datasets and have achieved remarkable recognition accuracies. The achieved recognition accuracies are found better than reported recognition accuracies of prominent existing methods; and such outperformance mounted proposed methods as better recognition systems. Moreover, CNN's performance improvement due to use of generated patterns has also been clearly identified from the presented experimental results

    Exploiting Features From Triangle Geometry For Digit Recognition

    Get PDF
    Triangle is a basic geometry. There are six type of triangle, but scalene triangle was chosen to be used in this research that based on coordinates of corners generated by our proposed algorithm. In this paper, nine features are proposed where six features were derived from coordinates and sides of triangle. Another three features are angle of corners. After features are identified, image will be zoned into 25 zones. The zoning processes are based on Cartesian plan, Vertical and Horizontal zones. From the zoning, from nine features will become 225 features. The features proposed will be used to HODA, MNIST, IFHCDB and BANGLA datasets. Experiments will be conducted using supervised learning that are Support Vector Machine (SVM) and Multi-layer Perceptron (MLP). Results from the experiments will be evaluated with different Cost (c) for the SVM and Learning Rate (LR) for the MLP. Then, the result will be compared to state of the art by other researches

    SEVERAL METHODS OF FEATURE EXTRACTION TO HELP IN OPTICAL CHARACTER RECOGNITION

    Get PDF
    An Optical Character Recognition (OCR) consists of three bold steps namely Preprocessing, Feature extraction, Classification. Methods of Feature extraction yield feature vectors based on which the classification of a testing pattern is executed. The paper aims at proposing some  methods of feature extraction that may go a long way to recognize a Bengali numeral or character. Pixel Ex-OR Method presents a digital gating (Ex-OR) technique to extract the information in an image. Two successive elements of a row in image matrix have been Ex-ORed and the output is again Ex-ORed with the next element.  Alphabetical coding codes a binary character image by means of letters of English alphabet. Directional features find gradient information using Sobel Masks to make position of stroke clear in an image. The features have been derived in eight standard directions and then these eight feature vectors are merged into four sets of features to reduce the system complexity and hence processing time is saved considerably. These features will help develop a Bengali numeral recognition system
    corecore