Feedback Based Architecture for Reading Check Courtesy Amounts
A number of large-scale applications continue to rely heavily on paper as the dominant medium, on either
an intra-organization or an inter-organization basis; check processing is one such paper-intensive
application. In many countries, the value of each check is read by human eyes before the check is
physically transported, in stages, from the point where it was presented to the branch of the bank that
issued the blank check to the account holder concerned. This manual reading of each check involves
significant time and cost. In this research, a new approach is introduced to read the numerical amount
field on the check, also known as the courtesy amount field. In check processing, the segmentation of
unconstrained strings into individual digits is a challenging task because one needs to accommodate
special cases involving connected or overlapping digits, broken digits, and digits physically connected
to a stroke that belongs to a neighboring digit. The system described in this paper involves three
stages: segmentation, normalization, and the recognition of each character using a neural network
classifier, with results better than many other methods in the literature.
Handwritten Bank Check Recognition of Courtesy Amounts
In spite of the rapid evolution of electronic techniques, a number of large-scale applications continue to rely on the use
of paper as the dominant medium. This is especially true for processing of bank checks. This paper examines the
issue of reading the numerical amount field. In the case of checks, the segmentation of unconstrained strings into
individual digits is a challenging task because of connected and overlapping digits, broken digits, and digits that are
physically connected to pieces of strokes from neighboring digits. The proposed architecture involves four stages:
segmentation of the string into individual digits, normalization, recognition of each character using a neural network
classifier, and syntactic verification. Overall, this paper highlights the importance of employing a hybrid architecture
that incorporates multiple approaches to provide high recognition rates.
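To illustrate why touching digits make segmentation hard, here is a minimal sketch of a baseline technique, splitting at blank columns of a vertical projection profile. This is not the architecture from the paper; the function name `segment_by_blank_columns` and the toy image are assumptions made for illustration only.

```python
def segment_by_blank_columns(image):
    """Split a binary image (list of rows, 1 = ink) at all-background columns.

    Returns a list of (start, end) column ranges, end exclusive.
    Illustrative baseline only, not the method described in the abstract.
    """
    if not image:
        return []
    width = len(image[0])
    # Vertical projection: number of ink pixels in each column.
    projection = [sum(row[x] for row in image) for x in range(width)]
    segments, start = [], None
    for x, ink in enumerate(projection):
        if ink and start is None:
            start = x                    # a run of ink columns begins
        elif not ink and start is not None:
            segments.append((start, x))  # the run ended at a blank column
            start = None
    if start is not None:
        segments.append((start, width))  # run reaches the right edge
    return segments


# Hypothetical toy image: three separated blobs of ink.
separated = [
    [1, 1, 0, 0, 1, 0, 1, 1, 1],
    [1, 0, 0, 0, 1, 0, 0, 1, 0],
]
# Same blobs, but a stray stroke bridges the last gap (column 5).
touching = [
    [1, 1, 0, 0, 1, 1, 1, 1, 1],
    [1, 0, 0, 0, 1, 0, 0, 1, 0],
]
```

On the `separated` image the baseline finds three segments, while on the `touching` image the last two blobs come back as a single segment. That merged case is the kind of connected-digit situation that a dedicated segmentation stage has to handle.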
Recognition of off-line Arabic handwritten dates and numeral strings
In this thesis, we present an automatic recognition system for CENPARMI off-line Arabic handwritten dates collected from Arabic nationalities. This system consists of modules that segment and recognize an Arabic handwritten date image. First, in the segmentation module, the system explicitly segments a date image into a sequence of basic constituents or segments. As a part of this module, a special sub-module was developed to over-segment any constituent that is a candidate for a touching pair. The proposed touching pair segmentation sub-module has been tested on three different datasets of handwritten numeral touching pairs: the CENPARMI Arabic [6], Urdu, and Dari [24] datasets. Final recognition rates of 92.22%, 90.43%, and 86.10% were achieved for Arabic, Urdu, and Dari, respectively. Afterwards, the segments are preprocessed and sent to the classification module. In this stage, feature vectors are extracted and then recognized by an isolated numeral classifier. This recognition system has been tested on five different isolated numeral databases: the CENPARMI Arabic [6], Urdu, Dari [24], Farsi, and Pashto databases, with overall recognition rates of 97.29%, 97.75%, 97.75%, 97.95%, and 98.36%, respectively. Finally, a date post-processing module is developed to improve the recognition results. This post-processing module is used in two different stages. First, in the date stage, it verifies that the segmentation/recognition output represents a valid date image and chooses the best date format to assign to this image. Second, in the sub-field stage, it evaluates the values of the three parts of the date: day, month, and year. Experiments on two different databases of Arabic handwritten dates, the CENPARMI Arabic database [6] and the CENPARMI Arabic Bank Cheques database [7], show encouraging results, with overall recognition rates of 85.05% and 66.49%, respectively.
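The date-stage check described above can be sketched as follows. This is only an illustration under stated assumptions: the thesis does not list its date formats or validation rules, so the three eight-digit layouts in `LAYOUTS` and the function name `valid_layouts` are hypothetical.

```python
from datetime import date

# Hypothetical layouts (an assumption, not taken from the thesis):
# each name maps to the (day, month, year) slices of an 8-digit string.
LAYOUTS = {
    "DDMMYYYY": (slice(0, 2), slice(2, 4), slice(4, 8)),
    "MMDDYYYY": (slice(2, 4), slice(0, 2), slice(4, 8)),
    "YYYYMMDD": (slice(6, 8), slice(4, 6), slice(0, 4)),
}


def valid_layouts(digits):
    """Return {layout name: date} for every layout under which the
    recognized digit string is a real calendar date."""
    if len(digits) != 8 or not digits.isdigit():
        return {}
    results = {}
    for name, (d, m, y) in LAYOUTS.items():
        try:
            # date() raises ValueError for impossible day/month/year values.
            results[name] = date(int(digits[y]), int(digits[m]), int(digits[d]))
        except ValueError:
            continue
    return results
```

A string such as `"31122005"` is valid under only one of these layouts, so the format can be assigned directly. A string such as `"05072005"` is valid under two, and a real system would need further evidence (for example, classifier confidences) to choose between them.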
Recognition-based Approach of Numeral Extraction in Handwritten Chemistry Documents using Contextual Knowledge
This paper presents a complete procedure that uses contextual and syntactic information to identify and recognize amount fields in the table regions of chemistry documents. The proposed method is composed of two main modules. First, a structural analysis based on connected component (CC) dimensions and positions identifies some special symbols and clusters the other CCs into three groups: fragments of characters, isolated characters, or connected characters. Then, specific processing is performed on each group of CCs. Fragments of characters are merged with the nearest character or string using rules based on geometric relationships. The characters are sent to a recognition module to identify the numeral components. For the connected characters, the final decision on the nature of the string (numeric or non-numeric) is based on a global score computed on the full string using the height regularity property and the recognition probabilities of its segmented fragments. Finally, a simple syntactic verification at table row level is conducted in order to correct possible errors. The experimental tests are carried out on real-world chemistry documents provided by our industrial partner eNovalys. The results obtained show the effectiveness of the proposed system in extracting amount fields.
Reliable pattern recognition system with novel semi-supervised learning approach
Over the past decade, there has been considerable progress in the design of statistical machine learning strategies, including Semi-Supervised Learning (SSL) approaches. However, researchers still have difficulties in applying most of these learning strategies when two or more classes overlap, and/or when each class has a bimodal/multimodal distribution. In this thesis, an efficient, robust, and reliable recognition system with a novel SSL scheme has been developed to overcome overlapping problems between two classes and bimodal distribution within each class. This system was based on the nature of category learning and recognition to enhance the system's performance in relevant applications. In the training procedure, besides the supervised learning strategy, the unsupervised learning approach was applied to retrieve the "extra information" that could not be obtained from the images themselves. This approach was very helpful for the classification between two confusing classes. In this SSL scheme, both the training data and the test data were utilized in the final classification. In this thesis, the design of a promising supervised learning model with advanced state-of-the-art technologies is firstly presented, and a novel rejection measurement for verification of rejected samples, namely Linear Discriminant Analysis Measurement (LDAM), is defined. Experiments on CENPARMI's Hindu-Arabic Handwritten Numeral Database, CENPARMI's Numerals Database, and NIST's Numerals Database were conducted in order to evaluate the efficiency of LDAM. Moreover, multiple verification modules, including a Writing Style Verification (WSV) module, have been developed according to four newly defined error categories. The error categorization was based on the different costs of misclassification. 
The WSV module has been developed using the unsupervised learning approach to automatically retrieve the person's writing styles so that the rejected samples can be classified and verified accordingly. As a result, errors on CENPARMI's Hindu-Arabic Handwritten Numeral Database (24,784 training samples, 6,199 testing samples) were reduced drastically from 397 to 59, and the final recognition rate of this HAHNR reached 99.05%, a significantly higher rate than in other experiments on the same database. When the rejection option was applied on this database, the recognition rate, error rate, and reliability were 97.89%, 0.63%, and 99.28%, respectively.
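The reported 99.05% can be checked from the error counts and test-set size given above. The short sketch below assumes no samples are rejected, so every test sample is either correct or an error; the function name `recognition_rate` is an illustrative choice.

```python
def recognition_rate(errors, total):
    """Percentage of correctly recognized samples, assuming no rejections."""
    return 100.0 * (total - errors) / total


TEST_SAMPLES = 6199  # testing samples stated in the abstract

before = recognition_rate(397, TEST_SAMPLES)  # errors before verification
after = recognition_rate(59, TEST_SAMPLES)    # errors after verification

print(f"{before:.2f}% -> {after:.2f}%")  # prints "93.60% -> 99.05%"
```

Under that assumption, the reduction from 397 to 59 errors corresponds to a rise from about 93.60% to the 99.05% quoted in the abstract.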
Deep Learning Based Models for Offline Gurmukhi Handwritten Character and Numeral Recognition
Over the last few years, several researchers have worked on handwritten character recognition and have proposed various techniques to improve the performance of Indic and non-Indic script recognition. Here, a Deep Convolutional Neural Network has been proposed that learns deep features for offline Gurmukhi handwritten character and numeral recognition (HCNR). The proposed network works efficiently for training as well as testing and exhibits good recognition performance. Two primary datasets comprising offline handwritten Gurmukhi characters and Gurmukhi numerals have been employed in the present work. The testing accuracies achieved using the proposed network are 98.5% for characters and 98.6% for numerals.
Signature Recognition System for Student Attendance System in UTP
This paper proposes an off-line signature recognition system for the student attendance
system at Universiti Teknologi PETRONAS (UTP). In the current system, an attendance sheet
is passed around the class and students are required to sign it. Later,
lecturers check the sheet and mark any empty column. However, lecturers
are always busy and seldom have time to check each signature. The proposed system has the
ability to imitate humans' capability of recognizing signatures. Thus, it could help
lecturers in recognizing students' signatures. The system employs artificial neural
networks for the recognition and training processes. It is developed mainly using
Visual Basic 6.0 and involves four basic steps: image acquisition, image
preprocessing, enrolment, and verification. It has two phases, training and
recognition, both of which use an artificial neural network. The system was satisfactory in
all cases where there were two different signatures to be recognized, with a False Rejection
Rate (FRR) of 4% for genuine signatures and a False Acceptance Rate (FAR) of 28% for forged
signatures.
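For readers unfamiliar with the two error measures, the sketch below shows how FRR and FAR are conventionally computed. The abstract gives only the percentages, so the sample counts used here are hypothetical values chosen to reproduce them, not figures from the paper.

```python
def false_rejection_rate(genuine_rejected, genuine_total):
    """Percentage of genuine signatures wrongly rejected."""
    return 100.0 * genuine_rejected / genuine_total


def false_acceptance_rate(forged_accepted, forged_total):
    """Percentage of forged signatures wrongly accepted."""
    return 100.0 * forged_accepted / forged_total


# Hypothetical counts (assumption): 50 genuine and 50 forged test signatures.
frr = false_rejection_rate(genuine_rejected=2, genuine_total=50)
far = false_acceptance_rate(forged_accepted=14, forged_total=50)
```

With these assumed counts, `frr` is 4.0 and `far` is 28.0. A FAR this far above the FRR means the system is much more likely to accept a forgery than to reject a genuine signature.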
Advances in Character Recognition
This book presents advances in character recognition, and it consists of 12 chapters that cover a wide range of topics on different aspects of character recognition. Hopefully, this book will serve as a reference source for academic research, for professionals working in the character recognition field, and for all interested in the subject.