Search CORE

30 research outputs found

Feature Extraction Methods for Character Recognition

Author: Yampolskiy Roman V
Publication venue: RIT Scholar Works
Publication date: 01/01/2004
Field of study

Not Include

RIT Scholar Works

Evaluation of Pattern Classifiers for Fingerprint and OCR Applications

Author: Blue J.L.
Candela G.T.
Chellappa Rama
Grother P.J.
Wilson C.L.
Publication venue
Publication date: 15/10/1998
Field of study

(Also cross-referenced as CAR-TR-691) In this paper we evaluate the classification accuracy of four statistical and three neural network classifiers for two image based pattern classification problems. These are fingerprint classification and optical character recognition (OCR) for isolated handprinted digits. The evaluation results reported here should be useful for designers of practical systems for these two important commercial applications. For the OCR problem, the Karhunen-Loeve (K-L) transform of the images is used to generate the inp ut feature set. Similarly for the fingerprint problem, the K-L transform of the ridge directions is used to generate the input feature set. The statistical classifiers used were Euclidean minimum distance, quadratic minimum distance, normal, and knearest neighbor. The neural network classifiers used were multilayer perceptron, radial basis function, and probabilistic. The OCR data consisted of 7,480 digit images for training and 23,140 digit images for testing. The fingerprint data consisted of 9,000 trai ning and 2,000 testing images. In addition to evaluation for accuracy, the multilayer perceptron and radial basis function networks were evaluated for size and generalization capability. For the evaluated datasets the best accuracy obtained for either pro blem was provided by the probabilistic neural network, where the minimum classification error was 2.5% for OCR and 7.2% for fingerprints

Digital Repository at the University of Maryland

A novel approach to handwritten character recognition

Author: Clarke Eddie
Publication venue
Publication date: 01/01/1995
Field of study

A number of new techniques and approaches for off-line handwritten character recognition are presented which individually make significant advancements in the field. First. an outline-based vectorization algorithm is described which gives improved accuracy in producing vector representations of the pen strokes used to draw characters. Later. Vectorization and other types of preprocessing are criticized and an approach to recognition is suggested which avoids separate preprocessing stages by incorporating them into later stages. Apart from the increased speed of this approach. it allows more effective alteration of the character images since more is known about them at the later stages. It also allows the possibility of alterations being corrected if they are initially detrimental to recognition. A new feature measurement. the Radial Distance/Sector Area feature. is presented which is highly robust. tolerant to noise. distortion and style variation. and gives high accuracy results when used for training and testing in a statistical or neural classifier. A very powerful classifier is therefore obtained for recognizing correctly segmented characters. The segmentation task is explored in a simple system of integrated over-segmentation. Character classification and approximate dictionary checking. This can be extended to a full system for handprinted word recognition. In addition to the advancements made by these methods. a powerful new approach to handwritten character recognition is proposed as a direction for future research. This proposal combines the ideas and techniques developed in this thesis in a hierarchical network of classifier modules to achieve context-sensitive. off-line recognition of handwritten text. A new type of "intelligent" feedback is used to direct the search to contextually sensible classifications. A powerful adaptive segmentation system is proposed which. when used as the bottom layer in the hierarchical network. allows initially incorrect segmentations to be adjusted according to the hypotheses of the higher level context modules

Nottingham eTheses

The effectiveness of features in pattern recognition

Author: Ray S.
Ray S.
Publication venue: Department of Electrical Engineering, Imperial College London
Publication date: 01/01/1985
Field of study

Imperial Users onl

Spiral - Imperial College Digital Repository

A novel approach to handwritten character recognition

Author: Clarke Eddie
Publication venue
Publication date
Field of study

Nottingham ePrints

A comprehensive survey of handwritten document benchmarks: structure, usage and evaluation

Author
Publication venue: Springer
Publication date: 24/12/2015
Field of study

Springer - Publisher Connector

Off-line Thai handwriting recognition in legal amount

Author: Chatwiriya Watchara
Publication venue: The Research Repository @ WVU
Publication date: 01/12/2002
Field of study

Thai handwriting in legal amounts is a challenging problem and a new field in the area of handwriting recognition research. The focus of this thesis is to implement Thai handwriting recognition system. A preliminary data set of Thai handwriting in legal amounts is designed. The samples in the data set are characters and words of the Thai legal amounts and a set of legal amounts phrases collected from a number of native Thai volunteers. At the preprocessing and recognition process, techniques are introduced to improve the characters recognition rates. The characters are divided into two smaller subgroups by their writing levels named body and high groups. The recognition rates of both groups are increased based on their distinguished features. The writing level separation algorithms are implemented using the size and position of characters. Empirical experiments are set to test the best combination of the feature to increase the recognition rates. Traditional recognition systems are modified to give the accumulative top-3 ranked answers to cover the possible character classes. At the postprocessing process level, the lexicon matching algorithms are implemented to match the ranked characters with the legal amount words. These matched words are joined together to form possible choices of amounts. These amounts will have their syntax checked in the last stage. Several syntax violations are caused by consequence faulty character segmentation and recognition resulting from connecting or broken characters. The anomaly in handwriting caused by these characters are mainly detected by their size and shape. During the recovery process, the possible word boundary patterns can be pre-defined and used to segment the hypothesis words. These words are identified by the word recognition and the results are joined with previously matched words to form the full amounts and checked by the syntax rules again. From 154 amounts written by 10 writers, the rejection rate is 14.9 percent with the recovery processes. The recognition rate for the accepted amount is 100 percent

The Research Repository @ WVU (West Virginia University)

A comprehensive survey of handwritten document benchmarks: structure, usage and evaluation

Author: A Bensefia
A Fischer
A Giménez
A Schlapbach
A Shivram
A-HM R
A-L Bianne-Bernard
Ahsen Raza
AK Jain
B Verma
B Zhu
C-L Liu
Chawki Djeddi
CO Freitas
D Bertolini
D-H Wang
E Kavallieratou
E Kussul
EF Can
F H-C
F Lauer
F Zamora-Martanez
GE Hinton
GX Tan
H Bunke
H El-Abed
H El-Abed
H Liu
H Yamada
I Siddiqi
Imran Siddiqi
JJ Hull
K Seo
Khurram Khurshid
L C-L
L Jin
L Xu
L Z
M Bulacu
M Liwicki
M Nakagawa
M Nakagawa
M Shi
MA Mohamed
MN Abdi
N Serrano
NB Amara
Q-F Wang
R Saabni
Raashid Hussain
S Al-Maadeed
S Gunter
SJ Smith
T-H Su
TM Ha
U Bhattacharya
UV Marti
V Frinken
Y Al-Ohali
Y Kessentini
Y LeCun
Y Shao
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

On deformable models for visual pattern recognition

Author: Amini
Amini
Amit
Ballard
Blake
Burr
Burr
Caselles
Caselles
Chen
Chesnaud
Cheung
Chou
Christensen
Christmas
Cohen
Cohen
Cooper
Cootes
DeCarlo
DeCarlo
Dempster
Dit-Yan Yeung
Dubuisson
Durbin
Fischler
Fok
Fua
Fua
Garrido
Geiger
Grenander
Grenander
Grzeszczuk
Huber
Jain
Jain
Jain
Kass
Kita
Klein
Kumar
Kwok-Wai Cheung
Lades
Lai
Lanitis
Leymarie
Liou
Luettin
MacKay
Malladi
McInerney
Miller
Nastar
Olstad
Pentland
Poggio
Revow
Rigoutsos
Roland T. Chin
Ronfard
Sclaroff
Senior
Shahraray
Staib
Subrahmonia
Terzopoulos
Terzopoulos
Tikhonov
Ullman
Wakahara
Widrow
Williams
Williams
Wiskott
Witkin
Worring
Würtz
Xie
Xu
Young
Yuille
Zhu
Publication venue: 'Elsevier BV'
Publication date
Field of study

Crossref

A novel image matching approach for word spotting

Author: Shah Muhammad Ismail
Publication venue
Publication date: 01/01/2009
Field of study

Word spotting has been adopted and used by various researchers as a complementary technique to Optical Character Recognition for document analysis and retrieval. The various applications of word spotting include document indexing, image retrieval and information filtering. The important factors in word spotting techniques are pre-processing, selection and extraction of proper features and image matching algorithms. The Correlation Similarity Measure (CORR) algorithm is considered to be a faster matching algorithm, originally defined for finding similarities between binary patterns. In the word spotting literature the CORR algorithm has been used successfully to compare the GSC binary features extracted from binary word images, i.e., Gradient, Structural and Concavity (GSC) features. However, the problem with this approach is that binarization of images leads to a loss of very useful information. Furthermore, before extracting GSC binary features the word images must be skew corrected and slant normalized, which is not only difficult but in some cases impossible in Arabic and modified Arabic scripts. We present a new approach in which the Correlation Similarity Measure (CORR) algorithm has been used innovatively to compare Gray-scale word images. In this approach, binarization of images, skew correction and slant normalization of word images are not required at all. The various features, i.e., projection profiles, word profiles and transitional features are extracted from the Gray-scale word images and converted into their binary equivalents, which are compared via CORR algorithm with greater speed and higher accuracy. The experiments have been conducted on Gray-scale versions of newly created handwritten databases of Pashto and Dari languages, written in modified Arabic scripts. For each of these languages we have used 4599 words relating to 21 different word classes collected from 219 writers. The average precision rates achieved for Pashto and Dari languages were 93.18 % and 93.75 %, respectively. The time taken for matching a pair of images was 1.43 milli-seconds. In addition, we will present the handwritten databases for two well-known Indo- Iranian languages, i.e., Pashto and Dari languages. These are large databases which contain six types of data, i.e., Dates, Isolated Digits, Numeral Strings, Isolated Characters, Different Words and Special Symbols, written by native speakers of the corresponding languages

Concordia University Research Repository