180 research outputs found

    Two-dimensional penalized signal regression for hand written digit recognition

    Get PDF
    Many attempts have been made to achieve successful recognition of handwritten digits. We report our results of using statistical method on handwritten digit recognition. A digitized handwritten numeral can be represented by an image with grayscales. The image includes features that are mapped into two-dimensional space with row and column coordinates. Based on this structure, two-dimensional penalized signal logistic regression (PSR) is applied to the recognition of handwritten digits. The data set is taken from the USPS zip code database that contains 7219 training images and 2007 test images. All the images have been deslanted and normalized into 16 x 16 pixels with various grayscales. The PSR method constructs a coefficient surface using a rich two-dimensional tensor product B-splines basis, so that the surface is more flexible than needed. We then penalize roughness of the coefficient surface with difference penalties on each coefficient associate with the rows and columns of the tensor product B-splines. The optimal penalty weight is found in several minutes of iterative operations. A competitive overall recognition error rate of 8.97% on the test data set was achieved. We will also review an artificial neural network approach for comparison. By using PSR, it requires neither long learning time nor large memory resources. Another advantage of the PSR method is that our results are obtained on the original USPS data set without any further image preprocessing. We also found that PSR algorithm was very capable to cope with high diversity and variation that were two major features of handwritten digits

    Recognition of off-line handwritten cursive text

    Get PDF
    The author presents novel algorithms to design unconstrained handwriting recognition systems organized in three parts: In Part One, novel algorithms are presented for processing of Arabic text prior to recognition. Algorithms are described to convert a thinned image of a stroke to a straight line approximation. Novel heuristic algorithms and novel theorems are presented to determine start and end vertices of an off-line image of a stroke. A straight line approximation of an off-line stroke is converted to a one-dimensional representation by a novel algorithm which aims to recover the original sequence of writing. The resulting ordering of the stroke segments is a suitable preprocessed representation for subsequent handwriting recognition algorithms as it helps to segment the stroke. The algorithm was tested against one data set of isolated handwritten characters and another data set of cursive handwriting, each provided by 20 subjects, and has been 91.9% and 91.8% successful for these two data sets, respectively. In Part Two, an entirely novel fuzzy set-sequential machine character recognition system is presented. Fuzzy sequential machines are defined to work as recognizers of handwritten strokes. An algorithm to obtain a deterministic fuzzy sequential machine from a stroke representation, that is capable of recognizing that stroke and its variants, is presented. An algorithm is developed to merge two fuzzy machines into one machine. The learning algorithm is a combination of many described algorithms. The system was tested against isolated handwritten characters provided by 20 subjects resulting in 95.8% recognition rate which is encouraging and shows that the system is highly flexible in dealing with shape and size variations. In Part Three, also an entirely novel text recognition system, capable of recognizing off-line handwritten Arabic cursive text having a high variability is presented. This system is an extension of the above recognition system. Tokens are extracted from a onedimensional representation of a stroke. Fuzzy sequential machines are defined to work as recognizers of tokens. It is shown how to obtain a deterministic fuzzy sequential machine from a token representation that is capable'of recognizing that token and its variants. An algorithm for token learning is presented. The tokens of a stroke are re-combined to meaningful strings of tokens. Algorithms to recognize and learn token strings are described. The. recognition stage uses algorithms of the learning stage. The process of extracting the best set of basic shapes which represent the best set of token strings that constitute an unknown stroke is described. A method is developed to extract lines from pages of handwritten text, arrange main strokes of extracted lines in the same order as they were written, and present secondary strokes to main strokes. Presented secondary strokes are combined with basic shapes to obtain the final characters by formulating and solving assignment problems for this purpose. Some secondary strokes which remain unassigned are individually manipulated. The system was tested against the handwritings of 20 subjects yielding overall subword and character recognition rates of 55.4% and 51.1%, respectively

    Feature Extraction Methods for Character Recognition

    Get PDF
    Not Include

    Advances in Character Recognition

    Get PDF
    This book presents advances in character recognition, and it consists of 12 chapters that cover wide range of topics on different aspects of character recognition. Hopefully, this book will serve as a reference source for academic research, for professionals working in the character recognition field and for all interested in the subject

    Classification of Arabic Autograph as Genuine ‎And Forged through a Combination of New ‎Attribute Extraction Techniques

    Get PDF
    تقترح هذه الدراسة إطارا جديدا لتقنية التحقق من التوقيع العربي. وهو يستخلص بعض السمات الديناميكية للتمييز بين التوقيعات المزورة والحقيقية. لهذا الغرض، يستخدم هذا الإطار التكيف وضعية النافذة لاستخراج تفرد من الموقعين في التوقيع بخط اليد والخصائص المحددة من الموقعين. وبناء على هذا الإطار، تقسم التوقيعات العربية أولا إلى نوافذ 14 × 14؛ كل جزء واسع بما فيه الكفاية لإدخال معلومات وافية عن أنماط الموقعين وصغيرة بما فيه الكفاية للسماح بالمعالجة السريعة. ثم، تم اقتراح نوعين من الميزات على أساس تحويل جيب التمام المنفصل، تحويل المويجة المنفصلة لاستخلاص الميزات من المنطقة ذات الاهتمام. وأخيرا، يتم اختيار شجرة القرار لتصنيف التوقيعات باستخدام الميزات المذكورة كمدخلات لها. وتجرى التقييمات على التوقيعات العربية. وكانت النتائج مشجعة جدا مع معدل تحقق 99.75٪ لاختيار سلسلة من للتوقيعات المزورة والحقيقية للتوقيعات العربية التي تفوقت بشكل ملحوظ على أحدث الأعمال في هذا المجالThis study proposes a new framework for an Arabic autograph verification technique. It extracts certain dynamic attributes to distinguish between forged and genuine signatures. For this aim, this framework uses Adaptive Window Positioning to extract the uniqueness of signers in handwritten signatures and the specific characteristics of signers. Based on this framework, Arabic autograph are first divided into 14X14 windows; each fragment is wide enough to include sufficient information about signers’ styles and small enough to allow fast processing. Then, two types of fused attributes based on Discrete Cosine Transform and Discrete Wavelet Transform of region of interest have been proposed for attributes extraction. Finally, the Decision Tree is chosen to classify the autographs using the previous attributes as its input. The evaluations are carried out on the Arabic autograph. The results are very encouraging with verification rate 99.75% for sequential selection of forged and genuine autographs for Arabic autograph that significantly outperformed the most recent work in this fiel

    Using generative models for handwritten digit recognition

    Get PDF
    We describe a method of recognizing handwritten digits by fitting generative models that are built from deformable B-splines with Gaussian ``ink generators'' spaced along the length of the spline. The splines are adjusted using a novel elastic matching procedure based on the Expectation Maximization (EM) algorithm that maximizes the likelihood of the model generating the data. This approach has many advantages. (1) After identifying the model most likely to have generated the data, the system not only produces a classification of the digit but also a rich description of the instantiation parameters which can yield information such as the writing style. (2) During the process of explaining the image, generative models can perform recognition driven segmentation. (3) The method involves a relatively small number of parameters and hence training is relatively easy and fast. (4) Unlike many other recognition schemes it does not rely on some form of pre-normalization of input images, but can handle arbitrary scalings, translations and a limited degree of image rotation. We have demonstrated our method of fitting models to images does not get trapped in poor local minima. The main disadvantage of the method is it requires much more computation than more standard OCR techniques
    corecore