The Application Of Word Image Matching In Text Recognition

By Siamak Khoubyari (Chairman: Dr. Jonathan J. Hull; Member: Dr. Sargur N. Srihari)

Abstract

Printed text recognition usually involves recognizing individual characters and assembling the results to recognize words and sentences. However, the performance of conventional character recognition systems tends to suffer in the presence of moderate levels of degradation in the text. A method is proposed that uses equivalence among frequent word images to derive hypotheses for each word using the available language statistics. Word images are clustered to determine equivalency. The attributes of the clusters, as well as the relationships among them, are then matched with the same characteristics for the words in the language. The method requires no explicit training and is fairly tolerant of image degradation. Results for several sample sizes are reported.
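
The matching step described in the abstract can be illustrated with a minimal sketch. It is not the thesis's actual algorithm: it assumes the word images have already been clustered, that each cluster carries an estimated character count (a hypothetical attribute, for example derived from image width), and that the language statistics are a plain table of relative word frequencies. The function name `rank_hypotheses` and all of its parameters are illustrative only.

```python
from collections import Counter


def rank_hypotheses(cluster_labels, cluster_lengths, lexicon_freq, top_k=3):
    """Rank candidate words for each cluster of equivalent word images.

    cluster_labels:  one cluster id per word image, in reading order
    cluster_lengths: assumed character count per cluster (hypothetical attribute)
    lexicon_freq:    relative frequency of each word in the language sample
    """
    total = len(cluster_labels)
    counts = Counter(cluster_labels)
    hypotheses = {}
    for cluster, count in counts.items():
        observed = count / total  # relative frequency of this cluster in the text
        # Keep only words of matching length, scored by frequency disagreement.
        candidates = sorted(
            (abs(observed - freq), word)
            for word, freq in lexicon_freq.items()
            if len(word) == cluster_lengths[cluster]
        )
        hypotheses[cluster] = [word for _, word in candidates[:top_k]]
    return hypotheses


# Toy data: eight word images grouped into four clusters.
labels = ["A", "B", "A", "C", "A", "B", "D", "A"]
lengths = {"A": 3, "B": 2, "C": 5, "D": 2}
lexicon = {"the": 0.5, "and": 0.2, "of": 0.25, "to": 0.1, "words": 0.1}

for cluster, words in sorted(rank_hypotheses(labels, lengths, lexicon).items()):
    print(cluster, words)
```

In this toy setup the most frequent cluster is matched to the most frequent word of the same length. A fuller treatment would also use the relationships among clusters that the abstract mentions, which this sketch leaves out.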

Year: 2007
OAI identifier: oai:CiteSeerX.psu:10.1.1.32.5173
Provided by: CiteSeerX
Sorry, we are unable to provide the full text but you may find it at the following location(s):
  • http://citeseerx.ist.psu.edu/v... (external link)
  • ftp://ftp.cs.buffalo.edu/pub/t... (external link)