
5 research outputs found

A fuzzy approach to segment touching characters

Author: AIRO' FARULLA Giuseppe
Murru Nadir
Rossini Rosaria
Publication venue: 'Elsevier BV'
Publication date: 01/01/2017

PORTO@iris (Publications Open Repository TOrino - Politecnico di Torino)

Institutional Research Information System University of Turin

Optical character recognition for checkbox detection

Author: Istle John Michael
Publication venue: Digital Scholarship@UNLV
Publication date: 01/01/2004

Optical character recognition is the branch of computer science that involves reading text from paper and translating the images into a format that computers can manipulate. There are many algorithms for finding letters and numbers; however, checkboxes are often overlooked and are very difficult to detect. Locating checkboxes and determining whether they are checked or unchecked is very useful when processing forms. Detection is difficult because there are so many ways a person can mark a checkbox. This thesis describes a new algorithm for detecting checkboxes. Before checkboxes can be searched for, certain preprocessing algorithms need to be performed on the form. The preprocessing steps ensure that the strokes that inscribe characters are one pixel wide. Not all checkmarks are drawn inside the box. Once a box is found, its coordinates are saved for further analysis.

University of Nevada, Las Vegas Repository

Document Image Analysis Techniques for Handwritten Text Segmentation, Document Image Rectification and Digital Collation

Author: Salvi Dhaval
Publication venue: Scholar Commons
Publication date: 09/08/2014

Document image analysis comprises all the algorithms and techniques that are utilized to convert an image of a document to a computer-readable description. In this work we focus on three such techniques, namely (1) handwritten text segmentation, (2) document image rectification, and (3) digital collation. Offline handwritten text recognition is a very challenging problem. Aside from the large variation among handwriting styles, neighboring characters within a word are usually connected, and we may need to segment a word into individual characters for accurate character recognition. Many existing methods achieve text segmentation by evaluating the local stroke geometry and imposing constraints on the size of each resulting character, such as the character width, height and aspect ratio. These constraints are well suited for printed texts, but may not hold for handwritten texts. Other methods apply a holistic approach by using a set of lexicons to guide and correct the segmentation and recognition. This approach may fail when the domain lexicon is insufficient. In the first part of this work, we present a new global non-holistic method for handwritten text segmentation, which does not make any limiting assumptions on the character size and the number of characters in a word. We conduct experiments on real images of handwritten texts taken from the IAM handwriting database and compare the performance of the presented method against an existing text segmentation algorithm that uses dynamic programming, achieving a significant performance improvement. Digitization of document images using OCR-based systems is adversely affected if the image of the document contains distortion (warping). Often, costly and precisely calibrated special hardware such as stereo cameras and laser scanners is used to infer the 3D model of the distorted image, which is then used to remove the distortion.
Recent methods focus on creating a 3D shape model based on 2D distortion information obtained from the document image. The performance of these methods is highly dependent on estimating an accurate 2D distortion grid. These methods often affix the 2D distortion grid lines to the text lines and, as such, may suffer in the presence of unreliable textual cues due to preprocessing steps such as binarization. In the domain of printed document images, the white space between the text lines carries as much information about the 2D distortion as the text lines themselves. Based on this intuitive idea, in the second part of our work we build a 2D distortion grid from white space lines, which can be used to rectify a printed document image by a dewarping algorithm. We compare our presented method against a state-of-the-art 2D distortion grid construction method and obtain better results. We also present qualitative and quantitative evaluations for the presented method. Collation of texts and images is an indispensable but labor-intensive step in the study of print materials. It is a methodology often used by textual scholars when the manuscript of the text does not exist. Although various methods and machines have been designed to assist in this labor, it remains an expensive and time-consuming process, often requiring travel to distant repositories for the painstaking visual examination of multiple original copies. Efforts to digitize collation have so far depended on first transcribing the texts to be compared, thus introducing into the process more labor and expense, and also more potential error. Digital collation will instead automate the first stages of collation directly from the document images of the original texts, thereby speeding the process of comparison. We describe such a novel framework for digital collation in the third part of this work and provide qualitative results.

Scholar Commons - Institutional Repository of the University of South Carolina

テガキジョウホウノリカツヨウノタメノニンシキトキョウユウニカンスルケンキュウ [Research on recognition and sharing for the utilization of handwritten information]

Author: イケダヒサシ
池田尚司
Publication venue: 'Springer Publishing Company'

Osaka University Knowledge Archive

Extracção automática de dados georreferenciados a partir dos planos cadastrais portugueses [Automatic extraction of georeferenced data from Portuguese cadastral maps]

Author: Candeias Tiago Miguel Pereira
Publication date: 01/01/2009

Doctoral thesis, Electronic Engineering and Computing, Universidade do Algarve, 2009. Image recognition algorithms are used to extract information from digitized images automatically. Systems designed to convert paper documents into meaningful vectorial representations are numerous nowadays, and have been constantly improved over the last two decades. However, none of these systems seems able to provide satisfactory results when it comes to converting complex documents such as technical drawings: usually the semantics of the problem are not considered, and post-processing costs remain high. This dissertation presents a set of techniques that greatly simplifies the automatic extraction of cadastral entities from Portuguese cadastral maps. The validity of the approach is illustrated by designing a prototype system that joins all recognition algorithms and validates all information. Fundação para a Ciência e Tecnologia (FCT)

Sapientia