Search CORE

2,249 research outputs found

Recommended from our members

Use of colour for hand-filled form analysis and recognition

Author: Allen T
Sherkat N
Wong WS
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 21/07/2005
Field of study

Colour information in form analysis is currently under utilised. As technology has advanced and computing costs have reduced, the processing of forms in colour has now become practicable. This paper describes a novel colour-based approach to the extraction of filled data from colour form images. Images are first quantised to reduce the colour complexity and data is extracted by examining the colour characteristics of the images. The improved performance of the proposed method has been verified by comparing the processing time, recognition rate, extraction precision and recall rate to that of an equivalent black and white system

Nottingham Trent Institutional Repository (IRep)

Recovering Homography from Camera Captured Documents using Convolutional Neural Networks

Author: Dejan M. Petrović
Gerdt Müller
Marie Dahlström
Martin Lersch
Matti Siika-aho
Oskar Bengtsson
Piotr Chylenski
Svein Jarle Horn
Vincent G. H. Eijsink
Publication venue
Publication date: 01/01/2017
Field of study

Removing perspective distortion from hand held camera captured document images is one of the primitive tasks in document analysis, but unfortunately, no such method exists that can reliably remove the perspective distortion from document images automatically. In this paper, we propose a convolutional neural network based method for recovering homography from hand-held camera captured documents. Our proposed method works independent of document's underlying content and is trained end-to-end in a fully automatic way. Specifically, this paper makes following three contributions: Firstly, we introduce a large scale synthetic dataset for recovering homography from documents images captured under different geometric and photometric transformations; secondly, we show that a generic convolutional neural network based architecture can be successfully used for regressing the corners positions of documents captured under wild settings; thirdly, we show that L1 loss can be reliably used for corners regression. Our proposed method gives state-of-the-art performance on the tested datasets, and has potential to become an integral part of document analysis pipeline.Comment: 10 pages, 8 figure

arXiv.org e-Print Archive

Brage NMBU

Directory of Open Access Journals

VTT Research System

FigShare

Adaptive restoration of text images containing touching and broken characters

Author: Kalluri Venugopal
Publication venue: Digital Scholarship@UNLV
Publication date: 01/01/1995
Field of study

For document processing systems, automated data entry is generally performed by optical character recognition (OCR) systems. To make these systems practical, reliable OCR systems are essential. However, distortions in document images cause character recognition errors, thereby, reducing the accuracy of OCR systems. In document images, most OCR errors are caused by broken and touching characters. This thesis presents an adaptive system to restore text images distorted by touching and broken characters. The adaptive system uses the distorted text image and the output from an OCR system to generate the training character image. Using the training image and the distorted image, the system trains an adaptive restoration filter and then uses the trained filter to restore the distorted text image. To demonstrate the performance of this technique, it was applied to several distorted images containing touching or broken characters. The results show that this technique can improve both pixel and OCR accuracy of distorted text images containing touching or broken characters

University of Nevada, Las Vegas Repository

Enhancement of Historical Printed Document Images by Combining Total Variation Regularization and Non-Local Means Filtering

Author: Barney Smith Elisa H.
Darbon Jérôme
Likforman-Sulem Laurence
Publication venue: 'IUScholarWorks'
Publication date: 01/04/2011
Field of study

This paper proposes a novel method for document enhancement which combines two recent powerful noise-reduction steps. The first step is based on the total variation framework. It flattens background grey-levels and produces an intermediate image where background noise is considerably reduced. This image is used as a mask to produce an image with a cleaner background while keeping character details. The second step is applied to the cleaner image and consists of a filter based on non-local means: character edges are smoothed by searching for similar patch images in pixel neighborhoods. The document images to be enhanced are real historical printed documents from several periods which include several defects in their background and on character edges. These defects result from scanning, paper aging and bleed- through. The proposed method enhances document images by combining the total variation and the non-local means techniques in order to improve OCR recognition. The method is shown to be more powerful than when these techniques are used alone and than other enhancement methods

Boise State University - ScholarWorks