Search CORE

33,337 research outputs found

Recommended from our members

Use of colour for hand-filled form analysis and recognition

Author: Allen T
Sherkat N
Wong WS
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 21/07/2005
Field of study

Colour information in form analysis is currently under utilised. As technology has advanced and computing costs have reduced, the processing of forms in colour has now become practicable. This paper describes a novel colour-based approach to the extraction of filled data from colour form images. Images are first quantised to reduce the colour complexity and data is extracted by examining the colour characteristics of the images. The improved performance of the proposed method has been verified by comparing the processing time, recognition rate, extraction precision and recall rate to that of an equivalent black and white system

Nottingham Trent Institutional Repository (IRep)

Recovering Homography from Camera Captured Documents using Convolutional Neural Networks

Author: Dejan M. Petrović
Gerdt Müller
Marie Dahlström
Martin Lersch
Matti Siika-aho
Oskar Bengtsson
Piotr Chylenski
Svein Jarle Horn
Vincent G. H. Eijsink
Publication venue
Publication date: 01/01/2017
Field of study

Removing perspective distortion from hand held camera captured document images is one of the primitive tasks in document analysis, but unfortunately, no such method exists that can reliably remove the perspective distortion from document images automatically. In this paper, we propose a convolutional neural network based method for recovering homography from hand-held camera captured documents. Our proposed method works independent of document's underlying content and is trained end-to-end in a fully automatic way. Specifically, this paper makes following three contributions: Firstly, we introduce a large scale synthetic dataset for recovering homography from documents images captured under different geometric and photometric transformations; secondly, we show that a generic convolutional neural network based architecture can be successfully used for regressing the corners positions of documents captured under wild settings; thirdly, we show that L1 loss can be reliably used for corners regression. Our proposed method gives state-of-the-art performance on the tested datasets, and has potential to become an integral part of document analysis pipeline.Comment: 10 pages, 8 figure

arXiv.org e-Print Archive

Brage NMBU

Directory of Open Access Journals

VTT Research System

FigShare

No-reference image quality assessment through the von Mises distribution

Author: Ciocca
Claasen
Cohen
Conrad
Ferzli
Gabarda
Gabriel Cristóbal
Jacobson
Ponomarenko
Redi
Salvador Gabarda
Snelson
Stankovic
Stephens
Valdecasas
von Mises
Wigner
Williams
Zhu
Zyczkowski
Publication venue: 'The Optical Society'
Publication date: 14/02/2012
Field of study

An innovative way of calculating the von Mises distribution (VMD) of image entropy is introduced in this paper. The VMD's concentration parameter and some fitness parameter that will be later defined, have been analyzed in the experimental part for determining their suitability as a image quality assessment measure in some particular distortions such as Gaussian blur or additive Gaussian noise. To achieve such measure, the local R\'{e}nyi entropy is calculated in four equally spaced orientations and used to determine the parameters of the von Mises distribution of the image entropy. Considering contextual images, experimental results after applying this model show that the best-in-focus noise-free images are associated with the highest values for the von Mises distribution concentration parameter and the highest approximation of image data to the von Mises distribution model. Our defined von Misses fitness parameter experimentally appears also as a suitable no-reference image quality assessment indicator for no-contextual images.Comment: 29 pages, 11 figure

arXiv.org e-Print Archive

Crossref

Digital.CSIC

Synthetic Data and Artificial Neural Networks for Natural Scene Text Recognition

Author: Jaderberg Max
Simonyan Karen
Vedaldi Andrea
Zisserman Andrew
Publication venue
Publication date: 01/01/2014
Field of study

In this work we present a framework for the recognition of natural scene text. Our framework does not require any human-labelled data, and performs word recognition on the whole image holistically, departing from the character based recognition systems of the past. The deep neural network models at the centre of this framework are trained solely on data produced by a synthetic text generation engine -- synthetic data that is highly realistic and sufficient to replace real data, giving us infinite amounts of training data. This excess of data exposes new possibilities for word recognition models, and here we consider three models, each one "reading" words in a different way: via 90k-way dictionary encoding, character sequence encoding, and bag-of-N-grams encoding. In the scenarios of language based and completely unconstrained text recognition we greatly improve upon state-of-the-art performance on standard datasets, using our fast, simple machinery and requiring zero data-acquisition costs

arXiv.org e-Print Archive

Oxford University Research Archive