2,249 research outputs found

    Recovering Homography from Camera Captured Documents using Convolutional Neural Networks

    Get PDF
    Removing perspective distortion from hand held camera captured document images is one of the primitive tasks in document analysis, but unfortunately, no such method exists that can reliably remove the perspective distortion from document images automatically. In this paper, we propose a convolutional neural network based method for recovering homography from hand-held camera captured documents. Our proposed method works independent of document's underlying content and is trained end-to-end in a fully automatic way. Specifically, this paper makes following three contributions: Firstly, we introduce a large scale synthetic dataset for recovering homography from documents images captured under different geometric and photometric transformations; secondly, we show that a generic convolutional neural network based architecture can be successfully used for regressing the corners positions of documents captured under wild settings; thirdly, we show that L1 loss can be reliably used for corners regression. Our proposed method gives state-of-the-art performance on the tested datasets, and has potential to become an integral part of document analysis pipeline.Comment: 10 pages, 8 figure

    Adaptive restoration of text images containing touching and broken characters

    Full text link
    For document processing systems, automated data entry is generally performed by optical character recognition (OCR) systems. To make these systems practical, reliable OCR systems are essential. However, distortions in document images cause character recognition errors, thereby, reducing the accuracy of OCR systems. In document images, most OCR errors are caused by broken and touching characters. This thesis presents an adaptive system to restore text images distorted by touching and broken characters. The adaptive system uses the distorted text image and the output from an OCR system to generate the training character image. Using the training image and the distorted image, the system trains an adaptive restoration filter and then uses the trained filter to restore the distorted text image. To demonstrate the performance of this technique, it was applied to several distorted images containing touching or broken characters. The results show that this technique can improve both pixel and OCR accuracy of distorted text images containing touching or broken characters

    Enhancement of Historical Printed Document Images by Combining Total Variation Regularization and Non-Local Means Filtering

    Get PDF
    This paper proposes a novel method for document enhancement which combines two recent powerful noise-reduction steps. The first step is based on the total variation framework. It flattens background grey-levels and produces an intermediate image where background noise is considerably reduced. This image is used as a mask to produce an image with a cleaner background while keeping character details. The second step is applied to the cleaner image and consists of a filter based on non-local means: character edges are smoothed by searching for similar patch images in pixel neighborhoods. The document images to be enhanced are real historical printed documents from several periods which include several defects in their background and on character edges. These defects result from scanning, paper aging and bleed- through. The proposed method enhances document images by combining the total variation and the non-local means techniques in order to improve OCR recognition. The method is shown to be more powerful than when these techniques are used alone and than other enhancement methods
    • …
    corecore