Search CORE

326 research outputs found

DeepOtsu: Document Enhancement and Binarization using Iterative Deep Learning

Author: He Sheng
Schomaker Lambert
Publication venue: 'Elsevier BV'
Publication date: 17/01/2019
Field of study

This paper presents a novel iterative deep learning framework and apply it for document enhancement and binarization. Unlike the traditional methods which predict the binary label of each pixel on the input image, we train the neural network to learn the degradations in document images and produce the uniform images of the degraded input images, which allows the network to refine the output iteratively. Two different iterative methods have been studied in this paper: recurrent refinement (RR) which uses the same trained neural network in each iteration for document enhancement and stacked refinement (SR) which uses a stack of different neural networks for iterative output refinement. Given the learned uniform and enhanced image, the binarization map can be easy to obtain by a global or local threshold. The experimental results on several public benchmark data sets show that our proposed methods provide a new clean version of the degraded image which is suitable for visualization and promising results of binarization using the global Otsu's threshold based on the enhanced images learned iteratively by the neural network.Comment: Accepted by Pattern Recognitio

arXiv.org e-Print Archive

Proceedings - University of Groningen

University of Groningen

ARTS repository - University of Groningen

Dissertations of the University of Groningen

Image Enhancement with Statistical Estimation

Author: Kanrar Soumen
Mukherjee Aroop
Publication venue: 'Academy and Industry Research Collaboration Center (AIRCC)'
Publication date: 07/05/2012
Field of study

Contrast enhancement is an important area of research for the image analysis. Over the decade, the researcher worked on this domain to develop an efficient and adequate algorithm. The proposed method will enhance the contrast of image using Binarization method with the help of Maximum Likelihood Estimation (MLE). The paper aims to enhance the image contrast of bimodal and multi-modal images. The proposed methodology use to collect mathematical information retrieves from the image. In this paper, we are using binarization method that generates the desired histogram by separating image nodes. It generates the enhanced image using histogram specification with binarization method. The proposed method has showed an improvement in the image contrast enhancement compare with the other image.Comment: 9 pages,6 figures; ISSN:0975-5578 (Online); 0975-5934 (Print

arXiv.org e-Print Archive

Crossref

Recognizing Degraded Handwritten Characters

Author: Diem Markus
Gau Melanie
Miklas Heinz
Sablatnig Robert
Publication venue: Books on Demand (BoD)
Publication date: 01/01/2011
Field of study

In this paper, Slavonic manuscripts from the 11th century written in Glagolitic script are investigated. State-of-the-art optical character recognition methods produce poor results for degraded handwritten document images. This is largely due to a lack of suitable results from basic pre-processing steps such as binarization and image segmentation. Therefore, a new, binarization-free approach will be presented that is independent of pre-processing deficiencies. It additionally incorporates local information in order to recognize also fragmented or faded characters. The proposed algorithm consists of two steps: character classification and character localization. Firstly scale invariant feature transform features are extracted and classified using support vector machines. On this basis interest points are clustered according to their spatial information. Then, characters are localized and eventually recognized by a weighted voting scheme of pre-classified local descriptors. Preliminary results show that the proposed system can handle highly degraded manuscript images with background noise, e.g. stains, tears, and faded characters

Kölner UniversitätsPublikationsServer

BiNet:Degraded-Manuscript Binarization in Diverse Document Textures and Layouts using Deep Encoder-Decoder Networks

Author: Dhali Maruf A.
Schomaker Lambert
Wit Jan Willem de
Publication venue
Publication date: 13/11/2019
Field of study

Proceedings - University of Groningen

BiNet:Degraded-Manuscript Binarization in Diverse Document Textures and Layouts using Deep Encoder-Decoder Networks

Author: Dhali Maruf A.
Schomaker Lambert
Wit Jan Willem de
Publication venue
Publication date: 13/11/2019
Field of study

ARTS repository - University of Groningen

Word matching using single closed contours for indexing handwritten historical documents

Author: Alan F. Smeaton
C.C. Teppert
D. Cheng
F. Mokhtarian
L. Vincent
L.K. Huang
Noel E. O’Connor
R.F. Farag
S. Belongie
S. Madhvanath
S. Madhvanath
S. Madhvanath
Tomasz Adamek
W. Niblack
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/04/2007
Field of study

Effective indexing is crucial for providing convenient access to scanned versions of large collections of historically valuable handwritten manuscripts. Since traditional handwriting recognizers based on optical character recognition (OCR) do not perform well on historical documents, recently a holistic word recognition approach has gained in popularity as an attractive and more straightforward solution (Lavrenko et al. in proc. document Image Analysis for Libraries (DIAL’04), pp. 278–287, 2004). Such techniques attempt to recognize words based on scalar and profile-based features extracted from whole word images. In this paper, we propose a new approach to holistic word recognition for historical handwritten manuscripts based on matching word contours instead of whole images or word profiles. The new method consists of robust extraction of closed word contours and the application of an elastic contour matching technique proposed originally for general shapes (Adamek and O’Connor in IEEE Trans Circuits Syst Video Technol 5:2004). We demonstrate that multiscale contour-based descriptors can effectively capture intrinsic word features avoiding any segmentation of words into smaller subunits. Our experiments show a recognition accuracy of 83%, which considerably exceeds the performance of other systems reported in the literature

Crossref

Irish Universities

DCU Online Research Access Service