Search CORE

7,947 research outputs found

Automatic Document Image Binarization using Bayesian Optimization

Author: Badekas E
Bernsen John
Gatos Basilis
Nafchi Hossein Ziaei
Ntirogiannis Konstantinos
Pratikakis Ioannis
Pratikakis Ioannis
Pratikakis Ioannis
Pratikakis Ioannis
Su Bolan
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 21/10/2017
Field of study

Document image binarization is often a challenging task due to various forms of degradation. Although there exist several binarization techniques in literature, the binarized image is typically sensitive to control parameter settings of the employed technique. This paper presents an automatic document image binarization algorithm to segment the text from heavily degraded document images. The proposed technique uses a two band-pass filtering approach for background noise removal, and Bayesian optimization for automatic hyperparameter selection for optimal results. The effectiveness of the proposed binarization technique is empirically demonstrated on the Document Image Binarization Competition (DIBCO) and the Handwritten Document Image Binarization Competition (H-DIBCO) datasets

arXiv.org e-Print Archive

Crossref

Adaptive Algorithms for Automated Processing of Document Images

Author: Agrawal Mudit
Publication venue
Publication date: 01/01/2011
Field of study

Large scale document digitization projects continue to motivate interesting document understanding technologies such as script and language identification, page classification, segmentation and enhancement. Typically, however, solutions are still limited to narrow domains or regular formats such as books, forms, articles or letters and operate best on clean documents scanned in a controlled environment. More general collections of heterogeneous documents challenge the basic assumptions of state-of-the-art technology regarding quality, script, content and layout. Our work explores the use of adaptive algorithms for the automated analysis of noisy and complex document collections. We first propose, implement and evaluate an adaptive clutter detection and removal technique for complex binary documents. Our distance transform based technique aims to remove irregular and independent unwanted foreground content while leaving text content untouched. The novelty of this approach is in its determination of best approximation to clutter-content boundary with text like structures. Second, we describe a page segmentation technique called Voronoi++ for complex layouts which builds upon the state-of-the-art method proposed by Kise [Kise1999]. Our approach does not assume structured text zones and is designed to handle multi-lingual text in both handwritten and printed form. Voronoi++ is a dynamically adaptive and contextually aware approach that considers components' separation features combined with Docstrum [O'Gorman1993] based angular and neighborhood features to form provisional zone hypotheses. These provisional zones are then verified based on the context built from local separation and high-level content features. Finally, our research proposes a generic model to segment and to recognize characters for any complex syllabic or non-syllabic script, using font-models. This concept is based on the fact that font files contain all the information necessary to render text and thus a model for how to decompose them. Instead of script-specific routines, this work is a step towards a generic character and recognition scheme for both Latin and non-Latin scripts

Digital Repository at the University of Maryland

Persian Heritage Image Binarization Competition (PHIBC 2012)

Author: Ayatollahi Seyed Morteza
Nafchi Hossein Ziaei
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 20/04/2016
Field of study

The first competition on the binarization of historical Persian documents and manuscripts (PHIBC 2012) has been organized in conjunction with the first Iranian conference on pattern recognition and image analysis (PRIA 2013). The main objective of PHIBC 2012 is to evaluate performance of the binarization methodologies, when applied on the Persian heritage images. This paper provides a report on the methodology and performance of the three submitted algorithms based on evaluation measures has been used.Comment: 4 pages, 2 figures, conferenc

arXiv.org e-Print Archive

CiteSeerX

Development of a Recognizer for Bangla Text: Present Status and Future Challenges

Author: Hasan Sarwar
Mofizur Rahman
Nasreen Akter
Saima Hossain
Publication venue: 'IntechOpen'
Publication date: 17/08/2010
Field of study

IntechOpen

A GENERIC SYSTEM TO EXTRACT AND CLEAN HANDWRITTEN DATA FROM BUSINESS FORMS

Author: Cheriet M.
Suen C.Y.
Publication venue: s.n.
Publication date: 01/01/2004
Field of study

Proceedings - University of Groningen