6 research outputs found

DeepOtsu: Document Enhancement and Binarization using Iterative Deep Learning

Author: He Sheng
Schomaker Lambert
Publication venue: 'Elsevier BV'
Publication date: 17/01/2019

This paper presents a novel iterative deep learning framework and applies it to document enhancement and binarization. Unlike traditional methods, which predict the binary label of each pixel in the input image, we train the neural network to learn the degradations in document images and to produce uniform versions of the degraded input images, which allows the network to refine its output iteratively. Two iterative methods are studied: recurrent refinement (RR), which uses the same trained neural network in each iteration for document enhancement, and stacked refinement (SR), which uses a stack of different neural networks for iterative output refinement. Given the learned uniform and enhanced image, the binarization map can easily be obtained with a global or local threshold. Experimental results on several public benchmark data sets show that the proposed methods produce a clean version of the degraded image that is suitable for visualization, and that applying the global Otsu threshold to the iteratively enhanced images yields promising binarization results.

Comment: Accepted by Pattern Recognition
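As a rough illustration of the two steps the DeepOtsu abstract describes (iterative enhancement followed by a global Otsu threshold), here is a minimal sketch in plain Python. It is not the authors' implementation. The `enhance` callable is a hypothetical stand-in for the trained network, the image is assumed to be a flat list of 8-bit gray levels, and the names `refine`, `otsu_threshold`, and `binarize` are chosen here for illustration only.

```python
def refine(image, enhance, iterations=3):
    """Recurrent-refinement sketch: apply the same enhancer repeatedly.

    `enhance` is a hypothetical placeholder for a trained network that maps
    a degraded image to a cleaner one.
    """
    for _ in range(iterations):
        image = enhance(image)
    return image


def otsu_threshold(pixels, levels=256):
    """Return the gray level t that maximizes between-class variance.

    Pixels with value <= t form one class and pixels > t the other.
    """
    hist = [0] * levels
    for p in pixels:
        hist[p] += 1

    total = len(pixels)
    sum_all = sum(level * count for level, count in enumerate(hist))

    best_t, best_var = 0, -1.0
    w0 = 0    # number of pixels at or below the candidate threshold
    sum0 = 0  # sum of their gray levels
    for t in range(levels):
        w0 += hist[t]
        sum0 += t * hist[t]
        w1 = total - w0
        if w0 == 0:
            continue  # lower class is still empty
        if w1 == 0:
            break     # upper class is empty from here on
        mu0 = sum0 / w0
        mu1 = (sum_all - sum0) / w1
        between = w0 * w1 * (mu0 - mu1) ** 2
        if between > best_var:
            best_t, best_var = t, between
    return best_t


def binarize(pixels, threshold):
    """Map each pixel to 1 if brighter than the threshold, else 0.

    Assuming dark ink on a light page, 0 marks ink and 1 marks background.
    """
    return [1 if p > threshold else 0 for p in pixels]
```

In this sketch the enhancer and the thresholding are decoupled on purpose: any function that returns a cleaner gray-level image can be passed to `refine`, and the result can then be binarized with `binarize(img, otsu_threshold(img))`. A stacked-refinement variant would instead loop over a list of different enhancers.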

arXiv.org e-Print Archive

Proceedings - University of Groningen

University of Groningen

ARTS repository - University of Groningen

Dissertations of the University of Groningen

BiNet: Degraded-Manuscript Binarization in Diverse Document Textures and Layouts using Deep Encoder-Decoder Networks

Author: Dhali Maruf A.
Schomaker Lambert
Wit Jan Willem de
Publication date: 13/11/2019

Handwritten document-image binarization is a semantic segmentation process that differentiates ink pixels from background pixels. It is one of the essential steps towards character recognition, writer identification, and script-style evolution analysis. The binarization task itself is challenging because of the vast diversity of writing styles, inks, and paper materials. It is even more difficult for historical manuscripts because of the aging and degradation of the documents over time. One such manuscript collection is the Dead Sea Scrolls (DSS) image collection, which poses extreme challenges for existing binarization techniques. This article proposes a new binarization technique for the DSS images using deep encoder-decoder networks. Although the artificial neural network proposed here is primarily designed to binarize the DSS images, it can be trained on other manuscript collections as well. Additionally, the use of transfer learning makes the network readily usable for a wide range of handwritten documents, making it a unique multi-purpose tool for binarization. Qualitative results and several quantitative comparisons, using both historical manuscripts and datasets from the handwritten document image binarization competitions (H-DIBCO and DIBCO), demonstrate the robustness and effectiveness of the system. The best-performing network architecture proposed here is a variant of the U-Net encoder-decoder.

Comment: 26 pages, 15 figures, 11 tables

arXiv.org e-Print Archive

Proceedings - University of Groningen

University of Groningen

ARTS repository - University of Groningen

Dissertations of the University of Groningen

A comprehensive survey of handwritten document benchmarks: structure, usage and evaluation

Publication venue: Springer
Publication date: 24/12/2015

Springer - Publisher Connector

A comprehensive survey of handwritten document benchmarks: structure, usage and evaluation

Author: A Bensefia
A Fischer
A Giménez
A Schlapbach
A Shivram
A-HM R
A-L Bianne-Bernard
Ahsen Raza
AK Jain
B Verma
B Zhu
C-L Liu
Chawki Djeddi
CO Freitas
D Bertolini
D-H Wang
E Kavallieratou
E Kussul
EF Can
F H-C
F Lauer
F Zamora-Martínez
GE Hinton
GX Tan
H Bunke
H El-Abed
H Liu
H Yamada
I Siddiqi
Imran Siddiqi
JJ Hull
K Seo
Khurram Khurshid
L C-L
L Jin
L Xu
L Z
M Bulacu
M Liwicki
M Nakagawa
M Shi
MA Mohamed
MN Abdi
N Serrano
NB Amara
Q-F Wang
R Saabni
Raashid Hussain
S Al-Maadeed
S Gunter
SJ Smith
T-H Su
TM Ha
U Bhattacharya
UV Marti
V Frinken
Y Al-Ohali
Y Kessentini
Y LeCun
Y Shao
Publication venue: 'Springer Science and Business Media LLC'

Crossref