Search CORE

204 research outputs found

A Bottom Up Procedure for Text Line Segmentation of Latin Script

Author: Jain Himanshu
Kumar Archana Praveen
Publication venue
Publication date: 09/10/2017
Field of study

In this paper we present a bottom up procedure for segmentation of text lines written or printed in the Latin script. The proposed method uses a combination of image morphology, feature extraction and Gaussian mixture model to perform this task. The experimental results show the validity of the procedure.Comment: Accepted and presented at the IEEE conference "International Conference on Advances in Computing, Communications and Informatics (ICACCI) 2017

arXiv.org e-Print Archive

Crossref

Text Line Segmentation of Historical Documents: a Survey

Author: A. Amin
A. Bozzi
A. Downton
A. Jain
A. Kolcz
Abderrazak Zahour
Bruno Taconet
C.L. Tan
C.V. Lakshmi
E. Cohen
E. Oztop
G. Seni
I.-K. Kim
K. Wong
L. Likforman-Sulem
L. Likforman-Sulem
L. Likforman-Sulem
L. O’Gorman
L.A. Fletcher
Laurence Likforman-Sulem
R. Plamondon
R.D. Lins
U. Pal
V. Shapiro
Ventadert Gusnard de de
Y. Solihin
Y.H. Tseng
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 10/04/2007
Field of study

There is a huge amount of historical documents in libraries and in various National Archives that have not been exploited electronically. Although automatic reading of complete pages remains, in most cases, a long-term objective, tasks such as word spotting, text/image alignment, authentication and extraction of specific fields are in use today. For all these tasks, a major step is document segmentation into text lines. Because of the low quality and the complexity of these documents (background noise, artifacts due to aging, interfering lines),automatic text line segmentation remains an open research field. The objective of this paper is to present a survey of existing methods, developed during the last decade, and dedicated to documents of historical interest.Comment: 25 pages, submitted version, To appear in International Journal on Document Analysis and Recognition, On line version available at http://www.springerlink.com/content/k2813176280456k3

arXiv.org e-Print Archive

Crossref

Estimation of the Handwritten Text Skew Based on Binary Moments

Author: D. Brodić
Z. Milivojević
Publication venue: Společnost pro radioelektronické inženýrství
Publication date: 01/04/2012
Field of study

Binary moments represent one of the methods for the text skew estimation in binary images. It has been used widely for the skew identification of the printed text. However, the handwritten text consists of text objects, which are characterized with different skews. Hence, the method should be adapted for the handwritten text. This is achieved with the image splitting into separate text objects made by the bounding boxes. Obtained text objects represent the isolated binary objects. The application of the moment-based method to each binary object evaluates their local text skews. Due to the accuracy, estimated skew data can be used as an input to the algorithms for the text line segmentation

Directory of Open Access Journals

Digital library of Brno University of Technology

The application of new methods for offline recognition in printed Arabic documents

Author: Bouressace Hassina
Publication venue
Publication date: 29/05/2020
Field of study

SZTE Doktori Értekezések Repozitórium (SZTE Repository of Dissertations)

Text Line Segmentation of Handwritten Documents in Hindi and English

Author: Sunanda Dixit, Sneha, Nilotpal Utkalit, Suresh H.N.
Publication venue: 'Auricle Technologies, Pvt., Ltd.'
Publication date: 30/04/2014
Field of study

International Journal on Recent and Innovation Trends in Computing and Communication

HYBRID BINARIZTION TECHNIQUE FOR HISTORICAL MANUSCRIPTS

Author: Goyal Rajan
Kaur Amandeep
Publication venue: Institute for Project Management Pvt. Ltd
Publication date: 21/08/2020
Field of study

This paper presents a new hybrid approach for the binarization and enhancement of Historical Manuscript. This paper deals with degradations which occur due to shadows, non-uniform illumination, low contrast and strain. We follow two distinct method of Binarization with a pre-processing procedure using a adaptive Wiener filter, a rough estimation of foreground regions and a background surface calculation by interpolating neighboring background intensities. Further logical anding of the calculated background surface with compliment of second method result, performing final thresholding and post-processing in order to improve the quality of text regions. After extensive experiments, our method demonstrated superior performance against some wellknown techniques on numerous degraded document images as well as on Historical Manuscript in both manners qualitatively and quantitatively

Interscience Research Network