719 research outputs found

Component-based Segmentation of words from handwritten Arabic text

Authors: AlKhateeb J. H., Ipson S., Jiang J., Ren Jinchang
Publication date: 28/05/2008

Efficient preprocessing is essential for automatic recognition of handwritten documents. This paper presents techniques for segmenting words in handwritten Arabic text. First, connected components (CCs) are extracted and the distances between components are analyzed. The statistical distribution of these distances is then used to determine an optimal threshold for word segmentation. An improved projection-based method is also employed for baseline detection. The proposed method was tested on the IFN/ENIT database, which consists of 26459 Arabic words handwritten by 411 different writers, and the results were promising, with more accurate baseline detection and word segmentation for subsequent recognition.

University of Strathclyde Institutional Repository
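
The component-distance idea summarized in the AlKhateeb et al. abstract above can be illustrated with a minimal Python sketch. It is not the authors' implementation. The abstract does not say how the threshold is derived from the distance distribution, so the "largest jump between sorted gap values" rule used here is an assumption, as is the plain maximum-projection baseline (the paper describes an improved variant that is not specified in the abstract). The names `component_gaps`, `gap_threshold`, `segment_words` and `baseline_row` are hypothetical, and connected components are assumed to be already extracted and reduced to horizontal extents.

```python
from typing import List, Sequence, Tuple

# Horizontal extent of one connected component: (x_start, x_end), end exclusive.
Box = Tuple[int, int]


def component_gaps(boxes: Sequence[Box]) -> Tuple[List[Box], List[int]]:
    """Sort components by x_start and measure the gap before each later one."""
    ordered = sorted(boxes)
    gaps = []
    right_edge = ordered[0][1]
    for x_start, x_end in ordered[1:]:
        # Overlapping components (e.g. a dot above a letter) give a gap of 0.
        gaps.append(max(0, x_start - right_edge))
        right_edge = max(right_edge, x_end)
    return ordered, gaps


def gap_threshold(gaps: Sequence[int]) -> float:
    """Assumed rule: place the threshold in the largest jump between gap values."""
    values = sorted(set(gaps))
    if len(values) < 2:
        # Only one gap size observed: no evidence of a word boundary.
        return float("inf")
    jumps = [(b - a, (a + b) / 2) for a, b in zip(values, values[1:])]
    return max(jumps)[1]


def segment_words(boxes: Sequence[Box]) -> List[List[Box]]:
    """Group components into words; a gap above the threshold starts a new word."""
    if not boxes:
        return []
    ordered, gaps = component_gaps(boxes)
    threshold = gap_threshold(gaps)
    words = [[ordered[0]]]
    for box, gap in zip(ordered[1:], gaps):
        if gap > threshold:
            words.append([box])
        else:
            words[-1].append(box)
    return words


def baseline_row(image: Sequence[Sequence[int]]) -> int:
    """Plain horizontal projection: the row with the most ink pixels (1 = ink)."""
    projection = [sum(row) for row in image]
    return max(range(len(projection)), key=projection.__getitem__)


if __name__ == "__main__":
    boxes = [(0, 10), (12, 20), (21, 30), (45, 55), (57, 66)]
    for word in segment_words(boxes):
        print(word)
```

With the sample extents, the small gaps between neighbouring components stay inside a word and the single large gap separates two words. A real pipeline would obtain the extents from a connected-component labelling step and would likely estimate the threshold from gaps pooled over many text lines rather than a single one.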

NEUROSURGERY ENTHUSIASTIC WOMEN SOCIETY

Handwritten Character Recognition of South Indian Scripts: A Review

Authors: Jomy John, Kannan Balakrishnan, Pramod K. V.
Publication date: 01/06/2011

Handwritten character recognition remains a frontier area of research in pattern recognition and image processing, and there is a large demand for OCR on handwritten documents. Although many studies have been performed on foreign scripts such as Chinese, Japanese and Arabic, very little work can be traced on handwritten character recognition of Indian scripts, especially the South Indian scripts. This paper provides an overview of offline handwritten character recognition in South Indian scripts, namely Malayalam, Tamil, Kannada and Telugu.

Comment: Paper presented at the "National Conference on Indian Language Computing", Kochi, February 19-20, 2011. 6 pages, 5 figures.

arXiv.org e-Print Archive