A new algorithm for skew correction and baseline detection based on the randomized Hough Transform

Abstract

The proposed technique is based on the detection of the lower baselines of the text lines of Arabic documents. As the lower baseline pixels belong to the lower edge of the word images, we first locate vertically the black–white transitions at the black pixels where the resulting image would emphasize the baselines of the text. Once the skew angle is determined using a randomized Hough transform, the baselines are extracted using y-intercept histogram. This algorithm can also contribute significantly for text line extraction from skewed document images for many languages

Similar works

Full text

thumbnail-image

Directory of Open Access Journals

redirect
Last time updated on 14/10/2017

This paper was published in Directory of Open Access Journals.

Having an issue?

Is data on this page outdated, violates copyrights or anything else? Report the problem now and we will take corresponding actions after reviewing your request.