Article thumbnail

2009 10th International Conference on Document Analysis and Recognition Pre-processing of degraded printed documents by Non-local Means and Total Variation

By Laurence Likforman-sulem, Telecom Paristech, Jérôme Darbon and Elisa H. Barney Smith

Abstract

We compare in this study two image restoration approaches for the pre-processing of printed documents: namely the Non-local Means filter and a total variation minimization approach. We apply these two approaches to printed document sets from various periods, and we evaluate their effectiveness through character recognition performance using an open source OCR. Our results show that for each document set, one or both pre-processing methods improve character recognition accuracy over recognition without preprocessing. Higher accuracies are obtained with Non-local Means when characters have a low level of degradation since they can be restored by similar neighboring parts of nondegraded characters. The Total Variation approach is more effective when characters are highly degraded and can only be restored through modeling instead of using neighboring data.

Year: 2012
OAI identifier: oai:CiteSeerX.psu:10.1.1.212.3133
Provided by: CiteSeerX
Download PDF:
Sorry, we are unable to provide the full text but you may find it at the following location(s):
  • http://citeseerx.ist.psu.edu/v... (external link)
  • http://www.cvc.uab.es/icdar200... (external link)
  • Suggested articles


    To submit an update or takedown request for this paper, please submit an Update/Correction/Removal Request.