Pre-processing of degraded printed documents by non-local means and total variation

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

We compare in this study two image restoration ap-proaches for the pre-processing of printed documents: namely the Non-local Means filter and a total variation minimization approach. We apply these two approaches to printed document sets from various periods, and we evaluate their effectiveness through character recognition performance using an open source OCR. Our results show that for each document set, one or both pre-processing methods improve character recognition accuracy over recognition without preprocessing. Higher accuracies are obtained with Non-local Means when characters have a low level of degradation since they can be restored by similar neighboring parts of non-degraded characters. The Total Variation approach is more effective when characters are highly degraded and can only be restored through modeling instead of using neighboring data

Original languageEnglish
Title of host publicationICDAR2009 - 10th International Conference on Document Analysis and Recognition
Pages758-762
Number of pages5
DOIs
Publication statusPublished - 10 Dec 2009
EventICDAR2009 - 10th International Conference on Document Analysis and Recognition - Barcelona, Spain
Duration: 26 Jul 200929 Jul 2009

Publication series

NameProceedings of the International Conference on Document Analysis and Recognition, ICDAR
ISSN (Print)1520-5363

Conference

ConferenceICDAR2009 - 10th International Conference on Document Analysis and Recognition
Country/TerritorySpain
CityBarcelona
Period26/07/0929/07/09

Fingerprint

Dive into the research topics of 'Pre-processing of degraded printed documents by non-local means and total variation'. Together they form a unique fingerprint.

Cite this