Automatic name extraction from degraded document images

Research output: Contribution to journalArticlepeer-review

Abstract

The problem addressed in this paper is the automatic extraction of names from a document image. Our approach relies on the combination of two complementary analyses. First, the image-based analysis exploits visual clues to select the regions of interest in the document. Second, the textual-based analysis searches for name patterns and low-level word textual features. Both analyses are then combined at the word level through a neural network fusion scheme. Reported results on degraded documents such as facsimile and photocopied technical journals demonstrate the interest of the combined approach.

Original languageEnglish
Pages (from-to)211-227
Number of pages17
JournalPattern Analysis and Applications
Volume9
Issue number2-3
DOIs
Publication statusPublished - 1 Oct 2006
Externally publishedYes

Keywords

  • Document image analysis
  • Document understanding
  • Facsimile processing
  • Journal title pages
  • Name extraction
  • Neural Networks
  • Visual clues

Fingerprint

Dive into the research topics of 'Automatic name extraction from degraded document images'. Together they form a unique fingerprint.

Cite this