Using the Web for the handwritten out-of-vocabulary word recognition

Cristina Oprean, Laurence Likforman-Sulem, Adrian Popescu, Chafic Mokbel

Research output: Contribution to conferencePaperpeer-review

Abstract

Handwriting recognition systems rely on predefined classifiers. Small and static dictionaries are usually exploited to obtain high in-vocabulary (IV) accuracy at the expense of coverage. Thus the recognition of out-of-vocabulary (OOV) words cannot be handled efficiently. To improve OOV recognition while keeping IV dictionaries small, we introduce a multi-step approach that exploits Web resources. After an initial IV-OOV classification, external resources are used to create OOV sequence-adapted dynamic dictionaries. A final CTC-based decoding is performed over the dynamic dictionary to determine the most probable word for the OOV sequence. We validate our approach with experiments conducted on the RIMES dataset. Results show that improvements are obtained compared to standard handwriting recognition.

Translated title of the contributionUtilisation duWeb pour la reconnaissance de mots manuscrits hors-vocabulaire
Original languageEnglish
Pages217-232
Number of pages16
Publication statusPublished - 1 Jan 2014
Externally publishedYes
EventCORIA 2014 - Conference en Recherche d'Infomations et Applications - 11th French Information Retrieval Conference. CIFED 2014 Colloque International Francophone sur l'Ecrit et le Document - CORIA 2014 - Conference on Information Retrieval and Applications, 11th French Information Retrieval Conference. CIFED 2014 - International French Speaking Symposium on Writing and the Document - Nancy, France
Duration: 19 Mar 201423 Mar 2014

Conference

ConferenceCORIA 2014 - Conference en Recherche d'Infomations et Applications - 11th French Information Retrieval Conference. CIFED 2014 Colloque International Francophone sur l'Ecrit et le Document - CORIA 2014 - Conference on Information Retrieval and Applications, 11th French Information Retrieval Conference. CIFED 2014 - International French Speaking Symposium on Writing and the Document
Country/TerritoryFrance
CityNancy
Period19/03/1423/03/14

Fingerprint

Dive into the research topics of 'Using the Web for the handwritten out-of-vocabulary word recognition'. Together they form a unique fingerprint.

Cite this