Passer à la navigation principale Passer à la recherche Passer au contenu principal

Using the web to create dynamic dictionaries in handwritten out-of-vocabulary word recognition

  • CNRS LTCI
  • CEA/UVSQ/CNRS
  • University of Balamand

Résultats de recherche: Contribution à un journalArticle de conférenceRevue par des pairs

Résumé

Handwriting recognition systems rely on predefined dictionaries obtained from training data. Small and static dictionaries are usually exploited to obtain high in-vocabulary (IV) accuracy at the expense of coverage. Thus the recognition of out-of-vocabulary (OOV) words cannot be handled efficiently. To improve OOV recognition while keeping IV dictionaries small, we introduce a multi-step approach that exploits Web resources. After an initial IV-OOV sequence classification, external resources are used to create OOV sequence-adapted dynamic dictionaries. A final Viterbi-based decoding is performed over the dynamic dictionary to determine the most probable word for the OOV sequence. We validate our approach with experiments conducted on RIMES, a publicly available database. Results show that improvements are obtained compared to standard handwriting recognition, performed with a static dictionary. Both domain adapted and generic dynamic dictionaries are studied and we show that domain adaptation is beneficial.

langue originaleAnglais
Numéro d'article6628764
Pages (de - à)989-993
Nombre de pages5
journalProceedings of the International Conference on Document Analysis and Recognition, ICDAR
Les DOIs
étatPublié - 11 déc. 2013
Modification externeOui
Evénement12th International Conference on Document Analysis and Recognition, ICDAR 2013 - Washington, DC, États-Unis
Durée: 25 août 201328 août 2013

Empreinte digitale

Examiner les sujets de recherche de « Using the web to create dynamic dictionaries in handwritten out-of-vocabulary word recognition ». Ensemble, ils forment une empreinte digitale unique.

Contient cette citation