Passer à la navigation principale Passer à la recherche Passer au contenu principal

Reconnaissance de mots manuscrits horsvocabulaire en utilisant des ressources web

  • CNRS LTCI
  • University of Balamand
  • CEA/UVSQ/CNRS

Résultats de recherche: Contribution à un journalArticleRevue par des pairs

Résumé

Handwriting recognition systems rely on predefined dictionaries. Small and static dictionaries are often exploited to obtain high in-vocabulary (IV) accuracy at the expense of coverage. Thus the recognition of out-of-vocabulary (OOV) words is not handled efficiently. To improve OOV recognition while keeping IV dictionaries small, we introduce a multi-step approach that exploits web resources. After an IV-OOV classification, Wikipedia is used to create OOV sequence-adapted dynamic dictionaries. A second decoding is done the dynamic dictionary to determine the most probable word for the OOV sequence. We validate our approach with experiments conducted on the RIMES dataset using a BLSTM recognizer. Results show that improvements are obtained compared to handwriting recognition with static dictionary.

langue originaleFrançais
Pages (de - à)77-96
Nombre de pages20
journalDocument Numerique
Volume17
Numéro de publication3
Les DOIs
étatPublié - 1 janv. 2014
Modification externeOui

mots-clés

  • Adapted dynamic dictionaries
  • BLSTM
  • Handwriting recognition
  • Wikipedia

Contient cette citation