Passer à la navigation principale Passer à la recherche Passer au contenu principal

BLSTM-based handwritten text recognition using Web resources

  • Institut Mines-Télécom
  • University of Balamand
  • CEA/UVSQ/CNRS

Résultats de recherche: Le chapitre dans un livre, un rapport, une anthologie ou une collectionContribution à une conférenceRevue par des pairs

Résumé

Handwriting recognition systems usually rely on static dictionaries and language models. Full coverage of these dictionaries is generally not achieved when dealing with unrestricted document corpora due to the presence of Out-Of-Vocabulary words. In a previous work, dynamic dictionaries were built from Web resources and successfully applied to isolated word recognition. In the present work we extend this approach to text-line recognition. Line segmentation into words is needed to exploit dynamic dictionaries and it is performed using BLSTM classifiers to align filler models and word sequence outputs. Words are then classified based on the confidence score into anchor and non-anchor words (AWs and NAWs). AWs are equated to the BLSTM outputs and used as such. Dynamic dictionaries are built for NAWs by exploiting Web resources for their character sequence and for neighboring AWs. Text-lines are decoded again using dynamic dictionaries and re-estimated language model. We conduct experiments on the publicly available RIMES database and show that the introduction of the dynamic dictionary is beneficial. Equally important, we show that the gain increases as the proportion of OOVs increases.

langue originaleAnglais
titre13th IAPR International Conference on Document Analysis and Recognition, ICDAR 2015 - Conference Proceedings
EditeurIEEE Computer Society
Pages466-470
Nombre de pages5
ISBN (Electronique)9781479918058
Les DOIs
étatPublié - 20 nov. 2015
Modification externeOui
Evénement13th International Conference on Document Analysis and Recognition, ICDAR 2015 - Nancy, France
Durée: 23 août 201526 août 2015

Série de publications

NomProceedings of the International Conference on Document Analysis and Recognition, ICDAR
Volume2015-November
ISSN (imprimé)1520-5363

Une conférence

Une conférence13th International Conference on Document Analysis and Recognition, ICDAR 2015
Pays/TerritoireFrance
La villeNancy
période23/08/1526/08/15

Empreinte digitale

Examiner les sujets de recherche de « BLSTM-based handwritten text recognition using Web resources ». Ensemble, ils forment une empreinte digitale unique.

Contient cette citation