Passer à la navigation principale Passer à la recherche Passer au contenu principal

Stroke width exploitation to improve automatic recognition of Arabic handwritten texts

  • Université Paris-Saclay
  • University of Balamand

Résultats de recherche: Le chapitre dans un livre, un rapport, une anthologie ou une collectionContribution à une conférenceRevue par des pairs

Résumé

Several inherent factors increase the complexity of automatic recognition of handwritten documents, such as the size of writing and the stroke width. In a previous work [1], we showed that a successful exploitation of the writing size improves the recognition performance. In this work we are interested in considering the stroke width as a factor in modeling, to improve the performance of automatic systems. The experiments were conducted on Arabic handwritten documents from one of the largest labeled Arabic handwriting databases, NISTOpenHaRT. The database includes large variability in the stroke width. We propose several approaches to deal with these changes in both training and recognition phases. The first experiments show that the recognition is largely affected by the stroke width. To account for this parameter, we propose to classify data into three classes according to the stroke width. In the recognition phase, we have thickened each text-line image into several versions with predefined values, then we combined the recognition scores for each value. This approach has significant performance gains for both an HMM-based and a BLSTM-based recognition systems. In addition, we integrated synthetic data to adapt HMM models at different stroke width measures. We also obtained performance gains by two different combination methods (ROVER, trellis) on the adapted models results. We provide the obtained recognition results showing the benefits of exploiting the stroke width, and compare them with a known approach for stroke width normalization.

langue originaleAnglais
titre1st IEEE International Workshop on Arabic Script Analysis and Recognition, ASAR 2017
EditeurInstitute of Electrical and Electronics Engineers Inc.
Pages74-78
Nombre de pages5
ISBN (Electronique)9781509066285
Les DOIs
étatPublié - 13 oct. 2017
Modification externeOui
Evénement1st IEEE International Workshop on Arabic Script Analysis and Recognition, ASAR 2017 - Nancy, France
Durée: 3 avr. 20175 avr. 2017

Série de publications

Nom1st IEEE International Workshop on Arabic Script Analysis and Recognition, ASAR 2017

Une conférence

Une conférence1st IEEE International Workshop on Arabic Script Analysis and Recognition, ASAR 2017
Pays/TerritoireFrance
La villeNancy
période3/04/175/04/17

Empreinte digitale

Examiner les sujets de recherche de « Stroke width exploitation to improve automatic recognition of Arabic handwritten texts ». Ensemble, ils forment une empreinte digitale unique.

Contient cette citation