Stroke width exploitation to improve automatic recognition of Arabic handwritten texts

Edgard Chammas, Laurence Likforman-Sulem, Chafic Mokbel

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Several inherent factors increase the complexity of automatic recognition of handwritten documents, such as the size of writing and the stroke width. In a previous work [1], we showed that a successful exploitation of the writing size improves the recognition performance. In this work we are interested in considering the stroke width as a factor in modeling, to improve the performance of automatic systems. The experiments were conducted on Arabic handwritten documents from one of the largest labeled Arabic handwriting databases, NISTOpenHaRT. The database includes large variability in the stroke width. We propose several approaches to deal with these changes in both training and recognition phases. The first experiments show that the recognition is largely affected by the stroke width. To account for this parameter, we propose to classify data into three classes according to the stroke width. In the recognition phase, we have thickened each text-line image into several versions with predefined values, then we combined the recognition scores for each value. This approach has significant performance gains for both an HMM-based and a BLSTM-based recognition systems. In addition, we integrated synthetic data to adapt HMM models at different stroke width measures. We also obtained performance gains by two different combination methods (ROVER, trellis) on the adapted models results. We provide the obtained recognition results showing the benefits of exploiting the stroke width, and compare them with a known approach for stroke width normalization.

Original languageEnglish
Title of host publication1st IEEE International Workshop on Arabic Script Analysis and Recognition, ASAR 2017
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages74-78
Number of pages5
ISBN (Electronic)9781509066285
DOIs
Publication statusPublished - 13 Oct 2017
Externally publishedYes
Event1st IEEE International Workshop on Arabic Script Analysis and Recognition, ASAR 2017 - Nancy, France
Duration: 3 Apr 20175 Apr 2017

Publication series

Name1st IEEE International Workshop on Arabic Script Analysis and Recognition, ASAR 2017

Conference

Conference1st IEEE International Workshop on Arabic Script Analysis and Recognition, ASAR 2017
Country/TerritoryFrance
CityNancy
Period3/04/175/04/17

Keywords

  • Adaptation
  • Arabic handwriting recognition
  • OpenHaRT database
  • Stroke width
  • Synthetic data

Fingerprint

Dive into the research topics of 'Stroke width exploitation to improve automatic recognition of Arabic handwritten texts'. Together they form a unique fingerprint.

Cite this