Online-KHATT: Online-KFUPM Handwritten Arabic TexT Database

Online-KHATT (Online-KFUPM Handwritten Arabic TexT) database is a database of unconstrained handwritten Arabic online text written by 623 different writers.

Database Overview:

  • It consists of 10,040 lines of Arabic text written by 623 writers using Android- and Windows-based devices.
  • Writers are from different countries, gender, age groups, handedness and education level.
  • Natural writings with unrestricted writing styles.
  • Part of the collected data is segmented into characters and it is available along with their ground truths.
  • Written line are supplied with manually verified ground-truths.
  • The database divided into three disjoint sets viz. training (70%), validation (15%), and testing (15%).
  • It can be used for Arabic online text recognition, writer identification and verification, pre-processing and segmentation, etc.
  • The developed tools for collecting the data (for devices with electronic pen), verification and correction of ground truths, transliteration, and semi-automated segmentation of characters are also available for researchers.
  • The database is available free of charge (for academic and research purposes) to the researchers.

Online-KHATT line and character Samples

Figure 1 Samples of Online-KHATT text lines

Figure 2 Samples of Online-KHATT text line segmented into characters

For further information about the database go through:

  • The authors would like to acknowledge the support provided by King Abdul-Aziz City for Science and Technology (KACST) through the Science & Technology Unit at King Fahd University of Petroleum & Minerals (KFUPM) for funding this work through project no. 11-INF2153-4 as part of the National Science, Technology and Innovation Plan.

Copyright 2018