Online-KHATT: Online-KFUPM Handwritten Arabic TexT Database
Online-KHATT (Online-KFUPM Handwritten Arabic TexT) database is a database of unconstrained handwritten Arabic online text written by 623 different writers.
Database Overview:
- It consists of 10,040 lines of Arabic text written by 623 writers using Android- and Windows-based devices.
- Writers are from different countries, gender, age groups, handedness and education level.
- Natural writings with unrestricted writing styles.
- Part of the collected data is segmented into characters and it is available along with their ground truths.
- Written line are supplied with manually verified ground-truths.
- The database divided into three disjoint sets viz. training (70%), validation (15%), and testing (15%).
- It can be used for Arabic online text recognition, writer identification and verification, pre-processing and segmentation, etc.
- The developed tools for collecting the data (for devices with electronic pen), verification and correction of ground truths, transliteration, and semi-automated segmentation of characters are also available for researchers.
- The database is available free of charge (for academic and research purposes) to the researchers.
Online-KHATT line and character Samples
Figure 1 Samples of Online-KHATT text lines
|
|
Figure 2 Samples of Online-KHATT text line segmented into characters
|
For further information about the database go through:
- Sabri A. Mahmoud , Hamzah Luqman, Baligh M. Al-Helali, Galal BinMakhashen and Mohammad Tanvir Parvez."Online-KHATT: An Open-Vocabulary Database for Arabic Online- text processing", The Open Cybernetics & Systemics Journal, 2018.
- The authors would like to acknowledge the support provided
by King Abdul-Aziz City for Science and Technology (KACST)
through the Science & Technology Unit at King Fahd University
of Petroleum & Minerals (KFUPM) for funding this work through
project no. 11-INF2153-4 as part of the National Science, Technology
and Innovation Plan.