DocumentCode
591982
Title
A Database for Arabic Handwritten Text Image Recognition and Writer Identification
Author
Mezghani, Amine ; Kanoun, Slim ; Khemakhem, Mahdi ; Abed, H.E.
Author_Institution
MIRACL Lab., Univ. of Sfax, Sfax, Tunisia
fYear
2012
fDate
18-20 Sept. 2012
Firstpage
399
Lastpage
402
Abstract
Standard databases play essential roles for evaluating and comparing results obtained by different groups of researchers. In this paper, an Arabic Handwritten Text Images Database written by Multiple Writers (AHTID/MW) is introduced. This database can be used for research in the recognition of Arabic handwritten text with open vocabulary, word segmentation and writer identification. The AHTID/MW contains 3710 text lines and 22896 words written by 53 native writers of Arabic. In addition, ground truth annotation is provided for each text image. The database is freely available for worldwide researchers.
Keywords
document image processing; handwritten character recognition; natural language processing; text analysis; visual databases; vocabulary; AHTID/MW; Arabic handwritten text image database; Arabic handwritten text image recognition; ground truth annotation; multiple writers; open vocabulary; text line; word segmentation; writer identification; Educational institutions; Handwriting recognition; Image databases; Shape; Text recognition; Vocabulary; AHTID/MW Database; Arabic Handwritten text image; Ground truth; open vocabulary;
fLanguage
English
Publisher
ieee
Conference_Titel
Frontiers in Handwriting Recognition (ICFHR), 2012 International Conference on
Conference_Location
Bari
Print_ISBN
978-1-4673-2262-1
Type
conf
DOI
10.1109/ICFHR.2012.155
Filename
6424426
Link To Document