DocumentCode
2321995
Title
A first approach to the evaluation of arabic diacritization systems
Author
Bahanshal, Alia O. ; Al-Khalifa, Hend S.
Author_Institution
Comput. Res. Inst., King Abdulaziz City for Sci. & Technol., Riyadh, Saudi Arabia
fYear
2012
fDate
22-24 Aug. 2012
Firstpage
155
Lastpage
158
Abstract
Modern Standard Arabic (MSA) is widely used nowadays in Newspapers, books and the World Wide Web with rare use of diacritics. Diacritics, which are symbols placed above or below a letter, change the sound of letters and are used in aiding readers to understand and disambiguate written text. In order to permit automatic processing of Arabic text, many diacritization systems were introduced. In this paper, we evaluate the accuracy of some available diacritization systems using fully diacritized text from the Holy Quran and short poems from the period of the advent of Islam. We also discuss the results of the evaluation.
Keywords
natural language processing; text analysis; Arabic diacritization systems; Arabic text; Holy Quran; Islam; MSA; World Wide Web; automatic processing; books; diacritics; modern standard Arabic; newspapers; short poems; written text disambiguation; Accuracy; Cities and towns; Computers; Educational institutions; Standards; Syntactics; Text processing; Arabic Text Processing; Automatic Diacritizatio;
fLanguage
English
Publisher
ieee
Conference_Titel
Digital Information Management (ICDIM), 2012 Seventh International Conference on
Conference_Location
Macau
ISSN
pending
Print_ISBN
978-1-4673-2428-1
Type
conf
DOI
10.1109/ICDIM.2012.6360097
Filename
6360097
Link To Document