DocumentCode
2031090
Title
Automatic grapheme-to-phoneme conversion of Arabic text
Author
Al-Daradkah, Belal ; Al-Diri, Bashir
Author_Institution
Sch. of Comput. Sci., Univ. of Lincoln, Lincoln, UK
fYear
2015
fDate
28-30 July 2015
Firstpage
468
Lastpage
473
Abstract
An automated computerized system to convert Arabic graphemes to phonemes (G2P) by using Arabic language phonology rules supported by a dictionary of exceptional words. The system was tested on a publicly dataset that contains 620 fully diacritics Arabic sentences formed from 3440 words and consists of 27030 graphemes which were manually segmented. the results were very promising on different levels of validation: sentences, word and phonemes. The system showed a high rate of accuracy and the precision of the system is 99.19%; with sensitivity rate of 99.42%. All Arabic rules were applied and tested. The developed system can be applied to any diacritic Arabic text.
Keywords
natural language processing; speech synthesis; text analysis; Arabic language phonology rule; Arabic text; G2P; automated computerized system; grapheme-to-phoneme conversion; word dictionary; Accuracy; Dictionaries; Manuals; Speech; Speech processing; Speech recognition; Standards; Arabic; graphemes; phonemes; phonology; rules;
fLanguage
English
Publisher
ieee
Conference_Titel
Science and Information Conference (SAI), 2015
Conference_Location
London
Type
conf
DOI
10.1109/SAI.2015.7237184
Filename
7237184
Link To Document