• DocumentCode
    2031090
  • Title

    Automatic grapheme-to-phoneme conversion of Arabic text

  • Author

    Al-Daradkah, Belal ; Al-Diri, Bashir

  • Author_Institution
    Sch. of Comput. Sci., Univ. of Lincoln, Lincoln, UK
  • fYear
    2015
  • fDate
    28-30 July 2015
  • Firstpage
    468
  • Lastpage
    473
  • Abstract
    An automated computerized system to convert Arabic graphemes to phonemes (G2P) by using Arabic language phonology rules supported by a dictionary of exceptional words. The system was tested on a publicly dataset that contains 620 fully diacritics Arabic sentences formed from 3440 words and consists of 27030 graphemes which were manually segmented. the results were very promising on different levels of validation: sentences, word and phonemes. The system showed a high rate of accuracy and the precision of the system is 99.19%; with sensitivity rate of 99.42%. All Arabic rules were applied and tested. The developed system can be applied to any diacritic Arabic text.
  • Keywords
    natural language processing; speech synthesis; text analysis; Arabic language phonology rule; Arabic text; G2P; automated computerized system; grapheme-to-phoneme conversion; word dictionary; Accuracy; Dictionaries; Manuals; Speech; Speech processing; Speech recognition; Standards; Arabic; graphemes; phonemes; phonology; rules;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Science and Information Conference (SAI), 2015
  • Conference_Location
    London
  • Type

    conf

  • DOI
    10.1109/SAI.2015.7237184
  • Filename
    7237184