• DocumentCode
    2176685
  • Title

    Rapid phonetic transcription using everyday life natural Chat Alphabet orthography for dialectal Arabic speech recognition

  • Author

    Elmahdy, Mohamed ; Gruhn, Rainer ; Abdennadher, Slim ; Minker, Wolfgang

  • Author_Institution
    Fac. of Eng. & Comput. Sci., Univ. of Ulm, Ulm, Germany
  • fYear
    2011
  • fDate
    22-27 May 2011
  • Firstpage
    4936
  • Lastpage
    4939
  • Abstract
    We propose the Arabic Chat Alphabet (ACA) as naturally written in everyday life for dialectal Arabic speech transcription. Our assumption is that ACA is a natural language that includes short vowels that are missing in traditional Arabic orthography. Furthermore, ACA transcriptions can be rapidly prepared. Egyptian Colloquial Arabic was chosen as a typical dialect. Two speech recognition baselines were built: phonemic and graphemic. Original transcriptions were re-written in ACA by different transcribers. Ambiguous ACA sequences were handled by automatically generating all possible variants. ACA variations across transcribers were modeled by phonemes normalization and merging. Results show that the ACA-based approach outperforms the graphemic baseline while it performs as accurate as the phoneme-based baseline with a slight increase in WER.
  • Keywords
    speech processing; speech recognition; Arabic chat alphabet; Egyptian colloquial Arabic; WER; dialectal Arabic speech transcription; dialectal arabic speech recognition; graphemic baseline; natural chat alphabet orthography; natural language; phoneme-based baseline; rapid phonetic transcription; Acoustics; Approximation methods; Hidden Markov models; Speech; Speech recognition; Training; Writing; Arabic; acoustic modeling; chat alphabet; phonetic transcription; speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
  • Conference_Location
    Prague
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4577-0538-0
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2011.5947463
  • Filename
    5947463