Title :
Rapid phonetic transcription using everyday life natural Chat Alphabet orthography for dialectal Arabic speech recognition
Author :
Elmahdy, Mohamed ; Gruhn, Rainer ; Abdennadher, Slim ; Minker, Wolfgang
Author_Institution :
Fac. of Eng. & Comput. Sci., Univ. of Ulm, Ulm, Germany
Abstract :
We propose the Arabic Chat Alphabet (ACA) as naturally written in everyday life for dialectal Arabic speech transcription. Our assumption is that ACA is a natural language that includes short vowels that are missing in traditional Arabic orthography. Furthermore, ACA transcriptions can be rapidly prepared. Egyptian Colloquial Arabic was chosen as a typical dialect. Two speech recognition baselines were built: phonemic and graphemic. Original transcriptions were re-written in ACA by different transcribers. Ambiguous ACA sequences were handled by automatically generating all possible variants. ACA variations across transcribers were modeled by phonemes normalization and merging. Results show that the ACA-based approach outperforms the graphemic baseline while it performs as accurate as the phoneme-based baseline with a slight increase in WER.
Keywords :
speech processing; speech recognition; Arabic chat alphabet; Egyptian colloquial Arabic; WER; dialectal Arabic speech transcription; dialectal arabic speech recognition; graphemic baseline; natural chat alphabet orthography; natural language; phoneme-based baseline; rapid phonetic transcription; Acoustics; Approximation methods; Hidden Markov models; Speech; Speech recognition; Training; Writing; Arabic; acoustic modeling; chat alphabet; phonetic transcription; speech recognition;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location :
Prague
Print_ISBN :
978-1-4577-0538-0
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2011.5947463