DocumentCode
2176685
Title
Rapid phonetic transcription using everyday life natural Chat Alphabet orthography for dialectal Arabic speech recognition
Author
Elmahdy, Mohamed ; Gruhn, Rainer ; Abdennadher, Slim ; Minker, Wolfgang
Author_Institution
Fac. of Eng. & Comput. Sci., Univ. of Ulm, Ulm, Germany
fYear
2011
fDate
22-27 May 2011
Firstpage
4936
Lastpage
4939
Abstract
We propose the Arabic Chat Alphabet (ACA) as naturally written in everyday life for dialectal Arabic speech transcription. Our assumption is that ACA is a natural language that includes short vowels that are missing in traditional Arabic orthography. Furthermore, ACA transcriptions can be rapidly prepared. Egyptian Colloquial Arabic was chosen as a typical dialect. Two speech recognition baselines were built: phonemic and graphemic. Original transcriptions were re-written in ACA by different transcribers. Ambiguous ACA sequences were handled by automatically generating all possible variants. ACA variations across transcribers were modeled by phonemes normalization and merging. Results show that the ACA-based approach outperforms the graphemic baseline while it performs as accurate as the phoneme-based baseline with a slight increase in WER.
Keywords
speech processing; speech recognition; Arabic chat alphabet; Egyptian colloquial Arabic; WER; dialectal Arabic speech transcription; dialectal arabic speech recognition; graphemic baseline; natural chat alphabet orthography; natural language; phoneme-based baseline; rapid phonetic transcription; Acoustics; Approximation methods; Hidden Markov models; Speech; Speech recognition; Training; Writing; Arabic; acoustic modeling; chat alphabet; phonetic transcription; speech recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location
Prague
ISSN
1520-6149
Print_ISBN
978-1-4577-0538-0
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2011.5947463
Filename
5947463
Link To Document