Title :
Google´s cross-dialect Arabic voice search
Author :
Biadsy, Fadi ; Moreno, Pedro J. ; Jansche, Martin
Author_Institution :
Google Inc., New York, NY, USA
Abstract :
We present a large scale effort to build a commercial Automatic Speech Recognition (ASR) product for Arabic. Our goal is to support voice search, dictation, and voice control for the general Arabic-speaking public, including support for multiple Arabic dialects. We describe our ASR system design and compare recognizers for five Arabic dialects, with the potential to reach more than 125 million people in Egypt, Jordan, Lebanon, Saudi Arabia, and the United Arab Emirates (UAE). We compare systems built on diacritized vs. non-diacritized text. We also conduct cross-dialect experiments, where we train on one dialect and test on the others. Our average word error rate (WER) is 24.8% for voice search.
Keywords :
error statistics; natural languages; search engines; speech recognition; Arabic dialects; Google crossdialect Arabic voice search; automatic speech recognition; dictation; nondiacritized text; voice control; word error rate; Acoustics; Speech; Speech recognition; System analysis and design; Testing; Training; Arabic; Speech Recognition; Voice Search;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location :
Kyoto
Print_ISBN :
978-1-4673-0045-2
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2012.6288905