DocumentCode
3163717
Title
Google´s cross-dialect Arabic voice search
Author
Biadsy, Fadi ; Moreno, Pedro J. ; Jansche, Martin
Author_Institution
Google Inc., New York, NY, USA
fYear
2012
fDate
25-30 March 2012
Firstpage
4441
Lastpage
4444
Abstract
We present a large scale effort to build a commercial Automatic Speech Recognition (ASR) product for Arabic. Our goal is to support voice search, dictation, and voice control for the general Arabic-speaking public, including support for multiple Arabic dialects. We describe our ASR system design and compare recognizers for five Arabic dialects, with the potential to reach more than 125 million people in Egypt, Jordan, Lebanon, Saudi Arabia, and the United Arab Emirates (UAE). We compare systems built on diacritized vs. non-diacritized text. We also conduct cross-dialect experiments, where we train on one dialect and test on the others. Our average word error rate (WER) is 24.8% for voice search.
Keywords
error statistics; natural languages; search engines; speech recognition; Arabic dialects; Google crossdialect Arabic voice search; automatic speech recognition; dictation; nondiacritized text; voice control; word error rate; Acoustics; Speech; Speech recognition; System analysis and design; Testing; Training; Arabic; Speech Recognition; Voice Search;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location
Kyoto
ISSN
1520-6149
Print_ISBN
978-1-4673-0045-2
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2012.6288905
Filename
6288905
Link To Document