Title :
Recognition of multilingual speech in mobile applications
Author :
Lin, Hui ; Huang, Jui-ting ; Beaufays, Françoise ; Strope, Brian ; Sung, Yun-hsuan
Abstract :
We evaluate different architectures to recognize multilingual speech for real-time mobile applications. In particular, we show that combining the results of several recognizers greatly outperforms other solutions such as training a single large multilingual system or using an explicit language identification system to select the appropriate recognizer. Experiments are conducted on a trilingual English-French-Mandarin mobile speech task. The data set includes Google searches, Maps queries, as well as more general inputs such as email and short message dictation. Without pre-specifying the input language, the combined system achieves comparable accuracy to that of the monolingual systems when the input language is known. The combined system is also roughly 5% absolute better than an explicit language identification approach, and 10% better than a single large multilingual system.
Keywords :
data analysis; mobile communication; speech recognition; Google searches; data set; email; explicit language identification system; general inputs; maps queries; monolingual systems; multilingual speech recognition; realtime mobile applications; short message dictation; trilingual English-French-Mandarin mobile speech task; Accuracy; Hidden Markov models; Mobile communication; Speech; Speech recognition; Support vector machines; Training; Multilingual speech recognition; acoustic modeling;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location :
Kyoto
Print_ISBN :
978-1-4673-0045-2
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2012.6289013