DocumentCode :
3526799
Title :
Leveraging multiple query logs to improve language models for spoken query recognition
Author :
Li, Xiao ; Nguyen, Patrick ; Zweig, Geoffrey ; Bohus, Dan
Author_Institution :
Microsoft Res., Redmond, WA
fYear :
2009
fDate :
19-24 April 2009
Firstpage :
3713
Lastpage :
3716
Abstract :
A voice search system requires a speech interface that can correctly recognize spoken queries uttered by users. The recognition performance strongly relies on a robust language model. In this work, we present the use of multiple data sources, with the focus on query logs, in improving ASR language models for a voice search application. Our contributions are three folds: (1) the use of text queries from web search and mobile search in language modeling; (2) the use of web click data to predict query forms from business listing forms; and (3) the use of voice query logs in creating a positive feedback loop. Experiments show that by leveraging these resources, we can achieve recognition performance comparable to, or even better than, that of a previously deploy system where a large amount of spoken query transcripts are used in language modeling.
Keywords :
query processing; speech recognition; ASR language models; Web search; mobile search; multiple data sources; multiple query logs; positive feedback loop; speech interface; spoken query recognition; voice search system; Automatic speech recognition; Databases; Feedback loop; Natural languages; Predictive models; Robustness; Speech recognition; Text recognition; Training data; Web search; click data; language modeling; query log; voice search;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
Conference_Location :
Taipei
ISSN :
1520-6149
Print_ISBN :
978-1-4244-2353-8
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2009.4960433
Filename :
4960433
Link To Document :
بازگشت