DocumentCode
353700
Title
A unified context-free grammar and n-gram model for spoken language processing
Author
Wang, Ye-Yi ; Mahajan, Milind ; Huang, Xuedong
Author_Institution
Speech Technol. Group, Microsoft Corp., Redmond, WA, USA
Volume
3
fYear
2000
fDate
2000
Firstpage
1639
Abstract
While context-free grammars (CFGs) remain one of the most important formalisms for interpreting natural language, word n-gram models are surprisingly powerful for domain-independent applications. We propose to unify these two formalisms for both speech recognition and spoken language understanding (SLU). With portability as the major problem, we incorporated domain-specific CFGs into a domain-independent n-gram model that can improve the generalizability of the CFG and the specificity of the n-gram. In our experiments, the unified model can significantly reduce the test set perplexity from 378 to 90 in comparison with a domain-independent word trigram. The unified model converges well when domain-specific data becomes available. The perplexity can be further reduced from 90 to 65 with a limited amount of domain-specific data. While we have demonstrated excellent portability, the full potential of our approach lies in its unified recognition and understanding, which we are investigating.
Keywords
context-free grammars; natural languages; nomograms; speech processing; speech recognition; convergence; domain-independent applications; domain-independent n-gram model; domain-independent word trigram; domain-specific context-free grammars; generalizability; natural language interpretation; portability; specificity; spoken language processing; spoken language understanding; test set perplexity; unified model; word n-gram models; Context modeling; Decoding; Equations; Natural languages; Predictive models; Signal generators; Speech processing; Speech recognition; Testing; Unified modeling language
fLanguage
English
Publisher
IEEE
Conference_Titel
Acoustics, Speech, and Signal Processing, 2000. ICASSP '00. Proceedings. 2000 IEEE International Conference on
Conference_Location
Istanbul
ISSN
1520-6149
Print_ISBN
0-7803-6293-4
Type
conf
DOI
10.1109/ICASSP.2000.862062
Filename
862062