Title :
Impact of Web based language modeling on speech understanding
Author :
Sarikaya, Ruhi ; Kuo, Hong-Kwang Jeff ; Gao, Yuqing
Author_Institution :
IBM Thomas J. Watson Res. Center, Yorktown Heights, NY
Abstract :
Data sparseness in building statistical language models for spoken dialog systems is a critical problem. In a previous paper we addressed this issue by exploiting the World Wide Web (WWW) and other external data sources in a financial transaction domain. In this paper, we evaluate the impact of improved speech recognition due to Web-based language model (WebLM) on the speech understanding performance in a new domain. As speech understanding system we use a natural language call-routing system. Experimental results show that the WebLM improves the speech recognition performance by 1.7% to 2.7% across varying amounts of in-domain data. The improvements in action classification (AC) performance were modest yet consistent ranging from 0.3% to 0.8%
Keywords :
Internet; interactive systems; natural languages; speech recognition; speech synthesis; Web based language modeling; World Wide Web; data sparseness; financial transaction domain; natural language call-routing system; speech recognition; speech understanding; spoken dialog systems; statistical language models; Acoustic applications; Acoustic testing; Information retrieval; Natural languages; Robustness; Speech analysis; Speech recognition; System performance; Web sites; World Wide Web;
Conference_Titel :
Automatic Speech Recognition and Understanding, 2005 IEEE Workshop on
Conference_Location :
San Juan
Print_ISBN :
0-7803-9478-X
Electronic_ISBN :
0-7803-9479-8
DOI :
10.1109/ASRU.2005.1566477