Title :
Incorporating semantic information to selection of web texts for language model of spoken dialogue system
Author :
Yoshino, Kohzoh ; Mori, Shinsuke ; Kawahara, Toshio
Author_Institution :
Sch. of Inf., Kyoto Univ., Kyoto, Japan
Abstract :
A novel text selection approach for training a language model (LM) with Web texts is proposed for automatic speech recognition (ASR) of spoken dialogue systems. Compared to the conventional approach based on perplexity criterion, the proposed approach introduces a semantic-level relevance measure with the back-end knowledge base used in the dialogue system. We focus on the predicate-argument (P-A) structure characteristic to the domain in order to filter semantically relevant sentences in the domain. Several choices of statistical models and combination methods with the perplexity measure are investigated in this paper. Experimental evaluations in two different domains demonstrate the effectiveness and generality of the proposed approach. The combination method realizes significant improvement not only in ASR accuracy but also in semantic and dialogue-level accuracy.
Keywords :
Internet; interactive systems; knowledge based systems; speech recognition; speech-based user interfaces; statistical analysis; text analysis; ASR accuracy; LM; P-A structure characteristic; Web text selection; automatic speech recognition; back-end knowledge base; combination methods; dialogue-level accuracy; language model; perplexity criterion; perplexity measure; predicate-argument structure characteristic; semantic accuracy; semantic information; semantic-level relevance measure; semantically relevant sentences; spoken dialogue system; statistical models; Accuracy; Adaptation models; Computational modeling; Semantics; Sports equipment; Training; Web sites; Language model; Web; predicate-argument structure; spoken dialogue system;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location :
Vancouver, BC
DOI :
10.1109/ICASSP.2013.6639274