Title :
Deriving phrase-based language models
Author :
Heeman, Peter A. ; Damnati, Géraldine
Author_Institution :
Center for Spoken Language Understanding, Oregon Graduate Inst., Portland, OR, USA
Abstract :
Phrase-based language models have grown in popularity since they allow the speech recognition process to make use of more context in recognizing the words. Previous approaches have used perplexity reduction to identify groups of words to be linked into phrases and have used these phrases as the basis for computing the language model probabilities. In this paper, we argue that perplexity reduction is only one of three aspects to be considered in choosing the phrases. We also argue that the chosen phrases should not be the basis for computing the language model probabilities. Rather, the probabilities should be derived from a language model built at the lexical level
Keywords :
natural languages; probability; speech recognition; context-based word recognition; language model probabilities; lexical level; perplexity reduction; phrase-based language models; speech recognition process; word group identification; Acoustic measurements; Context modeling; Natural languages; Predictive models; Speech recognition; State-space methods; Vocabulary;
Conference_Titel :
Automatic Speech Recognition and Understanding, 1997. Proceedings., 1997 IEEE Workshop on
Conference_Location :
Santa Barbara, CA
Print_ISBN :
0-7803-3698-4
DOI :
10.1109/ASRU.1997.658975