Title :
On-line learning of language models with word error probability distributions
Author :
Gretter, Roberto ; Riccardi, Giuseppe
Author_Institution :
Trento Univ., Italy
Abstract :
We are interested in the problem of learning stochastic language models on-line (without speech transcriptions) for adaptive speech recognition and understanding. We propose an algorithm to adapt to variations in the language model distributions based on speech input only and without its true transcription. The on-line probability estimate is defined. as a function of the prior and word error distributions. We show the effectiveness of word-lattice based error probability distributions in terms of receiver operating characteristics (ROC) curves and word accuracy. We apply the new estimates Padapt (w) to the task of adapting on-line an initial large vocabulary trigram language model and show improvement in word accuracy with respect to the baseline speech recognizer
Keywords :
error statistics; natural languages; probability; speech recognition; unsupervised learning; ROC curves; adaptive speech recognition; baseline speech recognizer; large vocabulary trigram language model; online learning; receiver operating characteristics curves; speech understanding; stochastic language models; word accuracy; word error probability distributions; Accuracy; Error analysis; Error probability; Lattices; Natural languages; Probability distribution; Speech recognition; Stochastic processes; Topology; Vocabulary;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2001. Proceedings. (ICASSP '01). 2001 IEEE International Conference on
Conference_Location :
Salt Lake City, UT
Print_ISBN :
0-7803-7041-4
DOI :
10.1109/ICASSP.2001.940892