Title :
Task adaptation in stochastic language models for continuous speech recognition
Author :
Matsunaga, Shoichi ; Yamada, Tomokazu ; Shikano, Kiyohiro
Author_Institution :
NTT Human Interface Labs., Tokyo, Japan
Abstract :
The authors describe two approaches for adapting a specific syllable trigram model to a new task. One uses a small amount of text data similar to the target task, and the other uses supervised learning using the most recent input phrases. The effect of each adaptation is verified with syllable perplexity and phrase recognition. Where the syntactic knowledge was only the syllable trigram model, the perplexity was reduced from 54.5 to 18.1 for the adaptation using 100 phrases of similar text, and was reduced to 14.6 by the supervised learning. The recognition rates were also improved from 42.3% to 46.6% and 50.9%, respectively. Text similarity for speech recognition is also studied
Keywords :
grammars; speech recognition; stochastic processes; continuous speech recognition; input phrases; phrase recognition; stochastic language models; supervised learning; syllable perplexity; syllable trigram model; syntactic knowledge; task adaptation; text data; text similarity; Databases; Humans; Natural languages; Probability; Speech recognition; Stochastic processes; Supervised learning; Target recognition; Text recognition; Vocabulary;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1992. ICASSP-92., 1992 IEEE International Conference on
Conference_Location :
San Francisco, CA
Print_ISBN :
0-7803-0532-9
DOI :
10.1109/ICASSP.1992.225946