DocumentCode
394237
Title
Unsupervised class-based language model adaptation for spontaneous speech recognition
Author
Yokoyama, Z. ; Shinozaki, Tetsuo ; Iwano, K. ; Furui, S.
Author_Institution
Dept. of Comput. Sci., Tokyo Inst. of Technol., Japan
Volume
1
fYear
2003
fDate
6-10 April 2003
Abstract
This paper proposes an unsupervised, batch-type, class-based language model adaptation method for spontaneous speech recognition. The word classes are automatically determined by maximizing the average mutual information between the classes using a training set. A class-based language model is built based on recognition hypotheses obtained using a general word-based language model, and linearly interpolated with the general language model. All the input utterances are re-recognized using the adapted language model. The proposed method was applied to the recognition of spontaneous presentations and was found to be effective in improving the recognition accuracy for all the presentations. The best condition was found to be using 100 word classes, and in this condition 2.3% of the absolute value improvement in the word accuracy averaged over all the speakers was achieved.
Keywords
linguistics; natural languages; speech recognition; unsupervised learning; average mutual information; batch-type class-based language model adaptation method; spontaneous presentations; spontaneous speech recognition; training set; unsupervised class-based language model adaptation; word-based language model; Adaptation model; Art; Computer science; Error analysis; Loudspeakers; Mutual information; Natural languages; Speech recognition; Text recognition; Training data;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
ISSN
1520-6149
Print_ISBN
0-7803-7663-3
Type
conf
DOI
10.1109/ICASSP.2003.1198761
Filename
1198761
Link To Document