DocumentCode
3426598
Title
Automatic lecture transcription by exploiting presentation slide information for language model adaptation
Author
Kawahara, Tatsuya ; Nemoto, Yusuke ; Akita, Yuya
Author_Institution
Acad. Center for Comput. & Media Studies Sakyo-ku, Kyoto Univ., Kyoto
fYear
2008
fDate
March 31 2008-April 4 2008
Firstpage
4929
Lastpage
4932
Abstract
The paper addresses language model adaptation for automatic lecture transcription by fully exploiting presentation slide information used in the lecture. As the text in the presentation slides is small in its size and fragmentary in its content, a robust adaptation scheme is addressed by focusing on the keyword and topic information. Several methods are investigated and combined; first, global topic adaptation is conducted based on PLSA (probabilistic latent semantic analysis) using keywords appearing in all slides. Web text is also retrieved to enhance the relevant text. Then, local preference of the keywords are reflected with a cache model by referring to the slide used during each utterance. Experimental evaluations on real lectures show that the proposed method combining the global and local slide information achieves a significant improvement of recognition accuracy, especially in the detection rate of content keywords.
Keywords
speech recognition; Web text; automatic lecture transcription; automatic speech recognition; cache model; global adaptation; language model adaptation; local adaptation; presentation slide information; probabilistic latent semantic analysis; robust adaptation scheme; Adaptation model; Audio recording; Automatic speech recognition; Conducting materials; Deafness; Error analysis; Microphones; Natural languages; Robustness; Speech recognition; PLSA; cache model; language model; lectures; speech recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location
Las Vegas, NV
ISSN
1520-6149
Print_ISBN
978-1-4244-1483-3
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2008.4518763
Filename
4518763
Link To Document