• DocumentCode
    3426598
  • Title

    Automatic lecture transcription by exploiting presentation slide information for language model adaptation

  • Author

    Kawahara, Tatsuya ; Nemoto, Yusuke ; Akita, Yuya

  • Author_Institution
    Acad. Center for Comput. & Media Studies Sakyo-ku, Kyoto Univ., Kyoto
  • fYear
    2008
  • fDate
    March 31 2008-April 4 2008
  • Firstpage
    4929
  • Lastpage
    4932
  • Abstract
    The paper addresses language model adaptation for automatic lecture transcription by fully exploiting presentation slide information used in the lecture. As the text in the presentation slides is small in its size and fragmentary in its content, a robust adaptation scheme is addressed by focusing on the keyword and topic information. Several methods are investigated and combined; first, global topic adaptation is conducted based on PLSA (probabilistic latent semantic analysis) using keywords appearing in all slides. Web text is also retrieved to enhance the relevant text. Then, local preference of the keywords are reflected with a cache model by referring to the slide used during each utterance. Experimental evaluations on real lectures show that the proposed method combining the global and local slide information achieves a significant improvement of recognition accuracy, especially in the detection rate of content keywords.
  • Keywords
    speech recognition; Web text; automatic lecture transcription; automatic speech recognition; cache model; global adaptation; language model adaptation; local adaptation; presentation slide information; probabilistic latent semantic analysis; robust adaptation scheme; Adaptation model; Audio recording; Automatic speech recognition; Conducting materials; Deafness; Error analysis; Microphones; Natural languages; Robustness; Speech recognition; PLSA; cache model; language model; lectures; speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
  • Conference_Location
    Las Vegas, NV
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-1483-3
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2008.4518763
  • Filename
    4518763