• DocumentCode
    394236
  • Title

    Language modeling and transcription of the TED corpus lectures

  • Author

    Leeuwis, Erwin ; Federico, Murcello ; Cettolo, Mauro

  • Author_Institution
    Dept. of Comput. Sci., Twente Univ., Enschede, Netherlands
  • Volume
    1
  • fYear
    2003
  • fDate
    6-10 April 2003
  • Abstract
    Transcribing lectures is a challenging task, both in acoustic and in language modeling. In this work, we present our first results on the automatic transcription of lectures from the TED corpus, recently released by ELRA and LDC. In particular, we concentrated our effort on language modeling. Baseline acoustic and language models were developed using respectively 8 hours of TED transcripts and various types of texts: conference proceedings, lecture transcripts, and conversational speech transcripts. Then, adaptation of the language model to single speakers was investigated by exploiting different kinds of information: automatic transcripts of the talk, the title of the talk, the abstract and, finally, the paper. In the last case, a 39.2% WER was achieved.
  • Keywords
    linguistics; natural languages; speech recognition; TED corpus; abstract; acoustic modeling; automatic transcription; automatic transcripts; conference proceedings; conversational speech transcripts; language modeling; lecture transcripts; paper; title; Acoustic testing; Broadcasting; Computer science; Conference proceedings; Content based retrieval; Loudspeakers; Maximum likelihood linear regression; Natural languages; Software libraries; Speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-7663-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.2003.1198760
  • Filename
    1198760