• DocumentCode
    178722
  • Title

    The RWTH English lecture recognition system

  • Author

    Wiesler, Simon ; Irie, Kazuki ; Tuske, Zoltan ; Schluter, Ralf ; Ney, Hermann

  • Author_Institution
    Comput. Sci. Dept., RWTH Aachen Univ., Aachen, Germany
  • fYear
    2014
  • fDate
    4-9 May 2014
  • Firstpage
    3286
  • Lastpage
    3290
  • Abstract
    In this paper, we describe the RWTH speech recognition system for English lectures developed within the Translectures project. A difficulty in the development of an English lectures recognition system, is the high ratio of non-native speakers. We address this problem by using very effective deep bottleneck features trained on multilingual data. The acoustic model is trained on large amounts of data from different domains and with different dialects. Large improvements are obtained from unsupervised acoustic adaptation. Another challenge is the frequent use of technical terms and the wide range of topics. In our recognition system, slides, which are attached to most lectures, are used for improving lexical coverage and language model adaptation.
  • Keywords
    natural language processing; speech recognition; RWTH English lecture recognition system; RWTH speech recognition system; acoustic model; bottleneck features; language model adaptation; lexical coverage; multilingual data; nonnative speakers; translectures project; unsupervised acoustic adaptation; Acoustics; Adaptation models; Feature extraction; Hidden Markov models; Speech; Training; Training data; LVCSR; lecture recognition; speech recognition system;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
  • Conference_Location
    Florence
  • Type

    conf

  • DOI
    10.1109/ICASSP.2014.6854208
  • Filename
    6854208