• DocumentCode
    3165905
  • Title

    A hybrid phonotactic language identification system with an SVM back-end for simultaneous lecture translation

  • Author

    Heck, Michael ; Stüker, Sebastian ; Waibel, Alex

  • Author_Institution
    Inst. for Anthropomatics, Karlsruhe Inst. of Technol., Karlsruhe, Germany
  • fYear
    2012
  • fDate
    25-30 March 2012
  • Firstpage
    4857
  • Lastpage
    4860
  • Abstract
    In this paper we describe our work in constructing a language identification system for use in our simultaneous lecture translation system. We first built PPR and PPRLM baseline systems that produce score-fusing language cue feature vectors for language discrimination and utilize an SVM back-end classifier for the actual language identification. On our bi-lingual lecture tasks the PPRLM system clearly outperforms the PPR system in various segment length conditions, however at the cost of slower run-time. By using lexical information in the form of keyword spotting, and additional language models we show ways to improve the performance of both baseline systems. In order to combine the faster run-time of the PPR system with the better performance of the PPRLM system we finally built a hybrid of both approaches that clearly outperforms the PPR system while not adding any additional computing time. This hybrid system is therefore our choice for the use in the lecture translation system due to its faster run-time and good performance.
  • Keywords
    language translation; natural language processing; speech processing; support vector machines; SVM backend classifier; additional language model; baseline system; bilingual lecture task; hybrid phonotactic language identification system; hybrid system; keyword spotting; language discrimination; lecture translation system; lexical information; score-fusing language cue feature vectors; simultaneous lecture translation; Acoustics; Computational modeling; Data models; Decoding; Hidden Markov models; Support vector machines; Training; language identification; lecture translation; speech translation; support vector machines;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
  • Conference_Location
    Kyoto
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4673-0045-2
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2012.6289007
  • Filename
    6289007