• DocumentCode
    3696623
  • Title

    English to Japanese spoken lecture translation system by using DNN-HMM and phrase-based SMT

  • Author

    Norioki Goto;Kazumasa Yamamoto;Seiichi Nakagawa

  • Author_Institution
    Toyohashi University of Technology, Tenpaku-cho, Toyohashi, Aichi, 441-8580, Japan
  • fYear
    2015
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    This paper presents our scheme for translating spoken English lectures into Japanese. It consists of an English automatic speech recognition (ASR) system that utilizes a deep neural network (DNN) and an English-to-Japanese phrase-based statistical machine translation (SMT) system. We used an existing Wall Street Journal corpus for our acoustic model and adapted it with MIT OpenCourseWare lectures, whose transcriptions we also used to create our language model. For the parallel corpus of our SMT system, we used TED Talks and Japanese News Article Alignment Data. Our ASR system achieved a word error rate (WER) of 21.0%, and our SMT system achieved a 3-gram-based bilingual evaluation understudy (BLEU) score of 16.8 for text input and 14.6 for speech input. These scores outperform those of our previous system (WER = 32.1% and BLEU = 11.0).
  • Keywords
    "Hidden Markov models","Data models","Adaptation models","Acoustics","Speech","Speech recognition","Computational modeling"
  • Publisher
    IEEE
  • Conference_Titel
    Advanced Informatics: Concepts, Theory and Applications (ICAICTA), 2015 2nd International Conference on
  • Print_ISBN
    978-1-4673-8142-0
  • Type

    conf

  • DOI
    10.1109/ICAICTA.2015.7335357
  • Filename
    7335357