English to Japanese spoken lecture translation system by using DNN-HMM and phrase-based SMT

Author

Norioki Goto;Kazumasa Yamamoto;Seiichi Nakagawa

Author_Institution

Toyohashi University of Technology, Tenpaku-cho, Toyohashi, Aichi, 441-8580, Japan

fYear

2015

Firstpage

1

Lastpage

6

Abstract

This paper presents our scheme to translate spoken English lectures into Japanese that consists of an English automatic speech recognition system (ASR) that utilizes a deep neural network (DNN) and an English to Japanese phrase-based statistical machine translation system (SMT). We utilized an existing Wall Street Journal corpus for our acoustic model and adapted it with MIT OpenCourseWare lectures whose transcriptions we also utilized to create our language model. For the parallel corpus of our SMT system, we used TED Talks and Japanese News Article Alignment Data. Our ASR system achieved a word error rate (WER) of 21.0%, and our SMT system achieved a 3-gram base bilingual evaluation understudy (BLEU) of 16.8 for text input and 14.6 for speech input, respectively. These scores outperformed our previous system : WER = 32.1% and BLEU = 11.0.

Keywords

"Hidden Markov models","Data models","Adaptation models","Acoustics","Speech","Speech recognition","Computational modeling"

Publisher

ieee

Conference_Titel

Advanced Informatics: Concepts, Theory and Applications (ICAICTA), 2015 2nd International Conference on

Print_ISBN

978-1-4673-8142-0

Type

conf

DOI

10.1109/ICAICTA.2015.7335357

Filename

7335357