DocumentCode
3696623
Title
English to Japanese spoken lecture translation system by using DNN-HMM and phrase-based SMT
Author
Norioki Goto;Kazumasa Yamamoto;Seiichi Nakagawa
Author_Institution
Toyohashi University of Technology, Tenpaku-cho, Toyohashi, Aichi, 441-8580, Japan
fYear
2015
Firstpage
1
Lastpage
6
Abstract
This paper presents our scheme to translate spoken English lectures into Japanese that consists of an English automatic speech recognition system (ASR) that utilizes a deep neural network (DNN) and an English to Japanese phrase-based statistical machine translation system (SMT). We utilized an existing Wall Street Journal corpus for our acoustic model and adapted it with MIT OpenCourseWare lectures whose transcriptions we also utilized to create our language model. For the parallel corpus of our SMT system, we used TED Talks and Japanese News Article Alignment Data. Our ASR system achieved a word error rate (WER) of 21.0%, and our SMT system achieved a 3-gram base bilingual evaluation understudy (BLEU) of 16.8 for text input and 14.6 for speech input, respectively. These scores outperformed our previous system : WER = 32.1% and BLEU = 11.0.
Keywords
"Hidden Markov models","Data models","Adaptation models","Acoustics","Speech","Speech recognition","Computational modeling"
Publisher
ieee
Conference_Titel
Advanced Informatics: Concepts, Theory and Applications (ICAICTA), 2015 2nd International Conference on
Print_ISBN
978-1-4673-8142-0
Type
conf
DOI
10.1109/ICAICTA.2015.7335357
Filename
7335357
Link To Document