DocumentCode :
3696623
Title :
English to Japanese spoken lecture translation system by using DNN-HMM and phrase-based SMT
Author :
Norioki Goto;Kazumasa Yamamoto;Seiichi Nakagawa
Author_Institution :
Toyohashi University of Technology, Tenpaku-cho, Toyohashi, Aichi, 441-8580, Japan
fYear :
2015
Firstpage :
1
Lastpage :
6
Abstract :
This paper presents our scheme to translate spoken English lectures into Japanese that consists of an English automatic speech recognition system (ASR) that utilizes a deep neural network (DNN) and an English to Japanese phrase-based statistical machine translation system (SMT). We utilized an existing Wall Street Journal corpus for our acoustic model and adapted it with MIT OpenCourseWare lectures whose transcriptions we also utilized to create our language model. For the parallel corpus of our SMT system, we used TED Talks and Japanese News Article Alignment Data. Our ASR system achieved a word error rate (WER) of 21.0%, and our SMT system achieved a 3-gram base bilingual evaluation understudy (BLEU) of 16.8 for text input and 14.6 for speech input, respectively. These scores outperformed our previous system : WER = 32.1% and BLEU = 11.0.
Keywords :
"Hidden Markov models","Data models","Adaptation models","Acoustics","Speech","Speech recognition","Computational modeling"
Publisher :
ieee
Conference_Titel :
Advanced Informatics: Concepts, Theory and Applications (ICAICTA), 2015 2nd International Conference on
Print_ISBN :
978-1-4673-8142-0
Type :
conf
DOI :
10.1109/ICAICTA.2015.7335357
Filename :
7335357
Link To Document :
بازگشت