DocumentCode :
1793636
Title :
Comparison of syllable-based and phoneme-based DNN-HMM in Japanese speech recognition
Author :
Seki, Hiroshi ; Yamamoto, Koji ; Nakagawa, Sachiko
Author_Institution :
Dept. of Comput. Sci. & Eng., Toyohashi Univ. of Technol., Toyohashi, Japan
fYear :
2014
fDate :
20-21 Aug. 2014
Firstpage :
249
Lastpage :
254
Abstract :
Japanese is syllabic language. Additionally we have studied syllable-based GMM-HMM for Japanese speech recognition. In this paper, we investigate the differences of recognition accuracy using phoneme/syllable-based GMM-HMM and DNN (Deep Neural Network)-HMM. First, we present a comparison of syllable-based and phoneme-based DNN-HMM. Second, we train the tied state left-context dependent syllable DNN-HMM, and compare these three types of modeling method. In the experiment, we obtained a 5% relative gain for WER using left-context syllable DNN-HMM in comparison with a left-context syllable GMM-HMM, and an 11% relative gain for WER using triphone DNN-HMM in comparison with a syllable-based DNN-HMM. Finally, we got results that modeling left-context phoneme has not worked and context independent syllable-based DNN-HMM got the best performance in the experiments, when applied to the ASJ+JNAS corpus, which consists of about 70 hours.
Keywords :
hidden Markov models; neural nets; speech recognition; ASJ+JNAS corpus; Japanese speech recognition; WER; context independent syllable; hidden Markov process; left-context dependent syllable; modeling method; phoneme-based DNN-HMM; recognition accuracy; syllabic language; triphone DNN-HMM; Context; Context modeling; Hidden Markov models; Neural networks; Rectifiers; Speech recognition; Training; DNN-HMM; GMM-HMM; deep neural network; phoneme; speech recognition; syllable;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Advanced Informatics: Concept, Theory and Application (ICAICTA), 2014 International Conference of
Conference_Location :
Bandung
Print_ISBN :
978-1-4799-6984-5
Type :
conf
DOI :
10.1109/ICAICTA.2014.7005949
Filename :
7005949
Link To Document :
بازگشت