DocumentCode
134287
Title
Building an ensemble of CD-DNN-HMM acoustic model using random forests of phonetic decision trees
Author
Tuo Zhao ; Yunxin Zhao ; Xin Chen
Author_Institution
Dept. of Comput. Sci., Univ. of Missouri, Columbia, MO, USA
fYear
2014
fDate
12-14 Sept. 2014
Firstpage
98
Lastpage
102
Abstract
We propose an RF-PDT+CD-DNN approach to generate an ensemble of context-dependent pre-trained deep neural networks (CD-DNNs) using random forests of phonetic decision trees (RF-PDTs) and constructing a CD-DNN-HMM-based ensemble acoustic model (EAM). We present evaluation results on the TIMIT dataset and a telemedicine automatic captioning dataset and demonstrate that the proposed RF-PDT+CD-DNN based EAM significantly outperforms the CD-DNN based single acoustic model (SAM) in phone and word recognition accuracies.
Keywords
decision trees; neural nets; speech recognition; telemedicine; CD-DNN-HMM acoustic model ensemble; EAM; RF-PDT+CD-DNN; SAM; TIMIT dataset; context-dependent pretrained deep neural networks; phone recognition accuracies; phonetic decision trees; random forests; random forests of phonetic decision trees; single acoustic model; telemedicine automatic captioning dataset; word recognition accuracies; deep neural network; discriminative pre-training; ensemble acoustic model; phonetic decision tree; random forest; single acoustic model;
fLanguage
English
Publisher
ieee
Conference_Titel
Chinese Spoken Language Processing (ISCSLP), 2014 9th International Symposium on
Conference_Location
Singapore
Type
conf
DOI
10.1109/ISCSLP.2014.6936680
Filename
6936680
Link To Document