DocumentCode :
2789962
Title :
Data sampling ensemble acoustic modelling in speaker independent speech recognition
Author :
Chen, Xin ; Zhao, Yunxin
Author_Institution :
Dept. of Comput. Sci., Univ. of Missouri, Columbia, MO, USA
fYear :
2010
fDate :
14-19 March 2010
Firstpage :
5130
Lastpage :
5133
Abstract :
In this paper, we extend our recent data-sampling based ensemble acoustic modeling technique for the speaker-independent task of TIMIT and propose new methods to further improve the effectiveness of the ensemble acoustic models. We propose applying overlapped speaker clustering in data sampling to construct an ensemble of acoustic models for speaker independent speech recognition. In addition, we evaluate the method of data sampling in recurrent neural network for constructing a RNN based frame classifier. We also investigate using CVEM in place of EM in our ensemble acoustic model training. By using these methods on the speaker independent TIMIT phone recognition task, we have obtained a 2.5% absolute gain on phone accuracy over a standard HMM baseline system.
Keywords :
hidden Markov models; pattern clustering; recurrent neural nets; speech recognition; CVEM; HMM baseline system; TIMIT; acoustic model training; data sampling ensemble acoustic modelling; overlapped speaker clustering; phone recognition task; recurrent neural network; speaker independent speech recognition; Computer science; Decoding; Diversity reception; Hidden Markov models; Loudspeakers; Radio frequency; Recurrent neural networks; Sampling methods; Speech recognition; Training data; data sampling; ensemble acoustic modeling; recurrent neural network; speaker adaptation; speaker overlapped clustering;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
Conference_Location :
Dallas, TX
ISSN :
1520-6149
Print_ISBN :
978-1-4244-4295-9
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2010.5495029
Filename :
5495029
Link To Document :
بازگشت