Title :
Data sampling ensemble acoustic modelling in speaker independent speech recognition
Author :
Chen, Xin ; Zhao, Yunxin
Author_Institution :
Dept. of Comput. Sci., Univ. of Missouri, Columbia, MO, USA
Abstract :
In this paper, we extend our recent data-sampling based ensemble acoustic modeling technique for the speaker-independent task of TIMIT and propose new methods to further improve the effectiveness of the ensemble acoustic models. We propose applying overlapped speaker clustering in data sampling to construct an ensemble of acoustic models for speaker independent speech recognition. In addition, we evaluate the method of data sampling in recurrent neural network for constructing a RNN based frame classifier. We also investigate using CVEM in place of EM in our ensemble acoustic model training. By using these methods on the speaker independent TIMIT phone recognition task, we have obtained a 2.5% absolute gain on phone accuracy over a standard HMM baseline system.
Keywords :
hidden Markov models; pattern clustering; recurrent neural nets; speech recognition; CVEM; HMM baseline system; TIMIT; acoustic model training; data sampling ensemble acoustic modelling; overlapped speaker clustering; phone recognition task; recurrent neural network; speaker independent speech recognition; Computer science; Decoding; Diversity reception; Hidden Markov models; Loudspeakers; Radio frequency; Recurrent neural networks; Sampling methods; Speech recognition; Training data; data sampling; ensemble acoustic modeling; recurrent neural network; speaker adaptation; speaker overlapped clustering;
Conference_Titel :
Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
Conference_Location :
Dallas, TX
Print_ISBN :
978-1-4244-4295-9
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2010.5495029