Title :
Data sampling based ensemble acoustic modelling
Author :
Chen, Xin ; Zhao, Yunxin
Author_Institution :
Dept. of Comput. Sci., Univ. of Missouri, Columbia, MO
Abstract :
In this paper, we propose a novel technique of using cross validation (CV) data sampling to construct an ensemble of acoustic models for conversational speech recognition. We further propose using hierarchical Gaussian mixture model (HGMM) and repartition training data to increase the ensemble size and diversity. The proposed methods are found to work well together for ensemble acoustic modeling. We also evaluated the quality of the ensemble acoustic models by using the measures of classification margin, average correct score and variance of correct score. We have found that the ensemble of acoustic models increases the margin and the average correct score, and reduces the variance. We compared the performance of our proposed method with a recently reported method of CV expectation maximization (CVEM) for single acoustic models. Our experimental results on a telemedicine automatic captioning task showed that the proposed ensemble acoustic modeling has led to significant improvements in word recognition accuracy.
Keywords :
Gaussian processes; expectation-maximisation algorithm; speech recognition; telemedicine; word processing; CV expectation maximization; acoustic modeling; cross validation; data sampling; ensemble acoustic modelling; hierarchical Gaussian mixture model; speech recognition; telemedicine automatic captioning task; word recognition accuracy; Acoustic measurements; Computer science; Context modeling; Decision trees; Decoding; Hidden Markov models; Sampling methods; Speech recognition; Telemedicine; Training data; acoustic modeling; cross validation; data sampling; ensemble classifier; hierarchical mixture ensemble;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
Conference_Location :
Taipei
Print_ISBN :
978-1-4244-2353-8
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2009.4960456