DocumentCode :
3215105
Title :
Improved speech recognition using discriminative integration of multiple local classifiers in lattice rescoring
Author :
Huang, Hao ; Li, Bing Hu
Author_Institution :
Dept. of Inf. Sci. & Eng., Xinjiang Univ., Urumqi, China
Volume :
4
fYear :
2011
fDate :
29-31 July 2011
Abstract :
Model combination is a popular technique to integrate several knowledge sources into automatic speech recognition for better system accuracy. In this paper, we report our recent work on the integration of the hidden Markov model based acoustic model, the multi-layer perceptron based phoneme classifier and Gaussian mixture model based tone classifier in lattice rescoring. Moreover, we use discriminative model weight training to tune the impact of the heterogeneous models according to different phonetic contexts for better model interpolation. Experimental results on continuous mandarin speech recognition show a 8.2% improvement can be obtained using the combination of the three models. We have also evaluated four context dependent weighting schemes using discriminative trained scaling factors. It is also shown by introducing left final dependent contexts, a 4.1% further recognition error reduction can be further obtained.
Keywords :
Gaussian processes; hidden Markov models; multilayer perceptrons; speech recognition; Gaussian mixture model; acoustic model; automatic speech recognition; discriminative integration; hidden Markov model; lattice rescoring; multilayer perceptron; multiple local classifiers; phoneme classifier; Computational modeling; Context; Hidden Markov models; Markov processes; Maximum likelihood estimation; Discriminative model combination; Multi-layer perceptron; minimum phone error; speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Electronics and Optoelectronics (ICEOE), 2011 International Conference on
Conference_Location :
Dalian
Print_ISBN :
978-1-61284-275-2
Type :
conf
DOI :
10.1109/ICEOE.2011.6013458
Filename :
6013458
Link To Document :
بازگشت