DocumentCode :
3424274
Title :
Fast Gaussian likelihood computation by maximum probability increase estimation for continuous speech recognition
Author :
Morales, Nicolás ; Gu, Liang ; Gao, Yuqing
Author_Institution :
HCTLab., Univ. Autonoma de Madrid, Madrid
fYear :
2008
fDate :
March 31 2008-April 4 2008
Firstpage :
4453
Lastpage :
4456
Abstract :
Speech signals are semi-stationary and speech features in neighboring frames are likely to share similar Gaussian distributions. A fast Gaussian computation algorithm is hence proposed to speed up the computation of the N-best posterior probabilities based on a large set of Gaussian distributions for the task of large vocabulary continuous speech recognition. The maximum probability increase between the current speech frame and a previous reference frame is estimated for all Gaussian distributions in order to reduce explicit computations of posteriors for a large number of Gaussians. The method was applied to the fMPE front-end of IBM´s state-of-the-art speech recognizer resulting a decoding speed-up of 40% in probability computation for a loss-less mode and more than 55% in an approximated implementation, respectively.
Keywords :
Gaussian distribution; probability; speech recognition; Gaussian distributions; fast Gaussian likelihood computation; iV-best posterior probabilities; large vocabulary continuous speech recognition; maximum probability increase estimation; speech signals; Adaptation model; Automatic speech recognition; Decoding; Distributed computing; Feature extraction; Gaussian distribution; Probability; Speech recognition; Viterbi algorithm; Vocabulary; Fast Gaussian computation; fMPE;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location :
Las Vegas, NV
ISSN :
1520-6149
Print_ISBN :
978-1-4244-1483-3
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2008.4518644
Filename :
4518644
Link To Document :
بازگشت