DocumentCode :
2979152
Title :
Joint distributional modeling with cross-correlation based features
Author :
Bilmes, Jeff A.
Author_Institution :
Int. Comput. Sci. Inst., Berkeley, CA, USA
fYear :
1997
fDate :
14-17 Dec 1997
Firstpage :
148
Lastpage :
155
Abstract :
In maximum-likelihood-based speech recognition systems, it is important to accurately estimate the joint distribution of feature vectors given a particular acoustic model. We propose that by modeling the joint distribution of time-localized feature vectors and statistics relating those time-localized feature vectors to the relevant acoustic context, we can estimate information contained in the feature vector joint distribution without the accompanying theoretical or computational difficulties. We introduce the modcrossgram (MCG), a computational way of estimating short-time spectro-temporal correlation-based statistics that are informative about the feature vector joint distribution. Using the standard hybrid ANN/HMM architecture, we compare an MCG-based speech recognition system with a more traditional one on an isolated-word speech database. We show that, in the presence of noise, the MCG-based system achieves a significant reduction in word error rate over the standard system.
Keywords :
hidden Markov models; maximum likelihood detection; neural nets; speech recognition; statistical analysis; MCG based speech recognition system; acoustic context; acoustic model; cross correlation based features; feature vector joint distribution; feature vectors; isolated word speech database; joint distributional modeling; maximum likelihood based speech recognition systems; modcrossgram; short time spectro temporal correlation based statistics; standard hybrid ANN/HMM architecture; time localized feature vectors; word error rate; Computer architecture; Context modeling; Distributed computing; Error analysis; Hidden Markov models; Maximum likelihood estimation; Noise reduction; Spatial databases; Speech recognition; Statistical distributions;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Automatic Speech Recognition and Understanding, 1997. Proceedings., 1997 IEEE Workshop on
Conference_Location :
Santa Barbara, CA
Print_ISBN :
0-7803-3698-4
Type :
conf
DOI :
10.1109/ASRU.1997.658999
Filename :
658999