مرکز منطقه ای اطلاع رساني علوم و فناوري - Joint distributional modeling with cross-correlation based features

DocumentCode :

2979152

Title :

Joint distributional modeling with cross-correlation based features

Author :

Bilmes, Jeff A.

Author_Institution :

Int. Comput. Sci. Inst., Berkeley, CA, USA

fYear :

1997

fDate :

14-17 Dec 1997

Firstpage :

148

Lastpage :

155

Abstract :

In maximum likelihood based speech recognition systems, it is important to accurately estimate the joint distribution of feature vectors given a particular acoustic model. We propose that by modeling the joint distribution of time localized feature vectors and statistics relating those time localized feature vectors to the relevant acoustic context, we can estimate information contained in the feature vector joint distribution without the accompanying theoretical or computational difficulties. We introduce the modcrossgram (MCG), a computational way of estimating short time spectro temporal correlation based statistics that are informative about the feature vector joint distribution. Using the standard hybrid ANN/HMM architecture, we compare a MCG based speech recognition system with a more traditional one on an isolated word speech database. We show that, in the presence of noise, the MCG based system achieves a significant reduction in word error rate over the standard system

Keywords :

hidden Markov models; maximum likelihood detection; neural nets; speech recognition; statistical analysis; MCG based speech recognition system; acoustic context; acoustic model; cross correlation based features; feature vector joint distribution; feature vectors; isolated word speech database; joint distributional modeling; maximum likelihood based speech recognition systems; modcrossgram; short time spectro temporal correlation based statistics; standard hybrid ANN/HMM architecture; time localized feature vectors; word error rate; Computer architecture; Context modeling; Distributed computing; Error analysis; Hidden Markov models; Maximum likelihood estimation; Noise reduction; Spatial databases; Speech recognition; Statistical distributions;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Automatic Speech Recognition and Understanding, 1997. Proceedings., 1997 IEEE Workshop on

Conference_Location :

Santa Barbara, CA

Print_ISBN :

0-7803-3698-4

Type :

conf

DOI :

10.1109/ASRU.1997.658999

Filename :

658999

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2979152