DocumentCode
1742224
Title
A Markov random field model for automatic speech recognition
Author
Gravier, Guillaume ; Sigelle, Marc ; Chollet, Gérard
Author_Institution
ENST, Paris, France
Volume
3
fYear
2000
fDate
2000
Firstpage
254
Abstract
Speech can be represented as a time/frequency distribution of energy using a multiband filter bank. A Markov random field model, which takes into account the possible time asynchrony across the bands, is estimated for each segmental units to be recognized. The law of the speech process is given by a parametric Gibbs distribution and a maximum likelihood parameter estimation algorithm is developed. Experiments are conducted on an isolated word recognition problem. It is shown that similar performances are obtained with the new model and with standard HMM techniques in the mono-band case. In the multiband case, it is shown that modeling interband synchrony is an interesting approach to increase the performance when the number of bands increases
Keywords
filtering theory; hidden Markov models; maximum likelihood estimation; speech recognition; time-frequency analysis; HMM techniques; Markov random field model; automatic speech recognition; interband synchrony; isolated word recognition problem; maximum likelihood parameter estimation; multiband filter bank; parametric Gibbs distribution; speech process; speech representation; time asynchrony; time/frequency energy distribution; Additive noise; Automatic speech recognition; Cepstral analysis; Filter bank; Hidden Markov models; Markov random fields; Noise robustness; Parameter estimation; Speech enhancement; Speech recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Pattern Recognition, 2000. Proceedings. 15th International Conference on
Conference_Location
Barcelona
ISSN
1051-4651
Print_ISBN
0-7695-0750-6
Type
conf
DOI
10.1109/ICPR.2000.903533
Filename
903533
Link To Document