A Markov random field model for automatic speech recognition

Author

Gravier, Guillaume ; Sigelle, Marc ; Chollet, Gérard

Author_Institution

ENST, Paris, France

Volume

3

fYear

2000

fDate

2000

Firstpage

254

Abstract

Speech can be represented as a time/frequency distribution of energy using a multiband filter bank. A Markov random field model, which takes into account the possible time asynchrony across the bands, is estimated for each segmental units to be recognized. The law of the speech process is given by a parametric Gibbs distribution and a maximum likelihood parameter estimation algorithm is developed. Experiments are conducted on an isolated word recognition problem. It is shown that similar performances are obtained with the new model and with standard HMM techniques in the mono-band case. In the multiband case, it is shown that modeling interband synchrony is an interesting approach to increase the performance when the number of bands increases

Keywords

filtering theory; hidden Markov models; maximum likelihood estimation; speech recognition; time-frequency analysis; HMM techniques; Markov random field model; automatic speech recognition; interband synchrony; isolated word recognition problem; maximum likelihood parameter estimation; multiband filter bank; parametric Gibbs distribution; speech process; speech representation; time asynchrony; time/frequency energy distribution; Additive noise; Automatic speech recognition; Cepstral analysis; Filter bank; Hidden Markov models; Markov random fields; Noise robustness; Parameter estimation; Speech enhancement; Speech recognition;

fLanguage

English

Publisher

ieee

Conference_Titel

Pattern Recognition, 2000. Proceedings. 15th International Conference on

Conference_Location

Barcelona

ISSN

1051-4651

Print_ISBN

0-7695-0750-6

Type

conf

DOI

10.1109/ICPR.2000.903533

Filename

903533