Increasing speech recognition robustness with HMM2

Author

Weber, Katrin ; Bengio, Samy ; Bourlard, Herve

Volume

fYear

2002

fDate

13-17 May 2002

Abstract

The purpose of this paper is to investigate the behavior of HMM2 models for the recognition of noisy speech. It has previously been shown that HMM2 is able to model dynamically important structural information inherent in the speech signal, often corresponding to formant positions/tracks. As formant regions are known to be robust in adverse conditions, HMM2 seems particularly promising for improving speech recognition robustness. Here, we review different variants of the HMM2 approach with respect to their application to noise-robust automatic speech recognition. It is shown that HMM2 has the potential to tackle the problem of mismatch between training and testing conditions, and that a multi-stream combination of (already noise-robust) cepstral features and formant-like features (extracted by HMM2) improves the noise robustness of a state-of-the-art automatic speech recognition system.

Keywords

Decoding; Hidden Markov models; Robustness; Signal to noise ratio; Solids; Variable speed drives;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on

Conference_Location

Orlando, FL, USA

ISSN

1520-6149

Print_ISBN

0-7803-7402-9

Type

conf

DOI

10.1109/ICASSP.2002.5743892

Filename

5743892

Link To Document

https://search.isc.ac/dl/search/defaultta.aspx?DTC=49&DC=542328