Continuous speech recognition under non-stationary musical environments based on speech state transition model

Author

Fujimoto, M. ; Ariki, Y.

Author_Institution

Dept of Electron. & Inf., Ryukoku Univ., Shiga, Japan

Volume

1

fYear

2001

fDate

2001

Firstpage

297

Abstract

We propose a non-stationary noise reduction method based on the speech state transition model. Our proposed method estimates the speech signal under non-stationary noisy environments such as musical background by applying the speech state transition model to Kalman filtering estimation. The speech state transition model represents the state transition of the speech component in non-stationary noisy speech and is modeled by using Taylor expansion. In this model, the state transition of the noise component is estimated by using linear predictive estimation. In order to evaluate the proposed method, we carried out large vocabulary continuous speech recognition experiments under 3 types of music and compared the results with the conventional parallel model combination (PMC) method in word accuracy rate. As a result, the proposed method obtained a word accuracy rate that was superior to PMC

Keywords

Kalman filters; filtering theory; music; noise; prediction theory; speech recognition; state estimation; Kalman filtering estimation; Taylor expansion; continuous speech recognition; large vocabulary continuous speech recognition; linear predictive estimation; nonstationary musical environments; nonstationary noise reduction method; nonstationary noisy environments; nonstationary noisy speech; parallel model combination method; speech signal estimation; speech state transition model; word accuracy rate; Background noise; Filtering; Kalman filters; Noise reduction; Predictive models; Speech enhancement; Speech recognition; State estimation; Taylor series; Working environment noise;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics, Speech, and Signal Processing, 2001. Proceedings. (ICASSP '01). 2001 IEEE International Conference on

Conference_Location

Salt Lake City, UT

ISSN

1520-6149

Print_ISBN

0-7803-7041-4

Type

conf

DOI

10.1109/ICASSP.2001.940826

Filename

940826