• DocumentCode
    1749657
  • Title
    Continuous speech recognition under non-stationary musical environments based on speech state transition model
  • Author
    Fujimoto, M. ; Ariki, Y.
  • Author_Institution
    Dept of Electron. & Inf., Ryukoku Univ., Shiga, Japan
  • Volume
    1
  • fYear
    2001
  • fDate
    2001
  • Firstpage
    297
  • Abstract
    We propose a non-stationary noise reduction method based on a speech state transition model. The method estimates the speech signal in non-stationary noisy environments, such as speech with background music, by applying the speech state transition model to Kalman filtering estimation. The speech state transition model represents the state transition of the speech component in non-stationary noisy speech and is modeled using a Taylor expansion. Within this model, the state transition of the noise component is estimated by linear predictive estimation. To evaluate the method, we carried out large vocabulary continuous speech recognition experiments with three types of background music and compared its word accuracy rate with that of the conventional parallel model combination (PMC) method. The proposed method achieved a higher word accuracy rate than PMC.
  • Keywords
    Kalman filters; filtering theory; music; noise; prediction theory; speech recognition; state estimation; Kalman filtering estimation; Taylor expansion; continuous speech recognition; large vocabulary continuous speech recognition; linear predictive estimation; nonstationary musical environments; nonstationary noise reduction method; nonstationary noisy environments; nonstationary noisy speech; parallel model combination method; speech signal estimation; speech state transition model; word accuracy rate; Background noise; Filtering; Noise reduction; Predictive models; Speech enhancement; Taylor series; Working environment noise
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2001. Proceedings. (ICASSP '01). 2001 IEEE International Conference on
  • Conference_Location
    Salt Lake City, UT
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-7041-4
  • Type
    conf
  • DOI
    10.1109/ICASSP.2001.940826
  • Filename
    940826