• DocumentCode
    454729
  • Title

    Robust Speech Recognition From Noise-Type Based Feature Compensation and Model Interpolation in a Multiple Model Framework

  • Author

    Xu, Haitian ; Tan, Zheng-Hua ; Dalsgaard, Paul ; Lindberg, Børge

  • Author_Institution
    Center for TeleInFrastructure, Aalborg Univ.
  • Volume
    1
  • fYear
    2006
  • fDate
    14-19 May 2006
  • Abstract
    Compared to multi-condition training (MTR), condition-dependent training generates multiple acoustic hidden Markov model sets each identified by a noisy environment and is known to perform substantially better for known noise types (included in training) while worse for unknown (untrained) noise types. This paper attempts to bridge the performance gap between known and unknown noise types by introducing a minimum mean-square error (MMSE) noise-type based compensation algorithm. On the basis of a modified vector Taylor series and the measurement of feature reliability as well as noise similarity, the MMSE estimation adapts the test features corrupted by the unknown noise type to the corresponding features corrupted by the known noise type. This method significantly improves the recognition performance for unknown noise types while maintaining the good performance for known noise types. Furthermore, in order to benefit directly from MTR, a model interpolation strategy is investigated which combines the MTR and the condition-dependent model sets. Both good performance and low computational cost are achieved by only interpolating the mixtures of each condition-dependent model state with the least weighted mixture in the corresponding MTR model state. The overall system gives promising results
  • Keywords
    interpolation; least mean squares methods; series (mathematics); speech recognition; MMSE; condition-dependent model sets; minimum mean-square error; model interpolation; modified vector Taylor series; multicondition training; noise-type based feature compensation; robust speech recognition; Acoustic noise; Bridges; Hidden Markov models; Interpolation; Noise generators; Noise measurement; Noise robustness; Speech recognition; Taylor series; Working environment noise;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on
  • Conference_Location
    Toulouse
  • ISSN
    1520-6149
  • Print_ISBN
    1-4244-0469-X
  • Type

    conf

  • DOI
    10.1109/ICASSP.2006.1660227
  • Filename
    1660227