• DocumentCode
    1690350
  • Title

    An uncertainty decoding approach to noise- and reverberation-robust speech recognition

  • Author

    Maas, R. ; Thippur, Akshaya ; Sehr, Armin ; Kellermann, Walter

  • Author_Institution
    Multimedia Commun. & Signal Process., Univ. of Erlangen-Nuremberg, Erlangen, Germany
  • fYear
    2013
  • Firstpage
    7388
  • Lastpage
    7392
  • Abstract
    The generic REMOS (REverberation MOdeling for robust Speech recognition) concept is extended in this contribution to cope with additional noise components. REMOS originally embeds an explicit reverberation model into a hiddenMarkov model (HMM) leading to a relaxed conditional independence assumption for the observed feature vectors. During recognition, a nonlinear optimization problem is to be solved in order to adapt the HMMs´ output probability density functions to the current reverberation conditions. The extension for additional noise components necessitates a modified numerical solver for the nonlinear optimization problem. We propose an approximation scheme based on continuous piecewise linear regression. Connected-digit recognition experiments demonstrate the potential of REMOS in reverberant and noisy environments. They furthermore reveal that the benefit of an explicit reverberation model, overcoming the conditional independence assumption, increases with increasing signal-to-noise-ratios.
  • Keywords
    approximation theory; hidden Markov models; optimisation; probability; regression analysis; reverberation; speech recognition; HMM; approximation scheme; connected-digit recognition; continuous piecewise linear regression; feature vector; generic REMOS; hidden Markov model; noise-robust speech recognition; nonlinear optimization problem; probability density function; reverberation modeling-robust speech recognition; signal-to-noise-ratio; uncertainty decoding approach; Adaptation models; Hidden Markov models; Noise; Optimization; Reverberation; Speech; Vectors; automatic speech recognition; noise robustness; piecewise linear regression; reverberation robustness; uncertainty decoding;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
  • Conference_Location
    Vancouver, BC
  • ISSN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2013.6639098
  • Filename
    6639098