• DocumentCode
    1884700
  • Title

    A statistical observation model for noisy reverberant speech features and its application to robust ASR

  • Author

    Leutnant, Volker ; Krueger, A. ; Haeb-Umbach, Reinhold

  • Author_Institution
    Dept. of Commun. Eng., Univ. of Paderborn, Paderborn, Germany
  • fYear
    2012
  • fDate
    12-15 Aug. 2012
  • Firstpage
    142
  • Lastpage
    147
  • Abstract
    In this work, an observation model for the joint compensation of noise and reverberation in the logarithmic mel power spectral density domain is considered. It relates the features of the noisy reverberant speech to those of the non-reverberant speech and the noise. In contrast to enhancement of features only corrupted by reverberation (reverberant features), enhancement of noisy reverberant features requires a more sophisticated model for the error introduced by the proposed observation model. In a first consideration, it will be shown that this error is highly dependent on the instantaneous ratio of the power of reverberant speech to the power of the noise and, moreover, sensitive to the phase between reverberant speech and noise in the short-time discrete Fourier domain. Afterwards, a statistically motivated approach will be presented allowing for the model of the observation error to be inferred from the error model previously used for the reverberation only case. Finally, the developed observation error model will be utilized in a Bayesian feature enhancement scheme, leading to improvements in word accuracy on the AURORA5 database.
  • Keywords
    Bayes methods; compensation; discrete Fourier transforms; feature extraction; speech recognition; AURORA5 database; Bayesian feature enhancement; compensation; instantaneous ratio; logarithmic mel power spectral density domain; noisy reverberant speech features; nonreverberant speech; robust automatic speech recognition; short-time discrete Fourier domain; statistical observation model; word accuracy; Approximation methods; Bayesian methods; Noise; Noise measurement; Reverberation; Speech; Vectors; Bayesian feature enhancement; Robust Automatic Speech Recognition; observation model for reverberant and noisy speech;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing, Communication and Computing (ICSPCC), 2012 IEEE International Conference on
  • Conference_Location
    Hong Kong
  • Print_ISBN
    978-1-4673-2192-1
  • Type

    conf

  • DOI
    10.1109/ICSPCC.2012.6335731
  • Filename
    6335731