  • DocumentCode
    2519143
  • Title
    Accounting for deterministic noise components in a MMSE STSA speech enhancement framework
  • Author
    McCallum, Matthew; Guillemin, Bernard
  • Author_Institution
    Dept. of Electr. & Comput. Eng., Univ. of Auckland, Auckland, New Zealand
  • fYear
    2012
  • fDate
    2-5 Oct. 2012
  • Firstpage
    169
  • Lastpage
    174
  • Abstract
    Current approaches to speech enhancement usually consider performance in the presence of either coloured broadband noise or periodic noise, but rarely both. In this paper we present a new speech enhancement technique, derived within the standard minimum mean-square error (MMSE) short-time spectral amplitude framework, which jointly compensates for both coloured broadband and periodic noise components. With this approach, termed noise mean subtracted (NMS) MMSE, hidden Markov model-based frequency tracking techniques are used to estimate periodic noise components and differentiate them from periodic components associated with the speech signal. The estimated periodic noise components are then removed using complex spectral subtraction. The resulting algorithm is evaluated using perceptual evaluation of speech quality (PESQ) for both male and female speech utterances from the NOIZEUS database. It is shown that when the noise contamination comprises both broadband and periodic components, the NMS MMSE algorithm outperforms the standard MMSE algorithm, which is derived under the assumption of stochastic noise only. The technique has application in a variety of scenarios, including emergency radio communications.
  • Keywords
    least mean squares methods; speech enhancement; MMSE STSA speech enhancement; NOIZEUS database; PESQ; coloured broadband noise; deterministic noise components; hidden Markov model based frequency tracking; minimum mean-square error; noise mean subtracted MMSE; perceptual evaluation of speech quality; periodic noise; short time spectral amplitude framework; speech utterance; Broadband communication; Frequency estimation; Hidden Markov models; Noise; Speech; Time frequency analysis; Minimum mean-square error (MMSE); deterministic noise; emergency communications
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Communications and Information Technologies (ISCIT), 2012 International Symposium on
  • Conference_Location
    Gold Coast, QLD
  • Print_ISBN
    978-1-4673-1156-4
  • Electronic_ISBN
    978-1-4673-1155-7
  • Type
    conf
  • DOI
    10.1109/ISCIT.2012.6380884
  • Filename
    6380884