Accounting for deterministic noise components in a MMSE STSA speech enhancement framework

Author

McCallum, Matthew ; Guillemin, Bernard

Author_Institution

Dept. of Electr. & Comput. Eng., Univ. of Auckland, Auckland, New Zealand

fYear

2012

fDate

2-5 Oct. 2012

Firstpage

169

Lastpage

174

Abstract

Current approaches to speech enhancement usually consider performance in the presence of either coloured broadband noise or periodic noise, but rarely both. In this paper we present a new speech enhancement technique, derived within the standard minimum mean-square error (MMSE) short-time spectral amplitude (STSA) framework, which jointly compensates for both coloured broadband and periodic noise components. With this approach, termed noise mean subtracted (NMS) MMSE, hidden Markov model (HMM) based frequency tracking techniques are used to estimate periodic noise components and differentiate them from periodic components associated with the speech signal. The estimated periodic noise components are then removed using complex spectral subtraction. The resulting algorithm is evaluated using perceptual evaluation of speech quality (PESQ) for both male and female speech utterances from the NOIZEUS database. It is shown that when the noise contamination comprises both broadband and periodic components, the NMS MMSE algorithm outperforms the standard MMSE algorithm, which is derived under the assumption of stochastic noise only. The technique has application in a variety of scenarios, including those involving emergency radio communications.
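
The complex spectral subtraction step mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes a single periodic noise component whose frequency, amplitude and phase are already known (in the paper these would come from the HMM-based frequency tracker, which is not reproduced here), and the function names, the Hann window and the frame length are illustrative choices only.

```python
import cmath
import math


def dft(frame):
    """Naive O(N^2) discrete Fourier transform of a real or complex sequence."""
    n = len(frame)
    return [
        sum(x * cmath.exp(-2j * math.pi * k * t / n) for t, x in enumerate(frame))
        for k in range(n)
    ]


def hann(n):
    """Periodic Hann analysis window of length n (an assumed window choice)."""
    return [0.5 - 0.5 * math.cos(2 * math.pi * t / n) for t in range(n)]


def sinusoid(n, freq, amp, phase):
    """Deterministic tone of n samples; freq is in cycles per sample."""
    return [amp * math.cos(2 * math.pi * freq * t + phase) for t in range(n)]


def subtract_periodic(noisy_frame, freq, amp, phase):
    """Complex spectral subtraction of one estimated tone from one frame.

    Hypothetical helper: freq, amp and phase are assumed to have been
    estimated beforehand. Returns the complex spectrum of the frame with
    the tone's windowed spectrum removed.
    """
    n = len(noisy_frame)
    w = hann(n)
    noisy_spec = dft([w[t] * noisy_frame[t] for t in range(n)])
    tone = sinusoid(n, freq, amp, phase)
    tone_spec = dft([w[t] * tone[t] for t in range(n)])
    # Subtract complex values (magnitude and phase), not magnitudes alone.
    return [y - d for y, d in zip(noisy_spec, tone_spec)]
```

Because the subtraction operates on complex spectra, a perfectly estimated tone is cancelled exactly by linearity of the DFT. The residual spectrum, which would then contain only speech and broadband noise, could be passed on to a conventional MMSE STSA gain function. With imperfect parameter estimates the cancellation is only partial.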

Keywords

least mean squares methods; speech enhancement; MMSE STSA speech enhancement; NOIZEUS database; PESQ; coloured broadband noise; deterministic noise components; hidden Markov model based frequency tracking; minimum mean-square error (MMSE); noise mean subtracted MMSE; perceptual evaluation of speech quality; periodic noise; short time spectral amplitude framework; speech utterance; Broadband communication; Frequency estimation; Hidden Markov models; Noise; Speech; Time frequency analysis; deterministic noise; emergency communications

fLanguage

English

Publisher

IEEE

Conference_Title

Communications and Information Technologies (ISCIT), 2012 International Symposium on

Conference_Location

Gold Coast, QLD

Print_ISBN

978-1-4673-1156-4

Electronic_ISBN

978-1-4673-1155-7

Type

conf

DOI

10.1109/ISCIT.2012.6380884

Filename

6380884