On the relation between speech corruption models in the spectral and the cepstral domain

Author

Fernandez Astudillo, Ramon ; Gerkmann, Timo

Author_Institution

Spoken Language Syst. Lab., INESC-ID-Lisboa, Lisbon, Portugal

fYear

2013

Firstpage

7044

Lastpage

7048

Abstract

The Gaussian distortion model in the short-time Fourier transform (STFT) domain is the basis of many of the modern speech enhancement algorithms. One of the reasons is that additive sources and late reverberation can be analyzed and processed quite efficiently in this domain. The STFT domain is however not well related to acoustic quality and is also not well suited for learning models due to the high variability of speech in this domain. On the other hand, the cepstral domain has proved to be very well suited for these last two purposes, however, at the cost of loosing the simple linear relation between desired source and additive interferences. In this paper we explore the relation between the Gaussian distortion models in the STFT and the cepstral domain. We show how the assumption of a jointly Gaussian distortion model in the cepstrum domain is fulfilled for well-known distortion models in STFT domain. We provide closed-form solutions relating the joint distributions of corrupted and clean speech in the STFT and the cepstrum domain. We also propose various ways in which this model can be used to enhance speech.

Keywords

Fourier transforms; Gaussian distribution; distortion; learning (artificial intelligence); speech enhancement; Gaussian distortion model; STFT domain; acoustic quality; cepstral domain; learning models; short-time Fourier transform domain; spectral domain; speech corruption models; speech enhancement algorithms; speech variability; Cepstrum; Joints; Signal to noise ratio; Speech; Speech enhancement; Uncertainty; Cepstrum Domain; Speech Enhancement; Uncertainty Propagation;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on

Conference_Location

Vancouver, BC

ISSN

1520-6149

Type

conf

DOI

10.1109/ICASSP.2013.6639028

Filename

6639028