• DocumentCode
    1688075
  • Title

    On the relation between speech corruption models in the spectral and the cepstral domain

  • Author

    Fernandez Astudillo, Ramon ; Gerkmann, Timo

  • Author_Institution
    Spoken Language Syst. Lab., INESC-ID-Lisboa, Lisbon, Portugal
  • fYear
    2013
  • Firstpage
    7044
  • Lastpage
    7048
  • Abstract
    The Gaussian distortion model in the short-time Fourier transform (STFT) domain is the basis of many of the modern speech enhancement algorithms. One of the reasons is that additive sources and late reverberation can be analyzed and processed quite efficiently in this domain. The STFT domain is however not well related to acoustic quality and is also not well suited for learning models due to the high variability of speech in this domain. On the other hand, the cepstral domain has proved to be very well suited for these last two purposes, however, at the cost of loosing the simple linear relation between desired source and additive interferences. In this paper we explore the relation between the Gaussian distortion models in the STFT and the cepstral domain. We show how the assumption of a jointly Gaussian distortion model in the cepstrum domain is fulfilled for well-known distortion models in STFT domain. We provide closed-form solutions relating the joint distributions of corrupted and clean speech in the STFT and the cepstrum domain. We also propose various ways in which this model can be used to enhance speech.
  • Keywords
    Fourier transforms; Gaussian distribution; distortion; learning (artificial intelligence); speech enhancement; Gaussian distortion model; STFT domain; acoustic quality; cepstral domain; learning models; short-time Fourier transform domain; spectral domain; speech corruption models; speech enhancement algorithms; speech variability; Cepstrum; Joints; Signal to noise ratio; Speech; Speech enhancement; Uncertainty; Cepstrum Domain; Speech Enhancement; Uncertainty Propagation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
  • Conference_Location
    Vancouver, BC
  • ISSN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2013.6639028
  • Filename
    6639028