• DocumentCode
    394374
  • Title

    Multichannel speech enhancement using Bayesian spectral amplitude estimation

  • Author

    Lotter, Thomas ; Benien, Christian ; Vary, Peter

  • Author_Institution
    Inst. of Commun. Syst. & Data Process.(iNd), Aachen Univ. (RWTH), Germany
  • Volume
    1
  • fYear
    2003
  • fDate
    6-10 April 2003
  • Abstract
    This paper introduces two short-time spectral amplitude estimators for speech enhancement with multiple microphones. Based on joint Gaussian models of speech and noise Fourier coefficients the clean speech amplitudes are estimated with respect to the MMSE or the MAP criterion. The estimators outperform single microphone minimum mean square amplitude estimators when the speech is highly correlated and the noise is sufficiently uncorrelated. Whereas the first MMSE estimator also requires the desired signals to be in phase, the second MAP estimator performs a direction-independent noise reduction. The estimators are generalizations of the well known single channel MMSE estimator derived by Ephraim and Malah (1984) and the MAP estimator derived by Wolfe and Godsill see (Proceedings of the 11th IEEE Workshop on Statistical Signal Processing, p.496-499, August 2001) respectively.
  • Keywords
    Bayes methods; Gaussian processes; amplitude estimation; discrete Fourier transforms; least mean squares methods; maximum likelihood estimation; noise; spectral analysis; speech enhancement; Bayesian spectral amplitude estimation; DFT; MAP criterion; MAP estimator; clean speech amplitudes; correlated speech; direction-independent noise reduction; discrete Fourier transform; joint Gaussian models; multichannel speech enhancement; multiple microphones; noise Fourier coefficients; short-time spectral amplitude estimators; single microphone MMSE amplitude estimators; speech Fourier coefficients; speech communication; uncorrelated noise; Amplitude estimation; Bayesian methods; Conferences; Gaussian noise; Microphones; Noise level; Noise reduction; Phase estimation; Signal processing; Speech enhancement;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-7663-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.2003.1198922
  • Filename
    1198922