• DocumentCode
    812514
  • Title

    A Speech Enhancement Algorithm Based on a Chi MRF Model of the Speech STFT Amplitudes

  • Author

    Andrianakis, Yiannis ; White, Paul R.

  • Author_Institution
    Nat. Oceanogr. Center, Southampton, UK
  • Volume
    17
  • Issue
    8
  • fYear
    2009
  • Firstpage
    1508
  • Lastpage
    1517
  • Abstract
    A speech enhancement algorithm that takes advantage of the time and frequency dependencies of speech signals is presented in this paper. The above dependencies are incorporated in the statistical model using concepts from the theory of Markov Random Fields. In particular, the speech short-time Fourier transform (STFT) amplitude samples are modeled with a novel Chi Markov Random Field prior, which is then used for the development of an estimator based on the Iterated Conditional Modes method. The novel prior is also coupled with a dasiaharmonicpsila neighborhood, which apart from the immediately adjacent samples on the time frequency plane, also considers samples which are one pitch frequency apart, so as to take advantage of the rich structure of the voiced speech time frames. Additionally, central to the development of the algorithm is the adaptive estimation of the weights that determine the interaction between neighboring samples, which allows the restoration of weak speech spectral components, while maintaining a low level of uniform residual noise. Results that illustrate the improvements achieved with the proposed algorithm, and a comparison with other established speech enhancement schemes are also given.
  • Keywords
    Fourier transforms; Markov processes; iterative methods; speech enhancement; Chi Markov random field; Iterated Conditional Modes method; adaptive estimation; harmonic neighborhood; speech enhancement algorithm; speech restoration; speech short-time Fourier transform amplitude samples; speech spectral components; statistical model; Chi; Gaussian; Markov random fields; short-time Fourier transform (STFT) estimation; speech enhancement;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1558-7916
  • Type

    jour

  • DOI
    10.1109/TASL.2009.2022199
  • Filename
    4909049