• DocumentCode
    763553
  • Title

    Codebook driven short-term predictor parameter estimation for speech enhancement

  • Author

    Srinivasan, Sriram ; Samuelsson, Jonas ; Kleijn, W. Bastiaan

  • Author_Institution
    Dept. of Signals, KTH R. Inst. of Technol., Stockholm, Sweden
  • Volume
    14
  • Issue
    1
  • fYear
    2006
  • Firstpage
    163
  • Lastpage
    176
  • Abstract
    In this paper, we present a new technique for the estimation of short-term linear predictive parameters of speech and noise from noisy data and their subsequent use in waveform enhancement schemes. The method exploits a priori information about speech and noise spectral shapes stored in trained codebooks, parameterized as linear predictive coefficients. The method also uses information about noise statistics estimated from the noisy observation. Maximum-likelihood estimates of the speech and noise short-term predictor parameters are obtained by searching for the combination of codebook entries that optimizes the likelihood. The estimation involves the computation of the excitation variances of the speech and noise auto-regressive models on a frame-by-frame basis, using the a priori information and the noisy observation. The high computational complexity resulting from a full search of the joint speech and noise codebooks is avoided through an iterative optimization procedure. We introduce a classified noise codebook scheme that uses different noise codebooks for different noise types. Experimental results show that the use of a priori information and the calculation of the instantaneous speech and noise excitation variances on a frame-by-frame basis result in good performance in both stationary and nonstationary noise conditions.
  • Keywords
    autoregressive processes; iterative methods; linear predictive coding; maximum likelihood estimation; speech coding; speech enhancement; classified noise codebook scheme; codebook driven short-term predictor parameter estimation; computational complexity; frame-by-frame basis; iterative optimization procedure; maximum-likelihood estimates; noise autoregressive models; noise spectral shapes; noise statistics; noisy data; short-term linear predictive parameters; speech enhancement; speech spectral shapes; waveform enhancement schemes; Acoustic noise; Additive noise; Microphones; Mobile communication; Noise shaping; Parameter estimation; Predictive models; Speech enhancement; Statistics; Working environment noise; Autoregressive models; codebooks; maximum-likelihood; nonstationary noise; short-term predictor; speech enhancement;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1558-7916
  • Type

    jour

  • DOI
    10.1109/TSA.2005.854113
  • Filename
    1561274