• DocumentCode
    940019
  • Title

    Combination of autocorrelation-based features and projection measure technique for speaker identification

  • Author

    Yuo, Kuo-Hwei ; Hwang, Tai-Hwei ; Wang, Hsiao-Chuan

  • Author_Institution
    Chung-Shan Inst. of Sci. & Technol., Tao-Yuan, Taiwan
  • Volume
    13
  • Issue
    4
  • fYear
    2005
  • fDate
    7/1/2005 12:00:00 AM
  • Firstpage
    565
  • Lastpage
    574
  • Abstract
    This paper presents a robust approach for speaker identification when the speech signal is corrupted by additive noise and channel distortion. Robust features are derived by assuming that the corrupting noise is stationary and the channel effect is fixed during an utterance. A two-step temporal filtering procedure on the autocorrelation sequence is proposed to minimize the effect of additive and convolutional noises. The first step applies a temporal filtering procedure in autocorrelation domain to remove the additive noise, and the second step is to perform the mean subtraction on the filtered autocorrelation sequence in logarithmic spectrum domain to remove the channel effect. No prior knowledge of noise characteristic is necessary. The additive noise can be a colored noise. Then the proposed robust feature is combined with the projection measure technique to gain further improvement in recognition accuracy. Experimental results show that the proposed method can significantly improve the performance of speaker identification task in noisy environment.
  • Keywords
    correlation methods; filtering theory; speaker recognition; additive noise; autocorrelation sequence; channel distortion; convolutional noises; logarithmic spectrum domain; projection measure technique; speaker identification; two-step temporal filtering; Additive noise; Autocorrelation; Colored noise; Convolution; Distortion measurement; Filtering; Noise robustness; Signal processing; Speech enhancement; Working environment noise; Channel-normalization; projection measure; relative autocorrelation sequence; speaker identification;
  • fLanguage
    English
  • Journal_Title
    Speech and Audio Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1063-6676
  • Type

    jour

  • DOI
    10.1109/TSA.2005.848893
  • Filename
    1453599