• DocumentCode
    417118
  • Title

    Improvement of speaker recognition by combining residual and prosodic features with acoustic features

  • Author

    Chen, Shi-Han ; Wang, Hsiao-Chuan

  • Author_Institution
    Dept. of Electr. Eng., Nat. Tsing Hua Univ., Hsinchu, Taiwan
  • Volume
    1
  • fYear
    2004
  • fDate
    17-21 May 2004
  • Abstract
    When a speech signal is encoded in some low bit-rate coding formats, it becomes more difficult to distinguish speaker identities. The paper investigates the codec effect on acoustic and prosodic features. A new representation of prosodic features based on the piecewise fitting of the pitch contour is introduced. A method for including residual features based on the LDA (linear discriminant analysis) algorithm is suggested. By combining prosodic features with acoustic features, we can improve the performance of a speaker recognition system. A series of experiments is performed with coded speech affected by G.729A and GSM codec processes to demonstrate the effectiveness of our proposed method.
  • Keywords
    speaker recognition; speech codecs; speech coding; statistical analysis; G.729A; GSM; LDA algorithm; acoustic features; codec effect; linear discriminant analysis; piecewise fitting; pitch contour; prosodic features; residual features; speaker recognition; Acoustic distortion; Decoding; Degradation; Frequency; GSM; Speaker recognition; Speech analysis; Speech codecs; Speech coding; Speech processing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-8484-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.2004.1325930
  • Filename
    1325930