DocumentCode :
417118
Title :
Improvement of speaker recognition by combining residual and prosodic features with acoustic features
Author :
Chen, Shi-Han ; Wang, Hsiao-Chuan
Author_Institution :
Dept. of Electr. Eng., Nat. Tsing Hua Univ., Hsinchu, Taiwan
Volume :
1
fYear :
2004
fDate :
17-21 May 2004
Abstract :
When a speech signal is encoded in some low bit-rate coding formats, it becomes more difficult to distinguish speaker identities. The paper investigates the codec effect on acoustic and prosodic features. A new representation of prosodic features based on the piecewise fitting of the pitch contour is introduced. A method for including residual features based on the LDA (linear discriminant analysis) algorithm is suggested. By combining prosodic features with acoustic features, we can improve the performance of a speaker recognition system. A series of experiments is performed with coded speech affected by G.729A and GSM codec processes to demonstrate the effectiveness of our proposed method.
Keywords :
speaker recognition; speech codecs; speech coding; statistical analysis; G.729A; GSM; LDA algorithm; acoustic features; codec effect; linear discriminant analysis; piecewise fitting; pitch contour; prosodic features; residual features; speaker recognition; Acoustic distortion; Decoding; Degradation; Frequency; GSM; Speaker recognition; Speech analysis; Speech codecs; Speech coding; Speech processing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
ISSN :
1520-6149
Print_ISBN :
0-7803-8484-9
Type :
conf
DOI :
10.1109/ICASSP.2004.1325930
Filename :
1325930
Link To Document :
بازگشت