DocumentCode
417118
Title
Improvement of speaker recognition by combining residual and prosodic features with acoustic features
Author
Chen, Shi-Han ; Wang, Hsiao-Chuan
Author_Institution
Dept. of Electr. Eng., Nat. Tsing Hua Univ., Hsinchu, Taiwan
Volume
1
fYear
2004
fDate
17-21 May 2004
Abstract
When a speech signal is encoded in some low bit-rate coding formats, it becomes more difficult to distinguish speaker identities. The paper investigates the codec effect on acoustic and prosodic features. A new representation of prosodic features based on the piecewise fitting of the pitch contour is introduced. A method for including residual features based on the LDA (linear discriminant analysis) algorithm is suggested. By combining prosodic features with acoustic features, we can improve the performance of a speaker recognition system. A series of experiments is performed with coded speech affected by G.729A and GSM codec processes to demonstrate the effectiveness of our proposed method.
Keywords
speaker recognition; speech codecs; speech coding; statistical analysis; G.729A; GSM; LDA algorithm; acoustic features; codec effect; linear discriminant analysis; piecewise fitting; pitch contour; prosodic features; residual features; speaker recognition; Acoustic distortion; Decoding; Degradation; Frequency; GSM; Speaker recognition; Speech analysis; Speech codecs; Speech coding; Speech processing
fLanguage
English
Publisher
IEEE
Conference_Titel
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
ISSN
1520-6149
Print_ISBN
0-7803-8484-9
Type
conf
DOI
10.1109/ICASSP.2004.1325930
Filename
1325930