Title :
A pitch synchronous feature extraction method for speaker recognition
Author :
Kim, Samuel ; Eriksson, Thomas ; Kang, Hong-Goo ; Youn, Dae Hee
Author_Institution :
Dept. of Electr. & Electron. Eng., Yonsei Univ., Seoul, South Korea
Abstract :
The paper presents a novel feature extraction method to improve the performance of speaker identification systems. The proposed feature has the form of a typical conventional feature, Mel frequency cepstral coefficients (MFCC), but a flexible segmentation to reduce spectral mismatch between training and testing processes. Specifically, the length and shift size of the analysis frame are determined by a pitch synchronous method, pitch synchronous MFCC (PSMFCC). To verify the performance of the new feature, we measure the cepstral distortion between training and testing and also perform closed set speaker identification tests. With text-independent and text-dependent experiments, the proposed algorithm provides 44.3% and 26.7% relative improvement, respectively.
Keywords :
cepstral analysis; feature extraction; learning (artificial intelligence); speaker recognition; Mel frequency cepstral coefficients; cepstral distortion; closed set speaker identification; pitch synchronous MFCC; pitch synchronous feature extraction method; speaker recognition; spectral mismatch; Cepstral analysis; Character generation; Distortion measurement; Feature extraction; Mel frequency cepstral coefficient; Signal analysis; Speaker recognition; Speech analysis; Speech recognition; Testing;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
Print_ISBN :
0-7803-8484-9
DOI :
10.1109/ICASSP.2004.1326008