Title :
Multi-feature combination for speaker recognition
Author :
Li, Zhi-Yi ; He, Liang ; Zhang, Wei-Qiang ; Liu, Jia
Author_Institution :
Dept. of Electron. Eng., Tsinghua Univ., Beijing, China
fDate :
Nov. 29 2010-Dec. 3 2010
Abstract :
Combination of different features has been proved to be a good method for improving performance in speech recognition. In speaker recognition (SRE), various features have also been developed to reflect complementary aspects of speaker´s characteristics. This paper proposed an effective multi-feature combination in speaker recognition. In order to avoid the “dimensionality disaster” and to delimit the redundant information, linear discriminant analysis (LDA) is used to reduce the high dimensionality of combined feature to be lower. Then feature-domain channel compensation is applied to improve the performance. In experiments, we use the popular short-term spectral Mel-frequency cepstral coefficients (MFCC) and novel spectro-temporal time-frequency cepstrum (TFC) to do feature combination followed by LDA and feature-domain latent factor analysis (fLFA) for channel compensation respectively. The experimental results on the NIST SRE2008 short2 telephone-short3 telephone test set show that the proposed multi-feature combination is an effective method to outperform both raw features.
Keywords :
regression analysis; speaker recognition; time-frequency analysis; channel compensation; feature domain latent factor analysis; linear discriminant analysis; melfrequency cepstral coefficient; speaker recognition; spectrotemporal time-frequency cepstrum; speech recognition; Covariance matrix; Feature extraction; Mel frequency cepstral coefficient; Mutual information; Speaker recognition; Speech; GMM; MFCC; TFC; multi-feature combination;
Conference_Titel :
Chinese Spoken Language Processing (ISCSLP), 2010 7th International Symposium on
Conference_Location :
Tainan
Print_ISBN :
978-1-4244-6244-5
DOI :
10.1109/ISCSLP.2010.5684885