مرکز منطقه ای اطلاع رساني علوم و فناوري - Multi-feature combination for speaker recognition

DocumentCode :

2017786

Title :

Multi-feature combination for speaker recognition

Author :

Li, Zhi-Yi ; He, Liang ; Zhang, Wei-Qiang ; Liu, Jia

Author_Institution :

Dept. of Electron. Eng., Tsinghua Univ., Beijing, China

fYear :

2010

fDate :

Nov. 29 2010-Dec. 3 2010

Firstpage :

318

Lastpage :

321

Abstract :

Combination of different features has been proved to be a good method for improving performance in speech recognition. In speaker recognition (SRE), various features have also been developed to reflect complementary aspects of speaker´s characteristics. This paper proposed an effective multi-feature combination in speaker recognition. In order to avoid the “dimensionality disaster” and to delimit the redundant information, linear discriminant analysis (LDA) is used to reduce the high dimensionality of combined feature to be lower. Then feature-domain channel compensation is applied to improve the performance. In experiments, we use the popular short-term spectral Mel-frequency cepstral coefficients (MFCC) and novel spectro-temporal time-frequency cepstrum (TFC) to do feature combination followed by LDA and feature-domain latent factor analysis (fLFA) for channel compensation respectively. The experimental results on the NIST SRE2008 short2 telephone-short3 telephone test set show that the proposed multi-feature combination is an effective method to outperform both raw features.

Keywords :

regression analysis; speaker recognition; time-frequency analysis; channel compensation; feature domain latent factor analysis; linear discriminant analysis; melfrequency cepstral coefficient; speaker recognition; spectrotemporal time-frequency cepstrum; speech recognition; Covariance matrix; Feature extraction; Mel frequency cepstral coefficient; Mutual information; Speaker recognition; Speech; GMM; MFCC; TFC; multi-feature combination;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Chinese Spoken Language Processing (ISCSLP), 2010 7th International Symposium on

Conference_Location :

Tainan

Print_ISBN :

978-1-4244-6244-5

Type :

conf

DOI :

10.1109/ISCSLP.2010.5684885

Filename :

5684885

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2017786