مرکز منطقه ای اطلاع رساني علوم و فناوري - Modified MFCCs for robust speaker recognition

DocumentCode :

3447248

Title :

Modified MFCCs for robust speaker recognition

Author :

Hong, Wang ; Jin´Gui, Pan

Author_Institution :

State Key Lab. for Novel Software Technol., Nanjing Univ., Nanjing, China

Volume :

fYear :

2010

fDate :

29-31 Oct. 2010

Firstpage :

276

Lastpage :

279

Abstract :

Mel-scale frequency cepstrum coefficients (MFCCs) are commonly used featues in speaker recognition systems, but MFCC values are not very robust in the presence of noise. thus, the modified MFCCs (named as SMN-CMN-MFCC) based on the general noisy speech model is proposed in this paper, which uses spectrum mean normalization (SMN) to suppress the additive noise, and uses cepstral mean normalization (CMN) to remove the effect of convolutional noise. Theoretical analyses show that the combination of SMN and CMN can inhibit additive and convolutional noise at the same time. To verify the performance of the SMN-CMN-MFCC, we have conducted some speaker recognition tests. With the same convolutional noise component, the additive white noise experiments and the additive factory noise experiments show that SMN-CMN-MFCC provides 10.5% and 9.6% relative improvement than the conventional MFCC and ΔMFCC features, respectively.

Keywords :

cepstral analysis; speaker recognition; SMN-CMN-MFCC; additive factory noise; additive noise suppression; cepstral mean normalization; convolutional noise removal; modified Mel-scale frequency cepstrum coefficients; noisy speech model; robust speaker recognition system; spectrum mean normalization; Convolution; Mel frequency cepstral coefficient; Robustness; Signal to noise ratio; Mel-scale Frequency Cepstral Coefficients; feature extraction; speaker recognition;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Intelligent Computing and Intelligent Systems (ICIS), 2010 IEEE International Conference on

Conference_Location :

Xiamen

Print_ISBN :

978-1-4244-6582-8

Type :

conf

DOI :

10.1109/ICICISYS.2010.5658679

Filename :

5658679

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3447248