Title :
Noise robust speaker identification by dividing MFCC
Author :
Matsumoto, Kaname ; Hayasaka, Noboru ; Iiguni, Youji
Author_Institution :
Grad. Sch. of Eng. Sci., Osaka Univ., Toyonaka, Japan
Abstract :
Until now, systems based on speaker identification have not been widely used, mainly because their identification accuracy is low. In this paper, we report the results of a modulation frequency analysis and propose a novel method that makes effective use of the modulation frequency components of vocal tract characteristics. We investigated the effective components of the interframe variation of MFCC. The proposed method extracts those components and exploits them by dividing the MFCC. The division bands, from 0 to 7.80 Hz and from 7.80 to 25.0 Hz, were determined from the modulation frequency analysis and from human speech perception. Using these bands, we divided the MFCC into two parts, low-MFCC and middle-MFCC, and conducted identification experiments on them. Using these MFCCs improved the identification accuracy by 2.1% on average. However, the effect of the proposed method varied with the uttered text.
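The band division described in the abstract can be illustrated with a minimal sketch. The abstract does not state how the division is implemented, so everything below is an assumption made for illustration: the function name split_mfcc_by_modulation is hypothetical, the frame rate of 100 frames per second is assumed, and a simple FFT mask along the time axis stands in for whatever filter the authors actually used. Only the band edges (7.80 Hz and 25.0 Hz) come from the abstract.

```python
import numpy as np


def split_mfcc_by_modulation(mfcc, frame_rate=100.0, low_edge=7.8, mid_edge=25.0):
    """Split MFCC trajectories into two modulation-frequency bands.

    mfcc       : array of shape (n_frames, n_coeffs), one row per frame
    frame_rate : frames per second (assumed value, not from the abstract)
    low_edge   : upper edge of the low band in Hz (7.80 Hz in the abstract)
    mid_edge   : upper edge of the middle band in Hz (25.0 Hz in the abstract)

    Returns (low_mfcc, middle_mfcc), each with the same shape as the input.
    """
    mfcc = np.asarray(mfcc, dtype=float)
    n_frames = mfcc.shape[0]

    # Modulation spectrum: FFT of each coefficient trajectory over time.
    spectrum = np.fft.rfft(mfcc, axis=0)
    freqs = np.fft.rfftfreq(n_frames, d=1.0 / frame_rate)

    # Brick-wall masks for the two bands (an illustrative choice).
    low_mask = freqs < low_edge
    mid_mask = (freqs >= low_edge) & (freqs <= mid_edge)

    # Back to the time domain, one band at a time.
    low_mfcc = np.fft.irfft(spectrum * low_mask[:, None], n=n_frames, axis=0)
    middle_mfcc = np.fft.irfft(spectrum * mid_mask[:, None], n=n_frames, axis=0)
    return low_mfcc, middle_mfcc
```

Under these assumptions, a call such as low, mid = split_mfcc_by_modulation(features) would yield two feature streams that could be modeled separately. Components above 25.0 Hz are discarded in this sketch, and the low band keeps the 0 Hz (mean) term.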
Keywords :
speaker recognition; MFCC interframe variation; human speech perception; identification accuracy; low-MFCC; middle-MFCC; modulation frequency analysis; modulation frequency component; noise robust speaker identification; uttered text; vocal tract characteristics; Accuracy; Frequency modulation; Hidden Markov models; Mel frequency cepstral coefficient; Speaker recognition; Speech;
Conference_Title :
Communications, Control and Signal Processing (ISCCSP), 2014 6th International Symposium on
Conference_Location :
Athens
DOI :
10.1109/ISCCSP.2014.6877959