DocumentCode
2774232
Title
Improving Speaker Identification Rate Using Fractals
Author
Nelwamondo, Fulufhelo V. ; Mahola, Unathi ; Marwala, Tshilidzi
Author_Institution
Univ. of the Witwatersrand, Johannesburg
fYear
0
fDate
0-0 0
Firstpage
3231
Lastpage
3236
Abstract
This paper reports on a text-dependent speaker identification system that combines Mel-frequency cepstral coefficients with non-linear turbulence information extracted using multi-scale fractal dimension (MFD). The MFD is estimated using Box-Counting and Minkowiski-Bouligand dimension. The proposed framework is implemented in conjunction with sub-band based speaker identification system. Results show that the proposed framework with Box-Counting feature extraction improves the performance of the classical wideband approach by up to 10% identification rate. It is further observed that the proposed framework gives the improved Bhattacharyya distance between impostors and speakers´ speech distributions.
Keywords
feature extraction; fractals; speaker recognition; box-counting feature extraction; mel-frequency cepstral coefficients; multiscale fractal dimension; nonlinear turbulence information; speaker identification rate; speaker speech distributions; text-dependent speaker identification system; Africa; Data mining; Feature extraction; Fractals; Hidden Markov models; Loudspeakers; Speaker recognition; Speech; Testing; Wideband;
fLanguage
English
Publisher
ieee
Conference_Titel
Neural Networks, 2006. IJCNN '06. International Joint Conference on
Conference_Location
Vancouver, BC
Print_ISBN
0-7803-9490-9
Type
conf
DOI
10.1109/IJCNN.2006.247309
Filename
1716538
Link To Document