DocumentCode :
1749686
Title :
Learning statistically efficient features for speaker recognition
Author :
Jang, Gil-Jin ; Lee, Te-Won ; Oh, Yung-Hwan
Author_Institution :
Dept. of Comput. Sci., KAIST, Taejon, South Korea
Volume :
1
fYear :
2001
fDate :
2001
Firstpage :
437
Abstract :
We apply independent component analysis for extracting an optimal basis to the problem of finding efficient features for a speaker. The basis functions learned by the algorithm are oriented and localized in both space and frequency, bearing a resemblance to Gabor functions. The speech segments are assumed to be generated by a linear combination of the basis functions, thus the distribution of speech segments of a speaker is modeled by a basis, which is calculated so that each component should be independent upon others on the given training data. The speaker distribution is modeled by the basis functions. To assess the efficiency of the basis functions, we performed speaker classification experiments and compared our results with the conventional Fourier-basis. Our results show that the proposed method is more efficient than the conventional Fourier-based features, in that they can obtain a higher classification rate
Keywords :
Gaussian distribution; pattern classification; speaker recognition; statistical analysis; Fourier-based features; basis functions; classification rate; independent component analysis; optimal basis; speaker classification; speaker distribution; speaker recognition; speech segments; statistically efficient features; Computer science; Ear; Focusing; Frequency; Independent component analysis; Laboratories; Speaker recognition; Speech; Training data; Vectors;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2001. Proceedings. (ICASSP '01). 2001 IEEE International Conference on
Conference_Location :
Salt Lake City, UT
ISSN :
1520-6149
Print_ISBN :
0-7803-7041-4
Type :
conf
DOI :
10.1109/ICASSP.2001.940861
Filename :
940861
Link To Document :
بازگشت