DocumentCode :
2697034
Title :
Compensating for Mismatch in High-Level Speaker Recognition
Author :
Campbell, W.M.
Author_Institution :
Lincoln Lab., MIT, Lexington, MA
fYear :
2006
fDate :
28-30 June 2006
Firstpage :
1
Lastpage :
6
Abstract :
Speaker recognition using high-level features has been a successful area of exploration. Features obtained from many different levels-phones, words, prosodic events, etc.-are used to characterize the speaker. A good modeling technique for these features is the support vector machine (SVM). SVMs model the n-gram frequencies from speaker utterances in a high-dimensional SVM feature space and have shown excellent performance over a wide variety of high-level features. A complimentary method of recent exploration in SVM speaker recognition is the use of nuisance attributes projection (NAP). NAP removes directions from SVM feature space that are superfluous to the task of speaker recognition-channel information, session variability, etc. In this paper, we consider the application of NAP to high-level speaker recognition. We describe the difficulties in applying this method and propose solutions. We also conduct experiments showing that NAP can reduce variability in SVM feature space leading to improved performance
Keywords :
speaker recognition; support vector machines; NAP; SVM feature space; high-level speaker recognition; n-gram frequency; nuisance attributes projection; support vector machine; Binary trees; Cepstral analysis; Frequency; Kernel; Laboratories; NIST; Robustness; Speaker recognition; Support vector machine classification; Support vector machines;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Speaker and Language Recognition Workshop, 2006. IEEE Odyssey 2006: The
Conference_Location :
San Juan
Print_ISBN :
1-424400471-1
Electronic_ISBN :
1-4244-0472-X
Type :
conf
DOI :
10.1109/ODYSSEY.2006.248110
Filename :
4013527
Link To Document :
بازگشت