DocumentCode
2179716
Title
Structural MAP adaptation in GMM-supervector based speaker recognition
Author
Ferràs, Marc ; Shinoda, Koichi ; Furui, Sadaoki
Author_Institution
Dept. of Comput. Sci., Tokyo Inst. of Technol., Tokyo, Japan
fYear
2011
fDate
22-27 May 2011
Firstpage
5432
Lastpage
5435
Abstract
In recent years, adaptation techniques have been given special focus in speaker recognition tasks, mainly targeting speaker and session variation disentangling under the Maximum a Posteriori (MAP) criterion. For these techniques, unseen mixtures are usually adapted in a global manner, if ever. In this paper, we explore Structural MAP (SMAP), Maximum a Posteriori adaptation using hierarchical structures of the acoustic space that allow data scarceness issues to be tackled with different precision levels. We explore this approach in a speaker verification system using a Support Vector Machine (SVM) classifier and Gaussian mean supervectors (GMM-SVM). We show that this is an effective approach that considerably outperforms its relevance MAP counterpart in the 2006 NIST Speaker Recognition Evaluation. We also show that using a speaker-adapted Universal Background Model can improve the stability of the clustering algorithm besides obtaining further improvements.
Keywords
Gaussian processes; maximum likelihood estimation; speaker recognition; support vector machines; 2006 NIST speaker recognition evaluation; GMM-SVM; GMM-supervector; Gaussian mean supervectors; SMAP; clustering algorithm; maximum a posteriori; maximum a posteriori criterion; speaker recognition; speaker-adapted universal background model; structural MAP; structural MAP adaptation; support vector machine calssifier; Adaptation models; Kernel; NIST; Speaker recognition; Speech; Support vector machines; Training; GMM-SVM; MAP; SMAP; speaker recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location
Prague
ISSN
1520-6149
Print_ISBN
978-1-4577-0538-0
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2011.5947587
Filename
5947587
Link To Document