• DocumentCode
    2179716
  • Title

    Structural MAP adaptation in GMM-supervector based speaker recognition

  • Author

    Ferràs, Marc ; Shinoda, Koichi ; Furui, Sadaoki

  • Author_Institution
    Dept. of Comput. Sci., Tokyo Inst. of Technol., Tokyo, Japan
  • fYear
    2011
  • fDate
    22-27 May 2011
  • Firstpage
    5432
  • Lastpage
    5435
  • Abstract
    In recent years, adaptation techniques have been given special focus in speaker recognition tasks, mainly targeting speaker and session variation disentangling under the Maximum a Posteriori (MAP) criterion. For these techniques, unseen mixtures are usually adapted in a global manner, if ever. In this paper, we explore Structural MAP (SMAP), Maximum a Posteriori adaptation using hierarchical structures of the acoustic space that allow data scarceness issues to be tackled with different precision levels. We explore this approach in a speaker verification system using a Support Vector Machine (SVM) classifier and Gaussian mean supervectors (GMM-SVM). We show that this is an effective approach that considerably outperforms its relevance MAP counterpart in the 2006 NIST Speaker Recognition Evaluation. We also show that using a speaker-adapted Universal Background Model can improve the stability of the clustering algorithm besides obtaining further improvements.
  • Keywords
    Gaussian processes; maximum likelihood estimation; speaker recognition; support vector machines; 2006 NIST speaker recognition evaluation; GMM-SVM; GMM-supervector; Gaussian mean supervectors; SMAP; clustering algorithm; maximum a posteriori; maximum a posteriori criterion; speaker recognition; speaker-adapted universal background model; structural MAP; structural MAP adaptation; support vector machine calssifier; Adaptation models; Kernel; NIST; Speaker recognition; Speech; Support vector machines; Training; GMM-SVM; MAP; SMAP; speaker recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
  • Conference_Location
    Prague
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4577-0538-0
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2011.5947587
  • Filename
    5947587