• DocumentCode
    1759348
  • Title

    A Symmetric Kernel Partial Least Squares Framework for Speaker Recognition

  • Author

    Srinivasan, Balaji V. ; Yuancheng Luo ; Garcia-Romero, Daniel ; Zotkin, Dmitry N. ; Duraiswami, Ramani

  • Author_Institution
    Adobe Res. Bangalore Labs., Bangalore, India
  • Volume
    21
  • Issue
    7
  • fYear
    2013
  • fDate
    41456
  • Firstpage
    1415
  • Lastpage
    1423
  • Abstract
    I-vectors are concise representations of speaker characteristics. Recent progress in i-vectors related research has utilized their ability to capture speaker and channel variability to develop efficient automatic speaker verification (ASV) systems. Inter-speaker relationships in the i-vector space are non-linear. Accomplishing effective speaker verification requires a good modeling of these non-linearities and can be cast as a machine learning problem. Kernel partial least squares (KPLS) can be used for discriminative training in the i-vector space. However, this framework suffers from training data imbalance and asymmetric scoring. We use “one shot similarity scoring” (OSS) to address this. The resulting ASV system (OSS-KPLS) is tested across several conditions of the NIST SRE 2010 extended core data set and compared against state-of-the-art systems: Joint Factor Analysis (JFA), Probabilistic Linear Discriminant Analysis (PLDA), and Cosine Distance Scoring (CDS) classifiers. Improvements are shown.
  • Keywords
    learning (artificial intelligence); least squares approximations; speaker recognition; ASV system; CDS; JFA; NIST SRE 2010; OSS-KPLS; PLDA; automatic speaker verification system; cosine distance scoring classifiers; i-vectors; joint factor analysis; machine learning problem; probabilistic linear discriminant analysis; speaker recognition; symmetric Kernel partial least squares framework; Adaptation models; Joints; Kernel; Linear discriminant analysis; Speech; Training; Vectors; One-shot similarity; discriminative classifier; kernel PLS; speaker recognition; speaker verification;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1558-7916
  • Type

    jour

  • DOI
    10.1109/TASL.2013.2253096
  • Filename
    6480796