• DocumentCode
    712897
  • Title

    Speaker weight estimation from speech signals using a fusion of the i-vector and NFA frameworks

  • Author

    Poorjam, Amir Hossein ; Bahari, Mohamad Hasan ; Van hamme, Hugo

  • Author_Institution
    Center for Process. Speech &Images, KU Leuven, Leuven, Belgium
  • fYear
    2015
  • fDate
    3-5 March 2015
  • Firstpage
    118
  • Lastpage
    123
  • Abstract
    In this paper, a novel approach for automatic speaker weight estimation from spontaneous telephone speech signals is proposed. In this method, each utterance is modeled using the i-vector framework which is based on the factor analysis on Gaussian Mixture Model (GMM) mean supervectors, and the Non-negative Factor Analysis (NFA) framework which is based on a constrained factor analysis on GMM weights. Then, the available information in both Gaussian means and Gaussian weights is exploited through a feature-level fusion of the i-vectors and the NFA vectors. Finally, a least-squares support vector regression (LS-SVR) is employed to estimate the weight of speakers from given utterances. The proposed approach is evaluated on the telephone speech signals of National Institute of Standards and Technology (NIST) 2008 and 2010 Speaker Recognition Evaluation (SRE) corpora. Experimental results over 2339 utterances show that the correlation coefficients between actual and estimated weights of male and female speakers are 0.56 and 0.49, respectively, which indicate the effectiveness of the proposed method in speaker weight estimation.
  • Keywords
    Gaussian processes; mixture models; regression analysis; sensor fusion; speech processing; support vector machines; GMM mean supervectors; Gaussian means; Gaussian mixture model mean supervectors; Gaussian weights; LS-SVR; NFA frameworks; constrained factor analysis; feature-level fusion; i-vector fusion; least-squares support vector regression; nonnegative factor analysis framework; speaker weight estimation; spontaneous telephone speech signals; Correlation; Estimation; Kernel; Speech; Support vector machines; Testing; Training; Least-Squares Support Vector Regression; Non-negative Factor Analysis; Speaker Weight Estimation; i-vector;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Artificial Intelligence and Signal Processing (AISP), 2015 International Symposium on
  • Conference_Location
    Mashhad
  • Print_ISBN
    978-1-4799-8817-4
  • Type

    conf

  • DOI
    10.1109/AISP.2015.7123494
  • Filename
    7123494