• DocumentCode
    1973860
  • Title

    A text-independent speaker recognition system based on Probabilistic Principle Component Analysis

  • Author

    Xiao-chun, Lu ; Jun-xun, Yin ; Wei-ping, Hu

  • Author_Institution
    Sch. of Electron. & Inf. Eng., South China Univ. of Technol., Guangzhou, China
  • Volume
    1
  • fYear
    2012
  • fDate
    20-21 Oct. 2012
  • Firstpage
    255
  • Lastpage
    260
  • Abstract
    To alleviate the problem of severe degradation of speaker recognition performance because of the phoneme variability between training and testing speech data, in the text-independent speaker recognition system. The paper proposed a text-independent (TI) speaker identification method that suppresses the phonetic information by a subspace method, Probabilistic Principle Component Analysis (PPCA) is utilized to construct these subspaces. Firstly, the covariance matrix was obtained from the large training speech feature data, and then the projection matrix was obtained using the EM algorithm. In the proposed method, it is assumed that a subspace with large variance in the speech feature space is a “phoneme-dependent subspace” and a complementary subspace of it is a “phoneme-independent subspace”, the feature vectors of train/test speech data are projected to a phoneme-independent subspace and a new feature vectors are obtained. In GMM-based TI speaker identification experiments, the new feature vectors improves the identification rate by 16.25% and 2.99% respectively, compared with conventional MFCC, PCA-based MFCC. It shows that the new feature vectors of the proposed method can efficiently capture speaker-discriminative information, and suppress the other speech information.
  • Keywords
    Gaussian processes; covariance matrices; expectation-maximisation algorithm; principal component analysis; speaker recognition; speech synthesis; EM algorithm; GMM-based TI speaker identification; PPCA; TI speaker identification method; complementary subspace; covariance matrix; feature vectors; large training speech feature data; phoneme variability; phoneme-dependent subspace; probabilistic principle component analysis; projection matrix; speaker recognition performance; text-independent speaker recognition system; Feature extraction; Mel frequency cepstral coefficient; Principal component analysis; Probabilistic logic; Speaker recognition; Speech; Vectors; Probabilitic Principle Component Analysis; eigenvoice; speaker identification; subspace projection;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    System Science, Engineering Design and Manufacturing Informatization (ICSEM), 2012 3rd International Conference on
  • Conference_Location
    Chengdu
  • Print_ISBN
    978-1-4673-0914-1
  • Type

    conf

  • DOI
    10.1109/ICSSEM.2012.6340721
  • Filename
    6340721