Title :
A text-independent speaker recognition system based on Probabilistic Principle Component Analysis
Author :
Xiao-chun, Lu ; Jun-xun, Yin ; Wei-ping, Hu
Author_Institution :
Sch. of Electron. & Inf. Eng., South China Univ. of Technol., Guangzhou, China
Abstract :
Text-independent speaker recognition performance degrades severely because of phoneme variability between training and testing speech data. To alleviate this problem, the paper proposes a text-independent (TI) speaker identification method that suppresses phonetic information with a subspace method; Probabilistic Principal Component Analysis (PPCA) is used to construct the subspaces. First, the covariance matrix is obtained from a large set of training speech feature data, and then the projection matrix is obtained using the EM algorithm. The method assumes that a subspace with large variance in the speech feature space is a “phoneme-dependent subspace” and that its complementary subspace is a “phoneme-independent subspace”. The feature vectors of the training and test speech data are projected onto the phoneme-independent subspace to obtain new feature vectors. In GMM-based TI speaker identification experiments, the new feature vectors improve the identification rate by 16.25% and 2.99% compared with conventional MFCC and PCA-based MFCC, respectively. This shows that the new feature vectors can efficiently capture speaker-discriminative information and suppress other speech information.
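The projection step described in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the function names (`ppca_em`, `complement_basis`, `project_features`), the subspace dimension `q`, the iteration count, and the random initialisation are all assumptions made for illustration, and the EM updates are the standard covariance-form PPCA updates rather than anything stated in the record. The sketch fits PPCA by EM, treats the span of the learned projection matrix as the large-variance ("phoneme-dependent") subspace, and projects the features onto its orthogonal complement.

```python
import numpy as np


def ppca_em(X, q, n_iter=100, seed=0):
    """Fit PPCA with EM (covariance form). Returns (mean, W, sigma2).

    X is an (n_frames, d) feature matrix; q is the assumed dimension of
    the large-variance subspace (a hypothetical choice, not from the paper).
    """
    rng = np.random.default_rng(seed)
    n, d = X.shape
    mean = X.mean(axis=0)
    Xc = X - mean
    S = Xc.T @ Xc / n                      # sample covariance matrix
    W = rng.standard_normal((d, q))        # random initial projection matrix
    sigma2 = 1.0                           # initial isotropic noise variance
    for _ in range(n_iter):
        M_inv = np.linalg.inv(W.T @ W + sigma2 * np.eye(q))
        SW = S @ W
        # Combined E- and M-step update for W.
        W_new = SW @ np.linalg.inv(sigma2 * np.eye(q) + M_inv @ W.T @ SW)
        # Noise variance update; clamped to stay strictly positive.
        sigma2 = max(np.trace(S - SW @ M_inv @ W_new.T) / d, 1e-12)
        W = W_new
    return mean, W, sigma2


def complement_basis(W):
    """Orthonormal basis (d x (d - q)) of the complement of span(W)."""
    d, q = W.shape
    U, _, _ = np.linalg.svd(W, full_matrices=True)
    return U[:, q:]


def project_features(X, mean, W):
    """Project centred features onto the complementary subspace."""
    return (X - mean) @ complement_basis(W)
```

In this sketch the same `mean` and `W`, estimated once from training features, would be reused for both training and test features so that the two sets live in the same projected space.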
Keywords :
Gaussian processes; covariance matrices; expectation-maximisation algorithm; principal component analysis; speaker recognition; speech synthesis; EM algorithm; GMM-based TI speaker identification; PPCA; TI speaker identification method; complementary subspace; covariance matrix; feature vectors; large training speech feature data; phoneme variability; phoneme-dependent subspace; probabilistic principle component analysis; projection matrix; speaker recognition performance; text-independent speaker recognition system; Feature extraction; Mel frequency cepstral coefficient; Principal component analysis; Probabilistic logic; Speaker recognition; Speech; Vectors; Probabilistic Principle Component Analysis; eigenvoice; speaker identification; subspace projection;
Conference_Title :
System Science, Engineering Design and Manufacturing Informatization (ICSSEM), 2012 3rd International Conference on
Conference_Location :
Chengdu
Print_ISBN :
978-1-4673-0914-1
DOI :
10.1109/ICSSEM.2012.6340721