DocumentCode
1973860
Title
A text-independent speaker recognition system based on Probabilistic Principle Component Analysis
Author
Xiao-chun, Lu ; Jun-xun, Yin ; Wei-ping, Hu
Author_Institution
Sch. of Electron. & Inf. Eng., South China Univ. of Technol., Guangzhou, China
Volume
1
fYear
2012
fDate
20-21 Oct. 2012
Firstpage
255
Lastpage
260
Abstract
To alleviate the problem of severe degradation of speaker recognition performance because of the phoneme variability between training and testing speech data, in the text-independent speaker recognition system. The paper proposed a text-independent (TI) speaker identification method that suppresses the phonetic information by a subspace method, Probabilistic Principle Component Analysis (PPCA) is utilized to construct these subspaces. Firstly, the covariance matrix was obtained from the large training speech feature data, and then the projection matrix was obtained using the EM algorithm. In the proposed method, it is assumed that a subspace with large variance in the speech feature space is a “phoneme-dependent subspace” and a complementary subspace of it is a “phoneme-independent subspace”, the feature vectors of train/test speech data are projected to a phoneme-independent subspace and a new feature vectors are obtained. In GMM-based TI speaker identification experiments, the new feature vectors improves the identification rate by 16.25% and 2.99% respectively, compared with conventional MFCC, PCA-based MFCC. It shows that the new feature vectors of the proposed method can efficiently capture speaker-discriminative information, and suppress the other speech information.
Keywords
Gaussian processes; covariance matrices; expectation-maximisation algorithm; principal component analysis; speaker recognition; speech synthesis; EM algorithm; GMM-based TI speaker identification; PPCA; TI speaker identification method; complementary subspace; covariance matrix; feature vectors; large training speech feature data; phoneme variability; phoneme-dependent subspace; probabilistic principle component analysis; projection matrix; speaker recognition performance; text-independent speaker recognition system; Feature extraction; Mel frequency cepstral coefficient; Principal component analysis; Probabilistic logic; Speaker recognition; Speech; Vectors; Probabilitic Principle Component Analysis; eigenvoice; speaker identification; subspace projection;
fLanguage
English
Publisher
ieee
Conference_Titel
System Science, Engineering Design and Manufacturing Informatization (ICSEM), 2012 3rd International Conference on
Conference_Location
Chengdu
Print_ISBN
978-1-4673-0914-1
Type
conf
DOI
10.1109/ICSSEM.2012.6340721
Filename
6340721
Link To Document