مرکز منطقه ای اطلاع رساني علوم و فناوري - A text-independent speaker recognition system based on Probabilistic Principle Component Analysis

DocumentCode :

1973860

Title :

A text-independent speaker recognition system based on Probabilistic Principle Component Analysis

Author :

Xiao-chun, Lu ; Jun-xun, Yin ; Wei-ping, Hu

Author_Institution :

Sch. of Electron. & Inf. Eng., South China Univ. of Technol., Guangzhou, China

Volume :

fYear :

2012

fDate :

20-21 Oct. 2012

Firstpage :

255

Lastpage :

260

Abstract :

To alleviate the problem of severe degradation of speaker recognition performance because of the phoneme variability between training and testing speech data, in the text-independent speaker recognition system. The paper proposed a text-independent (TI) speaker identification method that suppresses the phonetic information by a subspace method, Probabilistic Principle Component Analysis (PPCA) is utilized to construct these subspaces. Firstly, the covariance matrix was obtained from the large training speech feature data, and then the projection matrix was obtained using the EM algorithm. In the proposed method, it is assumed that a subspace with large variance in the speech feature space is a “phoneme-dependent subspace” and a complementary subspace of it is a “phoneme-independent subspace”, the feature vectors of train/test speech data are projected to a phoneme-independent subspace and a new feature vectors are obtained. In GMM-based TI speaker identification experiments, the new feature vectors improves the identification rate by 16.25% and 2.99% respectively, compared with conventional MFCC, PCA-based MFCC. It shows that the new feature vectors of the proposed method can efficiently capture speaker-discriminative information, and suppress the other speech information.

Keywords :

Gaussian processes; covariance matrices; expectation-maximisation algorithm; principal component analysis; speaker recognition; speech synthesis; EM algorithm; GMM-based TI speaker identification; PPCA; TI speaker identification method; complementary subspace; covariance matrix; feature vectors; large training speech feature data; phoneme variability; phoneme-dependent subspace; probabilistic principle component analysis; projection matrix; speaker recognition performance; text-independent speaker recognition system; Feature extraction; Mel frequency cepstral coefficient; Principal component analysis; Probabilistic logic; Speaker recognition; Speech; Vectors; Probabilitic Principle Component Analysis; eigenvoice; speaker identification; subspace projection;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

System Science, Engineering Design and Manufacturing Informatization (ICSEM), 2012 3rd International Conference on

Conference_Location :

Chengdu

Print_ISBN :

978-1-4673-0914-1

Type :

conf

DOI :

10.1109/ICSSEM.2012.6340721

Filename :

6340721

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1973860