DocumentCode :
3428360
Title :
Normalization of total variability matrix for i-vector/PLDA speaker verification
Author :
Wei Rao ; Man-Wai Mak ; Kong-Aik Lee
Author_Institution :
Dept. of Electron. & Inf. Eng., Hong Kong Polytech. Univ., Hong Kong, China
fYear :
2015
fDate :
19-24 April 2015
Firstpage :
4180
Lastpage :
4184
Abstract :
Gaussian PLDA with uncertainty propagation is effective for i-vector based speaker verification. The idea is to propagate the uncertainty of i-vectors caused by the duration variability of utterances to the PLDA model. However, a limitation of the method is the difficulty of performing length normalization on the posterior covariance matrix of an i-vector. This paper proposes a method to avoid performing length normalization on i-vectors in Gaussian PLDA modeling so that uncertainty propagation can be directly applied without transforming the posterior covariance matrices of i-vectors. Instead of performing length normalization on i-vectors independently, the proposed method normalizes the column vectors of the total variability matrix. Because the i-vectors of all utterances are derived from the same normalized total variability matrix, they will be subject to the same degree of normalization, thereby avoiding the undesirable distortion introduced by the utterance-dependent length-normalization process. Experimental results on both NIST 2010 and 2012 SREs demonstrate that the proposed method achieves a performance similar to (and in some situations better than) that of Gaussian PLDA with length normalization. The method has the potential of improving the performance of uncertainty propagation for i-vector/PLDA speaker verification.
Keywords :
Gaussian distribution; covariance matrices; speaker recognition; vectors; Gaussian PLDA modeling; column vectors; duration variability; i-vector-PLDA speaker verification; normalized total variability matrix; posterior covariance matrix; probabilistic linear discriminant analysis; uncertainty propagation; utterance-dependent length-normalization process; Distortion; Logic gates; NIST; Uncertainty; Total variability matrix; i-vectors; probabilistic linear discriminant analysis; speaker verification; uncertainty propagation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
Conference_Location :
South Brisbane, QLD
Type :
conf
DOI :
10.1109/ICASSP.2015.7178758
Filename :
7178758
Link To Document :
بازگشت