Title :
Cross-lingual speaker verification based on linear transform
Author :
Askar, Rozi ; Dong Wang ; Fanhu Bie ; Wang, Jun ; Zheng, Thomas Fang
Author_Institution :
Center for Speech & Language Technol., Tsinghua Univ., Beijing, China
Abstract :
Speaker verification suffers from serious performance degradation if the enrollment and test speech are in different languages. This degradation can be largely attributed to the different distributions of acoustic features in different languages. This paper proposes a linear transform approach which projects speech signals from its own language to another language so that the language mismatch between enrollment and test can be mitigated. The constrained maximum likelihood linear regression (CMLLR) is adopted to conduct the linear transform in the feature domain. The proposed approach has been evaluated on a Chinese-Uyghur cross-lingual speaker verification task. We collected a bilingual speech database CSLT-CUDGT2014 which consists of 113 female speakers who can speak both Standard Chinese and Uyghur. Based on this database and with the proposed linear transform, a relative improvement about 10% in the equal error rate (EER) was achieved.
Keywords :
feature extraction; maximum likelihood estimation; regression analysis; speaker recognition; transforms; CMLLR; CSLT-CUDGT2014 bilingual speech database; Chinese-Uyghur cross-lingual speaker verification task; EER; acoustic features distribution; constrained maximum likelihood linear regression; cross-lingual speaker verification; equal error rate; feature domain; language mismatch; linear transform approach; Acoustics; Databases; Degradation; Speaker recognition; Speech; Standards; Transforms; CM-LLR; cross-lingual; feature transform; language mismatch; speaker verification;
Conference_Titel :
Signal and Information Processing (ChinaSIP), 2015 IEEE China Summit and International Conference on
Conference_Location :
Chengdu
DOI :
10.1109/ChinaSIP.2015.7230457