Title :
Dual-microphone based binary mask estimation for robust speaker verification
Author :
Zhao, Yali ; Fu, Zhong-Hua ; Xie, Lei ; Zhang, Jian ; Zhang, Yanning
Author_Institution :
Shaanxi Provincial Key Lab. of Speech & Image Inf. Process., Northwestern Polytech. Univ., Xian, China
Abstract :
Missing feature theory (MFT) has shown great potential for robust speaker recognition in noisy environments. Accurate estimation of binary mask is crucial in MFT-based speaker recognition. This paper addresses the speaker verification problem using MFT in a practical scenario: the location of target speaker is fixed while the locations of noise interferences are unknown. Specifically, we propose a dual-microphone semi-blind approach to estimate the binary mask. During system initialization, a spatial location model for the target is trained precisely. Then a spatial model for corrupted speech is obtained on-line by model adaptation. Finally, the binary mask is estimated by likelihood comparison. Moreover, we propose a reliable frame selection method to further focus on the reliable speech frames for missing data speaker recognition. Experimental results demonstrate that our proposed approach achieves substantial improvements in recognition performance in both white noise and speech corrupted conditions.
Keywords :
speaker recognition; MFT-based speaker recognition; dual-microphone based binary mask estimation; dual-microphone semi-blind approach; missing feature theory; noisy environments; reliable frame selection method; robust speaker verification; speaker verification problem; system initialization; Estimation; Noise; Robustness; Speaker recognition; Speech; Speech recognition;
Conference_Titel :
Audio, Language and Image Processing (ICALIP), 2012 International Conference on
Conference_Location :
Shanghai
Print_ISBN :
978-1-4673-0173-2
DOI :
10.1109/ICALIP.2012.6376764