Title :
Robust feature front-end for speaker identification
Author :
Liu, Gang ; Lei, Yun ; Hansen, John H L
Author_Institution :
Center for Robust Speech Systems (CRSS), University of Texas at Dallas, Richardson, TX, USA
Abstract :
One important challenge for speaker identification (SID) systems is sustaining performance in diverse conditions. This study presents a novel front-end feature extraction method for SID in clean, noisy, and channel-mismatched acoustic conditions. To address the problem, the perceptual minimum variance distortionless response (PMVDR) feature is employed. While PMVDR has been used successfully for automatic speech recognition (ASR) in noise, it has not previously been considered for SID. We also incorporate longer-term temporal speaker knowledge using the shifted delta cepstral (SDC) algorithm. Evaluations on YOHO and on a new, diversified Robust Open-Set Speaker Identification (ROSSI) database show that both PMVDR and its combination with SDC can improve performance significantly. Compared with traditional feature extraction, PMVDR and PMVDR-SDC always yield an improvement across diverse adverse conditions. In addition, PMVDR-SDC can contribute further improvement in the presence of noise and channel mismatch.
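As an illustration of the SDC step named in the abstract, the following is a minimal sketch of shifted delta cepstra computed from a generic matrix of per-frame cepstral features. It is not the authors' implementation: the function name `shifted_delta_cepstra`, the parameter values (delta spread `d=1`, block shift `p=3`, block count `k=7`), and the edge-padding choice are assumptions made for this example, since the abstract does not state the configuration used. PMVDR extraction itself is not shown; any frame-by-coefficient array can serve as input.

```python
import numpy as np


def shifted_delta_cepstra(cep, d=1, p=3, k=7):
    """Stack k delta blocks per frame (hypothetical helper, assumed parameters).

    cep : array of shape (T, N), one row of N cepstral coefficients per frame.
    Block i at frame t is cep[t + i*p + d] - cep[t + i*p - d].
    Returns an array of shape (T, N * k).
    """
    cep = np.asarray(cep, dtype=float)
    num_frames = cep.shape[0]
    # Repeat edge frames so every index t + i*p +/- d is defined (an assumption;
    # other boundary treatments are possible).
    padded = np.pad(cep, ((d, (k - 1) * p + d), (0, 0)), mode="edge")
    blocks = []
    for i in range(k):
        start = i * p
        # In padded coordinates, original frame t sits at index t + d.
        ahead = padded[start + 2 * d : start + 2 * d + num_frames]
        behind = padded[start : start + num_frames]
        blocks.append(ahead - behind)
    return np.hstack(blocks)


if __name__ == "__main__":
    frames = np.random.default_rng(0).normal(size=(100, 12))
    sdc = shifted_delta_cepstra(frames)
    print(sdc.shape)
```

Each output row concatenates deltas taken at k shifted positions ahead of the current frame, which is how SDC captures longer temporal context than a single delta. In practice the SDC vector is often appended to the static coefficients, though whether the paper does so is not stated in the abstract.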
Keywords :
acoustic signal processing; cepstral analysis; feature extraction; noise; speaker recognition; YOHO; automatic speech recognition; channel mismatch; channel-mismatched acoustic condition; diversified robust open-set speaker identification database; front-end feature extraction method; perceptual minimum variance distortionless response feature; robust feature front-end; shifted delta cepstral algorithm; speaker identification system; temporal speaker knowledge; Databases; Mel frequency cepstral coefficient; Noise measurement; Robustness; Speech; PMVDR; SDC; speaker identification
Conference_Title :
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location :
Kyoto
Print_ISBN :
978-1-4673-0045-2
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2012.6288853