Title :
Pitch maxima for robust speaker recognition
Author :
Krishnakumar, S. ; Kumar, K. R Prasanna ; Balakrishnan, N.
Author_Institution :
Indian Inst. of Sci., Bangalore, India
Abstract :
This paper presents a novel approach to the design of a robust speaker recognition system. A noise-free synthesised spectrum is produced from a noisy spectrum. This synthesised spectrum is used for feature extraction. From noisy speech, the pitch is extracted using a robust pitch estimation algorithm. This also helps in identifying the voiced segments of speech which are the only ones considered in the synthesis. After estimating pitch, the noisy signal is sampled in the frequency domain at pitch harmonics. From the sampled data, a reconstruction procedure is suggested in this paper in order to generate a noise-free synthesised spectrum which retains the characteristics of the speaker but rejects the noisy contributions. We compare results with the original MFCC parameters and show that on a 100 speaker database, the MFCC parameters computed on the reconstructed spectrum consistently outperforms conventional MFCC parameters over a full range of noise levels under mismatched conditions, while maintaining comparable performance under matched conditions.
Keywords :
feature extraction; frequency estimation; signal reconstruction; signal sampling; speaker recognition; spectral analysis; speech synthesis; MFCC parameters; feature extraction; noise-free synthesised spectrum; noisy signal sampling; performance; pitch extraction; pitch maxima; reconstruction procedure; robust pitch estimation algorithm; robust speaker recognition; speech synthesis; voiced segments; Data mining; Feature extraction; Frequency domain analysis; Frequency estimation; Mel frequency cepstral coefficient; Noise generators; Noise robustness; Signal synthesis; Speaker recognition; Speech synthesis;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
Print_ISBN :
0-7803-7663-3
DOI :
10.1109/ICASSP.2003.1202329