DocumentCode
3489064
Title
Pitch maxima for robust speaker recognition
Author
Krishnakumar, S. ; Kumar, K. R Prasanna ; Balakrishnan, N.
Author_Institution
Indian Inst. of Sci., Bangalore, India
Volume
2
fYear
2003
fDate
6-10 April 2003
Abstract
This paper presents a novel approach to the design of a robust speaker recognition system. A noise-free synthesised spectrum is produced from a noisy spectrum and is then used for feature extraction. The pitch is extracted from the noisy speech using a robust pitch estimation algorithm. Pitch estimation also identifies the voiced segments of speech, which are the only segments considered in the synthesis. After the pitch is estimated, the noisy signal is sampled in the frequency domain at the pitch harmonics. From the sampled data, a reconstruction procedure is proposed that generates a noise-free synthesised spectrum, which retains the characteristics of the speaker but rejects the noise contributions. We compare results with the original MFCC parameters and show that, on a 100-speaker database, the MFCC parameters computed on the reconstructed spectrum consistently outperform conventional MFCC parameters over a full range of noise levels under mismatched conditions, while maintaining comparable performance under matched conditions.
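The harmonic-sampling step described in the abstract can be illustrated with a minimal sketch. It is not the authors' method: the pitch `f0` is assumed to be already known, the function names are hypothetical, and linear interpolation between harmonic samples is only a stand-in for the reconstruction procedure, which the abstract does not specify. MFCC extraction is omitted.

```python
import cmath
import math


def magnitude_spectrum(frame):
    """Naive DFT magnitude for bins 0..N/2 (adequate for one short demo frame)."""
    n = len(frame)
    return [
        abs(sum(x * cmath.exp(-2j * math.pi * k * t / n) for t, x in enumerate(frame)))
        for k in range(n // 2 + 1)
    ]


def sample_at_harmonics(mag, f0, fs, n):
    """Return (bin, magnitude) pairs at the bins nearest each multiple of f0 below fs/2."""
    samples = []
    h = 1
    while h * f0 < fs / 2:
        b = round(h * f0 * n / fs)
        samples.append((b, mag[b]))
        h += 1
    return samples


def reconstruct_envelope(samples, n_bins):
    """Assumed stand-in: linear interpolation between harmonic samples,
    held constant below the first and above the last harmonic."""
    env = [0.0] * n_bins
    first_bin, first_mag = samples[0]
    for b in range(first_bin + 1):
        env[b] = first_mag
    for (b0, m0), (b1, m1) in zip(samples, samples[1:]):
        for b in range(b0, b1 + 1):
            w = (b - b0) / (b1 - b0)
            env[b] = (1 - w) * m0 + w * m1
    last_bin, last_mag = samples[-1]
    for b in range(last_bin, n_bins):
        env[b] = last_mag
    return env


# Toy voiced frame: five harmonics of a 200 Hz pitch plus an interfering
# tone at 1500 Hz, which falls between the 7th and 8th harmonics.
fs, n, f0 = 8000, 400, 200.0
frame = [
    sum((1.0 / h) * math.cos(2 * math.pi * h * f0 * t / fs) for h in range(1, 6))
    + 0.5 * math.cos(2 * math.pi * 1500 * t / fs)
    for t in range(n)
]
mag = magnitude_spectrum(frame)
samples = sample_at_harmonics(mag, f0, fs, n)
envelope = reconstruct_envelope(samples, len(mag))
```

In this toy frame the interfering tone lies on a bin that is not a pitch harmonic, so it is never sampled and does not appear in `envelope`. Noise that coincides with a harmonic bin would still pass through in this simplified version.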
Keywords
feature extraction; frequency estimation; signal reconstruction; signal sampling; speaker recognition; spectral analysis; speech synthesis; MFCC parameters; noise-free synthesised spectrum; noisy signal sampling; performance; pitch extraction; pitch maxima; reconstruction procedure; robust pitch estimation algorithm; robust speaker recognition; voiced segments; Data mining; Feature extraction; Frequency domain analysis; Frequency estimation; Mel frequency cepstral coefficient; Noise generators; Noise robustness; Signal synthesis; Speaker recognition; Speech synthesis
fLanguage
English
Publisher
IEEE
Conference_Titel
Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
ISSN
1520-6149
Print_ISBN
0-7803-7663-3
Type
conf
DOI
10.1109/ICASSP.2003.1202329
Filename
1202329