DocumentCode :
3424409
Title :
A pitch extraction algorithm in noise based on temporal and spectral representations
Author :
Shahnaz, C. ; Zhu, W.-P. ; Ahmad, M.O.
Author_Institution :
Dept. of Electr. & Comput. Eng., Concordia Univ., Montreal, QC
fYear :
2008
fDate :
March 31 2008-April 4 2008
Firstpage :
4477
Lastpage :
4480
Abstract :
In this paper, a new algorithm for pitch extraction from noisy speech signals based on both temporal and spectral representations is presented. We derive a harmonic sinusoidal correlation (HSC) model of clean speech as a temporal representation. Given only a noisy speech frame, a noise-robust least-squares minimization technique is proposed to acquire the parameters of the HSC model which are directly employed for the accurate estimation of a pitch-harmonic (PH). Exploiting the extracted PH and based on a spectral representation which is an enhanced spectrum in the discrete cosine transform domain, a two-fold criterion is developed in order to achieve the true consecutive number corresponding to PH that is finally adopted for pitch detection in the presence of noise. Simulation results using the Keele pitch extraction reference database manifest that combining the multi cues obtained from the temporal as well as spectral representations, the proposed algorithm is able to achieve a superior efficacy in comparison to some of the existing methods from high to very low signal-to-noise ratio (SNR) levels.
Keywords :
feature extraction; least squares approximations; minimisation; speech enhancement; speech recognition; Keele pitch extraction reference database; discrete cosine transform domain; harmonic sinusoidal correlation model; noise-robust least-squares minimization technique; pitch extraction algorithm; pitch-harmonic estimation; signal-to-noise ratio; spectral representation; temporal-spectral representations; two-fold criterion; Autocorrelation; Discrete cosine transforms; Noise robustness; Power harmonic filters; Signal processing algorithms; Signal to noise ratio; Speech analysis; Speech enhancement; Speech processing; White noise; Discrete Cosine Transform; Pitch extraction; harmonic sinusoidal correlation model; low SNR; pitch-harmonic;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location :
Las Vegas, NV
ISSN :
1520-6149
Print_ISBN :
978-1-4244-1483-3
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2008.4518650
Filename :
4518650
Link To Document :
بازگشت