Title :
A robust pitch detection in noisy speech with band-pass filtering on modulation spectra
Author :
Xu, Xin ; Miyanaga, Yoshikazu
Author_Institution :
Graduate Sch. of Inf. Sci. & Technol., Hokkaido Univ., Sapporo, Japan
Abstract :
This report presents a new robust pitch detection method for noisy speech. The conventional method, AUTOC, is vulnerable to severe noise, especially periodic noise; with additive car noise, its detection accuracy deteriorates considerably. The proposed method adds to AUTOC a process that applies band-pass filtering to the modulation spectra of the speech sections. The second-power amplitude spectrum of speech used in the autocorrelation computation of AUTOC is replaced by the third-power amplitude spectrum. In addition, a band-limiting operation, adapted to the pitch features of human speech, is carried out in the frequency domain. The proposed method is compared with AUTOC and with a recent method based on the exponentiated band-limited amplitude spectrum in an evaluation using 10 Chinese words. The experiments cover noise levels from 0 dB to 10 dB SNR with white noise, colored noise, and car interior noise. The results show that the gross pitch error (GPE) of the proposed method is significantly reduced in severely noisy speech.
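The spectral part of the abstract (an exponentiated, band-limited amplitude spectrum fed to the autocorrelation computation) can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `estimate_pitch`, the Hann window, the 50-1000 Hz band, and the 60-400 Hz search range are all assumptions chosen for illustration, and the band-pass filtering on the modulation spectra is omitted entirely.

```python
import numpy as np

def estimate_pitch(frame, fs, power=3.0, band=(50.0, 1000.0), f0_range=(60.0, 400.0)):
    """Return an f0 estimate in Hz for one voiced frame (illustrative sketch only)."""
    frame = np.asarray(frame, dtype=float)
    n = len(frame)
    # Zero-pad to at least 2n-1 samples so the inverse FFT is not circularly wrapped.
    nfft = 1 << (2 * n - 1).bit_length()
    # Hann window is an assumption; the abstract does not name a window.
    amplitude = np.abs(np.fft.rfft(frame * np.hanning(n), nfft))
    freqs = np.fft.rfftfreq(nfft, d=1.0 / fs)
    # power=2 would give the ordinary autocorrelation; power=3 follows the
    # third-power amplitude spectrum mentioned in the abstract.
    shaped = amplitude ** power
    # Band limitation in the frequency domain (band edges are assumed values).
    shaped[(freqs < band[0]) | (freqs > band[1])] = 0.0
    # Inverse transform of the shaped spectrum: a generalized autocorrelation.
    acf = np.fft.irfft(shaped, nfft)
    # Search for the strongest peak among lags matching plausible pitch values.
    lag_min = int(fs / f0_range[1])
    lag_max = int(fs / f0_range[0])
    lag = lag_min + int(np.argmax(acf[lag_min:lag_max + 1]))
    return fs / lag
```

For a clean synthetic frame sampled at 8 kHz with a 200 Hz fundamental and a few harmonics, the returned value should lie close to 200 Hz; behavior under the noise conditions listed in the abstract is not demonstrated by this sketch.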
Keywords :
band-pass filters; filtering theory; frequency-domain analysis; signal detection; speech processing; band-limited amplitude spectrum; band-pass filtering; car interior noise; colored noise; frequency domain; gross pitch error; modulation spectra; noisy speech; power amplitude spectrum; robust pitch detection; white noise; Additive noise; Autocorrelation; Band pass filters; Colored noise; Filtering; Noise level; Noise robustness; Signal to noise ratio; Speech processing; Working environment noise;
Conference_Title :
Communications and Information Technology, 2005. ISCIT 2005. IEEE International Symposium on
Print_ISBN :
0-7803-9538-7
DOI :
10.1109/ISCIT.2005.1566849