Title :
Harmonic-Temporal Clustering of Speech for Single and Multiple F0 Contour Estimation in Noisy Environments
Author :
Le Roux, Jonathan ; Kameoka, Hirokazu ; Ono, Nobutaka ; de Cheveigne, A. ; Sagayama, Shigeki
Author_Institution :
Graduate Sch. of Inf. Sci. & Technol., Tokyo Univ., Japan
Abstract :
We present in this paper a novel F0 contour estimation method based on a parametric description of the wavelet power spectrum of speech that accounts for its structure simultaneously in time and frequency directions. We model the speech spectrum as a sequence of spectral clusters governed by a smooth common F0 contour expressed as a spline curve. The harmonic and temporal structure of these clusters and their common F0 contour are estimated simultaneously. Through experimental comparisons with existing methods, we show that our algorithm is competitive on clean single-speaker speech, and that it outperforms existing methods both in the presence of noise and for the estimation of multiple F0 contours of cochannel concurrent speech.
Keywords :
pattern clustering; speech processing; wavelet transforms; clean single-speaker speech; cochannel concurrent speech; multiple F0 contour estimation; noisy environments; single F0 contour estimation; speech harmonic-temporal clustering; wavelet power spectrum; Acoustic noise; Algorithm design and analysis; Clustering algorithms; Frequency estimation; Hidden Markov models; Kernel; Speech analysis; Speech enhancement; Spline; Working environment noise; acoustic scene analysis; harmonic-temporal structured clustering (HTC); multi-pitch estimation; noisy speech; spline F0 contour;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
Conference_Location :
Honolulu, HI
Print_ISBN :
1-4244-0727-3
DOI :
10.1109/ICASSP.2007.367254