Title :
Detection of Dynamic Structures of Speech Fundamental Frequency in Tonal Languages
Author :
Hong, Hong ; Zhao, Zhengmin ; Wang, Xinlong ; Tao, Zhiyong
Author_Institution :
Key Lab. of Modern Acoust., Nanjing Univ., Nanjing, China
Abstract :
An approach is proposed specifically for capturing the fine dynamic structures of the speech fundamental frequency F0 when it varies nonmonotonically, as in the third tones of Chinese speech. The method first estimates the rough trend of variation of an F0 contour by means of the cepstrum technique, and then uses that trend as a reference to track the variation and to calculate the detailed contour from a few intrinsic mode functions obtained by ensemble empirical mode decomposition. Intensive evaluation and direct comparisons with existing methods are conducted on the standard Chinese Mandarin database, showing the effectiveness of the proposed method in acquiring accurate and reliable F0 contours from speech signals, even those heavily contaminated with noise.
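As an illustration of the first stage only, the rough F0 estimate from the cepstrum can be sketched as below. This is a minimal sketch of textbook cepstral pitch estimation, not the authors' implementation: the function name `cepstral_f0`, the Hamming window, and the 60 to 400 Hz search range are assumptions chosen for the example, and the refinement stage based on ensemble empirical mode decomposition is not shown.

```python
import numpy as np

def cepstral_f0(frame, fs, fmin=60.0, fmax=400.0):
    """Rough F0 (Hz) of one voiced frame from the peak of its real cepstrum.

    fmin and fmax bound the search range; they are illustrative defaults,
    not values taken from the paper.
    """
    # Taper the frame to limit spectral leakage.
    windowed = frame * np.hamming(len(frame))
    # Real cepstrum: inverse FFT of the log magnitude spectrum.
    log_mag = np.log(np.abs(np.fft.rfft(windowed)) + 1e-12)
    cepstrum = np.fft.irfft(log_mag)
    # A periodic excitation shows up as a peak at quefrency fs / F0 samples.
    q_lo = int(fs / fmax)
    q_hi = int(fs / fmin)
    peak = q_lo + int(np.argmax(cepstrum[q_lo:q_hi + 1]))
    return fs / peak

# Synthetic voiced frame: harmonics of 120 Hz with 1/k amplitudes.
fs = 16000
t = np.arange(1024) / fs
frame = sum(np.sin(2 * np.pi * 120.0 * k * t) / k for k in range(1, 61))
f0 = cepstral_f0(frame, fs)
```

Applying such an estimator frame by frame yields only a coarse contour, because the peak position is quantized to whole samples of quefrency; that coarseness is why the approach described above treats the cepstral result as a reference trend and not as the final contour.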
Keywords :
cepstral analysis; signal detection; speech processing; Chinese Mandarin; cepstrum technique; dynamic structure detection; empirical mode decomposition; speech fundamental frequency; third tones; tonal languages; Acoustic signal detection; Cepstrum; Databases; Detectors; Event detection; Frequency estimation; Natural languages; Noise; Noise measurement; Speech; Speech analysis; Speech enhancement; Time frequency analysis; dynamic structure of $F_{0}$; ensemble EMD; fundamental frequency
Journal_Title :
Signal Processing Letters, IEEE
DOI :
10.1109/LSP.2010.2058799