Title :
Use of Poisson Processes to Generate Fundamental Frequency Contours
Author :
Ni, Jinfu ; Nakamura, Satoshi
Author_Institution :
Nat. Inst. of Inf. & Commun. Technol.
Abstract :
The prosodic contributions to voice fundamental frequency (F0) contours can be decomposed into a sparse series of tonal targets (F0 peaks and valleys). The transitions through these targets are interpolated by spline or filtering functions to predict the shape of the F0 contours. A functional model was proposed for this purpose in previous work. This paper presents an enhanced version of that model, obtained by replacing its decay filter with a Poisson-process-induced filter. The new version is more general because the decay filter is a special case of the Poisson-process-induced filter. The new filter can delay the decaying process while interpolations are being uttered, so a target point can act as a target level if necessary. Algorithms for estimating the parameters, which have been implemented in software, are also presented. Experiments on thousands of observed F0 contours from Mandarin, Japanese, and English indicate that the enhanced version significantly facilitates their automatic parameterization.
Keywords :
filtering theory; speech processing; splines (mathematics); stochastic processes; English; Japanese; Mandarin; Poisson-process-induced filter; decaying process; filtering functions; fundamental frequency contours; parameters estimation; spline interpolation; voice fundamental frequency; Communications technology; Filtering; Filters; Frequency; Information analysis; Natural languages; Parameter estimation; Shape; Speech analysis; Spline; Poisson distributions; Prosody modeling; Speech processing; Speech synthesis; Voice conversion
Conference_Title :
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
Conference_Location :
Honolulu, HI
Print_ISBN :
1-4244-0727-3
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2007.367040