DocumentCode
3526296
Title
Regression-based clustering for hierarchical pitch conversion
Author
Lee, Chung-han ; Hsia, Chi-Chun ; Wu, Chung-Hsien ; Lin, Mai-Chun
Author_Institution
Dept. of Comput. Sci. & Inf. Eng., Nat. Cheng Kung Univ., Tainan
fYear
2009
fDate
19-24 April 2009
Firstpage
3593
Lastpage
3596
Abstract
This study presents a hierarchical pitch conversion method using regression-based clustering for conversion function modeling. The pitch contour of a speech utterance is first extracted and decomposed into sentence-, word and sub-syllable-level features in a top-down mechanism. The pair-wise source and target pitch feature vectors at each level are then clustered to generate the pitch conversion function. Regression-based clustering, which clusters the feature vectors to achieve a minimum conversion error between the predicted and the real feature vectors is proposed for conversion function generation. A classification and regression tree (CART), incorporating linguistic, phonetic and source prosodic features, is adopted to select the most suitable function for pitch conversion. Several objective and subjective evaluations were conducted and the comparison results to the GMM-based methods for pitch conversion confirm the performance of the proposed regression-based clustering approach.
Keywords
pattern clustering; regression analysis; signal classification; speech synthesis; trees (mathematics); classification-and-regression tree; conversion function modeling; hierarchical pitch conversion; linguistic feature; regression-based clustering; speech synthesis; speech utterance extraction; sub-syllable-level feature; Classification tree analysis; Computer industry; Computer science; Curve fitting; Data mining; Mean square error methods; Medical services; Polynomials; Predictive models; Speech synthesis; Regression-based; clustering; hierarchical; pitch conversion;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
Conference_Location
Taipei
ISSN
1520-6149
Print_ISBN
978-1-4244-2353-8
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2009.4960403
Filename
4960403
Link To Document