DocumentCode :
3144636
Title :
Generalized F0 modelling with absolute and relative pitch features for singing voice synthesis
Author :
Lee, S.W. ; Ang, Shen Ting ; Dong, Minghui ; Li, Haizhou
Author_Institution :
Human Language Technol. Dept., A*STAR, Singapore, Singapore
fYear :
2012
fDate :
25-30 March 2012
Firstpage :
429
Lastpage :
432
Abstract :
Natural pitch fluctuations are essential to human singing. To effectively synthesize singing voice, the generation of these pitch fluctuations is necessary. Previous synthesis methods classify and reproduce them individually. These fluctuations, however, are found to be dependent and vary under different contexts. This paper proposes a generalized framework for F0 modelling to learn and generate these fluctuations on a note basis. Context-dependent hidden Markov models, representing the possible fluctuations observed in particular musical contexts, are built. To capture the pitch fluctuation and the voicing transitions in human singing, we employ both absolute and relative pitch as the modelling features. Results of our experiments on pitch accuracy and quality of synthesized singing showed that the proposed framework achieves accurate pitch generation and better naturalness of synthesized outputs.
Keywords :
hidden Markov models; speech synthesis; voice equipment; absolute pitch feature; context-dependent hidden Markov model; generalized F0 modelling; musical context; natural pitch fluctuation; relative pitch feature; singing voice synthesis; voicing transition; Context; Context modeling; Fluctuations; Hidden Markov models; Humans; Speech; Training; HMM; modelling; pitch; singing; synthesis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location :
Kyoto
ISSN :
1520-6149
Print_ISBN :
978-1-4673-0045-2
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2012.6287908
Filename :
6287908
Link To Document :
بازگشت