DocumentCode
180504
Title
Regression approaches to perceptual age control in singing voice conversion
Author
Kobayashi, Kaoru ; Toda, Takechi ; Nakano, T. ; Goto, Misako ; Neubig, Graham ; Sakti, Sakriani ; Nakamura, Shigenari
Author_Institution
Grad. Sch. of Inf. Sci., Nara Inst. of Sci. & Technol., Ikoma, Japan
fYear
2014
fDate
4-9 May 2014
Firstpage
7904
Lastpage
7908
Abstract
The perceptual age of a singing voice is the age of the singer as perceived by the listener, and is one of the notable characteristics that determine how a song is perceived. In this paper, we describe a novel voice timbre control technique based on perceptual age for singing voice conversion (SVC). Singers can sing expressively by controlling prosody and voice timbre, but the variety of voices that a singer can produce is limited by physical constraints. Previous work has attempted to overcome this limitation through statistical voice conversion, which makes it possible to convert the singing voice timbre of an arbitrary source singer into that of an arbitrary target singer. However, it is still difficult to control singing voice characteristics intuitively by manipulating parameters that correspond to specific physical traits, such as gender and age. We therefore develop a technique for controlling voice timbre based on perceptual age while maintaining the singer's individuality. Experimental results show that the proposed voice timbre control method makes it possible to change the singer's perceptual age without adversely affecting the perceived individuality.
Keywords
regression analysis; speech synthesis; SVC; age control; arbitrary source singer; arbitrary target singer; perceptual age control; regression approaches; singing voice characteristics; singing voice conversion; singing voice timbre; statistical voice conversion; voice timbre control technique; Joints; Speech; Static VAr compensators; Timbre; Training; Vectors; perceptual age; singer's individuality; voice timbre control
fLanguage
English
Publisher
IEEE
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
Conference_Location
Florence
Type
conf
DOI
10.1109/ICASSP.2014.6855139
Filename
6855139