DocumentCode
2235330
Title
Partial Linear Regression for Audio-Driven Talking Head Application
Author
Hsieh, Chao-Kuei ; Chen, Yung-Chang
Author_Institution
Dept. of Electr. Eng., Nat. Tsing Hua Univ., Hsinchu
fYear
2005
fDate
6-6 July 2005
Firstpage
281
Lastpage
284
Abstract
Virtual avatars in many applications are constructed manually or by a single speech-driven model, which needs a lot of training data and long training time. It is an essential problem to build up a user-dependent model more efficiently. In this paper, a new adaptation method, called the partial linear regression (PLR) is proposed and adopted in an audio-driven talking head application. This method allows users to adapt the partial parameters from the available adaptive data while keeping the others unchanged. In our experiments, the PLR algorithm can retrench the hours of time spent on retraining a new user-dependent model, and adjust the user-independent model to a more personalized one. The animated results with adapted models were 36% closer to the user-dependent model than using the pre-trained user-independent model
Keywords
avatars; computer animation; regression analysis; PLR algorithm; adaptive data; audio-driven talking head application; computer animation; partial linear regression; single speech-driven model; user-dependent model; virtual avatar; Application software; Avatars; Chaos; Hidden Markov models; Linear regression; Loudspeakers; Maximum likelihood linear regression; Neural networks; Speech synthesis; Training data;
fLanguage
English
Publisher
ieee
Conference_Titel
Multimedia and Expo, 2005. ICME 2005. IEEE International Conference on
Conference_Location
Amsterdam
Print_ISBN
0-7803-9331-7
Type
conf
DOI
10.1109/ICME.2005.1521415
Filename
1521415
Link To Document