Partial Linear Regression for Audio-Driven Talking Head Application

Author

Hsieh, Chao-Kuei ; Chen, Yung-Chang

Author_Institution

Dept. of Electr. Eng., Nat. Tsing Hua Univ., Hsinchu

fYear

2005

fDate

6-6 July 2005

Firstpage

281

Lastpage

284

Abstract

Virtual avatars in many applications are constructed manually or by a single speech-driven model, which needs a lot of training data and long training time. It is an essential problem to build up a user-dependent model more efficiently. In this paper, a new adaptation method, called the partial linear regression (PLR) is proposed and adopted in an audio-driven talking head application. This method allows users to adapt the partial parameters from the available adaptive data while keeping the others unchanged. In our experiments, the PLR algorithm can retrench the hours of time spent on retraining a new user-dependent model, and adjust the user-independent model to a more personalized one. The animated results with adapted models were 36% closer to the user-dependent model than using the pre-trained user-independent model

Keywords

avatars; computer animation; regression analysis; PLR algorithm; adaptive data; audio-driven talking head application; computer animation; partial linear regression; single speech-driven model; user-dependent model; virtual avatar; Application software; Avatars; Chaos; Hidden Markov models; Linear regression; Loudspeakers; Maximum likelihood linear regression; Neural networks; Speech synthesis; Training data;

fLanguage

English

Publisher

ieee

Conference_Titel

Multimedia and Expo, 2005. ICME 2005. IEEE International Conference on

Conference_Location

Amsterdam

Print_ISBN

0-7803-9331-7

Type

conf

DOI

10.1109/ICME.2005.1521415

Filename

1521415