DocumentCode :
3396873
Title :
Towards expressive Romanian speaking 3D avatars for multimedia interfaces
Author :
Ilie, Marija D. ; Ciobanu, Amelia ; Negrescu, Cristian ; Stanomir, Dumitru
Author_Institution :
Telecommun. Dept., Politeh. Univ. of Bucharest, Bucharest, Romania
fYear :
2013
fDate :
7-9 July 2013
Firstpage :
47
Lastpage :
50
Abstract :
This paper presents an implemented interface for Romanian language of expressive 3D talking agents, also known as 3D avatars. The major contribution of this work regards adding synchronized 3D lips animation sequences to any given Romanian TTS-generated synthetic word/text. The synchronization is performed using a syllable by syllable approach. The application is based on the Romanian-specific visual speech coarticulation model and on the Romanian logopedics platform for deaf people, both presented in earlier works by the same authors. The proposed Romanian-particular 3D avatar multimedia interface allows users to follow the avatar as it speaks Romanian based on a given text and its associated TTS-generated wave file. The efficiency of the method was successfully validated through several subjective tests, including a large number of normal-hearing testers and using a five level Likert scale to verify whether the speech animations are well-synchronized with the played sound. The results are promising for future 3D avatar live interaction applications for Romanian natives, and also for Romanian language teaching applications.
Keywords :
avatars; computer animation; handicapped aids; human computer interaction; multimedia computing; natural language interfaces; 3D avatar live interaction applications; Romanian TTS-generated synthetic word-text; Romanian language; Romanian language teaching applications; Romanian logopedics platform; Romanian natives; Romanian-specific visual speech coarticulation model; TTS-generated wav file; deaf people; expressive 3D talking agents; expressive Romanian speaking 3D avatars; five level Likert scale; multimedia interfaces; normal-hearing testers; syllable-by-syllable approach; synchronized 3D lips animation sequences; Decision support systems; 3D avatar; 3D facial animation; Romanian language;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Systems, Signals and Image Processing (IWSSIP), 2013 20th International Conference on
Conference_Location :
Bucharest
ISSN :
2157-8672
Print_ISBN :
978-1-4799-0941-4
Type :
conf
DOI :
10.1109/IWSSIP.2013.6623446
Filename :
6623446
Link To Document :
بازگشت