Korean speech recognition using phonemics for lip-sync animation

Author

Sun-Min Hwang ; Bok-Hee Song ; Han-Kyung Yun

Author_Institution

Sch. of CSE, Korea Univ. of Tech. & Educ., Chonan, South Korea

Volume

2

fYear

2014

fDate

26-28 April 2014

Firstpage

1011

Lastpage

1014

Abstract

A speaker dependent voice recognition algorithm has been developed for producing an autonomic natural animating of the character´ s mouth shape for small and medium sized animation productions or e-learning contents productions. Since the basic technique for recognizing Korean speech has been based on research results of other languages such as English and Japanese, it should check once at least or a margin for applying the Korean vocal sound system. One of reason is that Korean phonemes always have a same phonetic value. However, the scope of this study is the recognition of single vowels for a digital contents producing, particularly lip sync animation, since the lip sync producing generally requires lots of tedious hand work of animators and it seriously affects the animation producing cost and development period to get a high quality of lip animation. In this research, a real time processed automatic lip sync algorithm for virtual characters as the animation key in digital contents is studied by considering Korean vocal sound system. The proposed algorithm contributes to produce a natural condonable lip animation with the lower producing cost and the shorter development period. The recognition process consists of speech signal as the input, filtering, Fast Fourier Transform and identification. The result shows the proposed speaker dependent single vowel recognition system is able to distinguish Korean single vowels from dialogue of a dubbing artist with real-time. The average of the recognition ratio was 97.3% in the laboratory environment.

Keywords

computer animation; fast Fourier transforms; learning management systems; speaker recognition; English; Japanese; Korean speech recognition; Korean vocal sound system; animation key; automatic lip sync algorithm; digital content producing; e-learning content productions; fast Fourier transform; phonemics; speaker dependent voice recognition algorithm; virtual characters; Animation; Feature extraction; Real-time systems; Shape; Speech; Speech recognition; Synchronization; Korean phoneme; phonemics; speaker dependent; voice recognition;

fLanguage

English

Publisher

ieee

Conference_Titel

Information Science, Electronics and Electrical Engineering (ISEEE), 2014 International Conference on

Conference_Location

Sapporo

Print_ISBN

978-1-4799-3196-5

Type

conf

DOI

10.1109/InfoSEEE.2014.6947821

Filename

6947821