مرکز منطقه ای اطلاع رساني علوم و فناوري - Real-Time Continuous Phoneme Recognition System Using Class-Dependent Tied-Mixture HMM With HBT Structure for Speech-Driven Lip-Sync

DocumentCode :

982033

Title :

Real-Time Continuous Phoneme Recognition System Using Class-Dependent Tied-Mixture HMM With HBT Structure for Speech-Driven Lip-Sync

Author :

Park, JunHo ; Ko, Hanseok

Author_Institution :

Dept. of Eng., Univ. of Cambridge, Cambridge

Volume :

Issue :

fYear :

2008

Firstpage :

1299

Lastpage :

1306

Abstract :

This work describes a real-time lip-sync method using which an avatar´s lip shape is synchronized with the corresponding speech signal. Phoneme recognition is generally regarded as an important task in the operation of a real-time lip-sync system. In this work, the use of the Head-Body-Tail (HBT) model is proposed for the purpose of more efficiently recognizing phonemes which are variously uttered due to co-articulation effects. The HBT model effectively deals with the transition parts of context-dependent models for small-sized vocabulary tasks. These models provide better recognition performance than general context-dependent or context-independent models for the task of digit or vowel recognition. Moreover, each phoneme is categorized into one among four classes and the class-dependent codebook is generated to further improve the performance. Additionally, for the clear representation of the context dependency information in the transient parts, some Gaussians are excluded from class-dependent codebook. The proposed method leads to a lip-sync system that performs at a level that is similar to previous designs based on HBT and continuous hidden Markov models (CHMMs). However, our method reduces the number of model parameters by one-third and enables real-time operation.

Keywords :

avatars; hidden Markov models; signal processing; speech processing; speech recognition; CHMM; HBT structure; avatar lip shape; class-dependent tied-mixture HMM; continuous hidden Markov models; head-body-tail; hidden Markov models; real-time continuous phoneme recognition system; small-sized vocabulary tasks; speech signal; speech-driven lip-sync; Head-body-tail HMM; phoneme recognition; real-time lip-sync;

fLanguage :

English

Journal_Title :

Multimedia, IEEE Transactions on

Publisher :

ieee

ISSN :

1520-9210

Type :

jour

DOI :

10.1109/TMM.2008.2004908

Filename :

4668497

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=982033