مرکز منطقه ای اطلاع رساني علوم و فناوري - Constrained optimization for audio-to-visual conversion

DocumentCode :

987839

Title :

Constrained optimization for audio-to-visual conversion

Author :

Choi, Kyoung-Ho ; Hwang, Jenq-Neng

Author_Institution :

Spatial Inf. Technol. Center, Electron. & Telecommun. Res. Inst., Daejeon, South Korea

Volume :

Issue :

fYear :

2004

fDate :

6/1/2004 12:00:00 AM

Firstpage :

1783

Lastpage :

1790

Abstract :

We have developed a new audio-to-visual conversion algorithm that uses a constrained optimization approach to take advantage of dynamics of mouth movements. Based on facial muscle analysis, the dynamics of mouth movements is modeled, and constraints are obtained from it. The obtained constraints are used to estimate visual parameters from speech in a framework of hidden Markov model (HMM)-based visual parameter estimation. To solve the constrained optimization problem, the Lagrangian approach is used to transform the constrained problem into an unconstrained problem in our implementation. The proposed method is tested on various noisy environments to show its robustness and correctness. Our proposed algorithm is favorably compared with the mixture-based HMM method, which also uses audio-visual HMMs and finds optimal estimates based on a joint audio-visual probability distribution. Our proposed algorithm can estimate optimal visual parameters while satisfying the constraints and avoiding performance degradation in noisy environments.

Keywords :

audio-visual systems; computer animation; hidden Markov models; optimisation; parameter estimation; probability; speech processing; Lagrangian approach; additive white Gaussian noise; audio-to-visual conversion algorithm; audio-visual probability distribution; constrained optimization approach; facial muscle analysis; hidden Markov models; jittering; microphone mismatch; mouth movements; visual parameter estimation; Constraint optimization; Facial muscles; Hidden Markov models; Lagrangian functions; Mouth; Parameter estimation; Robustness; Speech; Testing; Working environment noise; Audio-to-visual conversion; HMM; HMMI; talking heads;

fLanguage :

English

Journal_Title :

Signal Processing, IEEE Transactions on

Publisher :

ieee

ISSN :

1053-587X

Type :

jour

DOI :

10.1109/TSP.2004.827153

Filename :

1299110

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=987839