DocumentCode :
1977086
Title :
Face analysis for the synthesis of photo-realistic talking heads
Author :
Graf, Hans Peter ; Cosatto, Eric ; Ezzat, Tony
Author_Institution :
AT&T Labs-Res., Red Bank, NJ, USA
fYear :
2000
fDate :
2000
Firstpage :
189
Lastpage :
194
Abstract :
This paper describes techniques for extracting bitmaps of facial parts from videos of a talking person. The goal is to synthesize photo-realistic talking heads of high quality that show picture-perfect appearance and realistic head movements with good lip-sound synchronization. For the synthesis of a talking head, bitmaps of facial parts are combined to form whole heads and then sequences of such images are integrated with audio from a text-to-speech synthesizer. For a seamless integration of facial parts into an animation, their shape and visual appearance must be known with high accuracy. The recognition system has to find not only the locations of facial features, but must also be able to determine the head's orientation and recognize the facial expressions. Our face recognition proceeds in multiple steps, each with an increased precision. Using motion, color and shape information, the head's position and the location of the main facial features are determined first. Then smaller areas are searched with matched filters, in order to identify specific facial features with high precision. From this information a head's 3D orientation is calculated. Facial parts are cut from the image and, using the head's orientation, are warped into bitmaps with 'normalized' orientation and scale.
Keywords :
computer animation; face recognition; feature extraction; filtering theory; image colour analysis; image sequences; matched filters; motion estimation; realistic images; search problems; speech synthesis; synchronisation; animation; audio integration; bitmap extraction; color information; face analysis; face recognition; facial feature location; head orientation; image sequences; lip-sound synchronization; matched filters; motion information; photo-realistic talking heads; picture-perfect appearance; realistic head movements; recognition system; searching; shape information; talking head synthesis; text-to-speech synthesizer; videos; Bridges; Electrical capacitance tomography; Face recognition; Facial animation; Identity-based encryption; Magnetic heads; Read only memory; Shape measurement; Speech synthesis; Videos;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Automatic Face and Gesture Recognition, 2000. Proceedings. Fourth IEEE International Conference on
Conference_Location :
Grenoble
Print_ISBN :
0-7695-0580-5
Type :
conf
DOI :
10.1109/AFGR.2000.840633
Filename :
840633