• DocumentCode
    1295122
  • Title

    A media conversion from speech to facial image for intelligent man-machine interface

  • Author

    Morishima, Shigeo ; Harashima, Hiroshi

  • Author_Institution
    Fac. of Eng., Seikei Univ., Tokyo, Japan
  • Volume
    9
  • Issue
    4
  • fYear
    1991
  • fDate
    5/1/1991 12:00:00 AM
  • Firstpage
    594
  • Lastpage
    600
  • Abstract
    An automatic field motion image synthesis scheme (driven by speech) and a real-time image synthesis design are presented. The purpose of this research is to realize an intelligent human-machine interface or intelligent communication system with talking head images. A human face is reconstructed on the display of a terminal using a 3-D surface model and texture mapping technique. Facial motion images are synthesized naturally by transformation of the lattice points on 3-D wire frames. Two driving motion methods, a text-to-image conversion scheme and a voice-to-image conversion scheme, are proposed. In the first method, the synthesized head image can appear to speak some given words and phrases naturally. In the second case, some mouth and jaw motions can be synthesized in synchronization with voice signals from a speaker. Facial expressions other than mouth shape and jaw position can be added at any moment, so it is easy to make the facial model appear angry, to smile, to appear sad, etc., by special modification rules. These schemes were implemented on a parallel image computer system. A real-time image synthesizer was able to generate facial motion images on the display at a TV image video rate
  • Keywords
    computerised picture processing; parallel processing; real-time systems; speech synthesis; user interfaces; 3-D surface model; 3-D wire frames; TV image video rate; driving motion methods; facial expressions; facial image; facial model; facial motion images; human face reconstruction; intelligent communication system; intelligent man-machine interface; media conversion; parallel image computer system; real-time image synthesis; real-time image synthesizer; speech synthesis; talking head images; terminal display; text-to-image conversion; texture mapping technique; voice signals; voice-to-image conversion; Face; Head; Humans; Image converters; Image generation; Intelligent systems; Man machine systems; Mouth; Signal synthesis; Speech synthesis;
  • fLanguage
    English
  • Journal_Title
    Selected Areas in Communications, IEEE Journal on
  • Publisher
    ieee
  • ISSN
    0733-8716
  • Type

    jour

  • DOI
    10.1109/49.81953
  • Filename
    81953