• DocumentCode
    306307
  • Title

    Joint audio-video processing for multimedia

  • Author

    Chen, Tsuhan ; Rao, Ram

  • Author_Institution
    AT&T Res., Holmdel, NJ, USA
  • Volume
    1
  • fYear
    1996
  • fDate
    5-10 Aug 1996
  • Firstpage
    548
  • Abstract
    In this paper, the authors report recent developments in the research of joint audio-visual processing for multimedia applications. These include: bimodality in speech production and perception, automatic lipreading, talking-head animation and lip synchronization. They present in detail the enabling technologies for these applications. A new trend of research is to utilize audio-visual interaction in the coding of talking head video. They show that the marriage of speech analysis and image processing can create a number of new research opportunities
  • Keywords
    computer animation; multimedia communication; multimedia computing; speech coding; speech processing; synchronisation; video coding; audio-visual interaction; automatic lipreading; bimodality; enabling technologies; image processing; joint audio-visual processing; lip synchronization; multimedia applications; recent developments; research opportunities; speech analysis; speech perception; speech production; talking head video coding; talking-head animation; Humans; Image analysis; Image converters; Lips; Mouth; Speech analysis; Speech recognition; Teeth; Tongue; Videoconference;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Industrial Electronics, Control, and Instrumentation, 1996., Proceedings of the 1996 IEEE IECON 22nd International Conference on
  • Conference_Location
    Taipei
  • Print_ISBN
    0-7803-2775-6
  • Type

    conf

  • DOI
    10.1109/IECON.1996.571012
  • Filename
    571012