• DocumentCode
    304750
  • Title

    Spatio-temporal model-assisted compatible coding for low and very low bitrate videotelephony

  • Author

    Jae-Beom Lee ; Eleftheriad, Alexandros

  • Author_Institution
    Dept. of Electr. Eng., Columbia Univ., New York, NY, USA
  • Volume
    1
  • fYear
    1996
  • fDate
    16-19 Sep 1996
  • Firstpage
    429
  • Abstract
    We introduce the concept of spatio-temporal model-assisted compatible (STMAC) coding, a technique to selectively encode areas of different importance to the human eye in terms of space and time in moving images. For this, we use the fact that human “eye contact” and “lip synchronization” are very important in person-to-person communication. Several areas including the eyes and lips need different types of quality, since different areas have different perceptual significance to human observers. For example, for the eyes “high resolution” is needed for clear communication, while for the lips “frequent refresh” is needed. The approach provides a better rate-distortion tradeoff than conventional image coding technologies based on MPEG-1, MPEG-2, H.261, as well as H.263, since STMAC coding is applied on top of an encoder, taking full advantage of its core design. The decoder does not need to be changed in any way although the encoder´s rate control unit is slightly modified. This characteristic leads to the name “compatible” in the proposed concept. Experimental results are given using ITU-T H.263, addressing very low bit rate compression (13-17 Kbps)
  • Keywords
    rate distortion theory; telecommunication standards; video coding; videotelephony; 13 to 17 Kbit/s; ITU-T H.263; STMAC coding; core design; decoder; encoder´s rate control unit; eye contact; frequent refresh; high resolution; human observers; lip synchronization; low bitrate videotelephony; perceptual significance; person-to-person communication; rate-distortion tradeoff; selective encoding; spatio-temporal model-assisted compatible coding; very low bitrate videotelephony; Bit rate; Codecs; Decoding; Eyes; Frequency synchronization; Humans; Image coding; Lips; Space technology; Spatial resolution;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Image Processing, 1996. Proceedings., International Conference on
  • Conference_Location
    Lausanne
  • Print_ISBN
    0-7803-3259-8
  • Type

    conf

  • DOI
    10.1109/ICIP.1996.560867
  • Filename
    560867