• DocumentCode
    1093648
  • Title

    Automatic foveation for video compression using a neurobiological model of visual attention

  • Author

    Itti, Laurent

  • Author_Institution
    Psychol. & Neurosci. Graduate Program, Univ. of Southern California, Los Angeles, CA, USA
  • Volume
    13
  • Issue
    10
  • fYear
    2004
  • Firstpage
    1304
  • Lastpage
    1318
  • Abstract
    We evaluate the applicability of a biologically-motivated algorithm to select visually-salient regions of interest in video streams for multiply-foveated video compression. Regions are selected based on a nonlinear integration of low-level visual cues, mimicking processing in primate occipital, and posterior parietal cortex. A dynamic foveation filter then blurs every frame, increasingly with distance from salient locations. Sixty-three variants of the algorithm (varying number and shape of virtual foveas, maximum blur, and saliency competition) are evaluated against an outdoor video scene, using MPEG-1 and constant-quality MPEG-4 (DivX) encoding. Additional compression radios of 1.1 to 8.5 are achieved by foveation. Two variants of the algorithm are validated against eye fixations recorded from four to six human observers on a heterogeneous collection of 50 video clips (over 45 000 frames in total). Significantly higher overlap than expected by chance is found between human and algorithmic foveations. With both variants, foveated clips are, on average, approximately half the size of unfoveated clips, for both MPEG-1 and MPEG-4. These results suggest a general-purpose usefulness of the algorithm in improving compression ratios of unconstrained video.
  • Keywords
    data compression; eye; filters; integration; neurophysiology; video coding; MPEG-1; automatic foveation; biologically-motivated algorithm; constant-quality MPEG-4 encoding; dynamic foveation filter; eye fixation; low-level visual cues; mimicking processing; neurobiological model; nonlinear integration; outdoor video scene; posterior parietal cortex; primate occipital; video compression; video streams; visual attention; visually-salient regions; Biological information theory; Biological system modeling; Brain modeling; Filters; Humans; MPEG 4 Standard; Shape; Streaming media; Transform coding; Video compression; Adult; Algorithms; Artificial Intelligence; Attention; Biomimetics; Computer Graphics; Computer Simulation; Data Compression; Eye Movements; Female; Fovea Centralis; Humans; Image Enhancement; Male; Models, Neurological; Numerical Analysis, Computer-Assisted; Pattern Recognition, Automated; Reproducibility of Results; Sensitivity and Specificity; Signal Processing, Computer-Assisted; User-Computer Interface; Video Recording; Visual Perception;
  • fLanguage
    English
  • Journal_Title
    Image Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1057-7149
  • Type

    jour

  • DOI
    10.1109/TIP.2004.834657
  • Filename
    1331443