• DocumentCode
    432538
  • Title
    A generic mid-level representation for semantic video analysis
  • Author
    Tang, Qing; Lim, Joo-Hwee; Jin, Jesse S.; Sun, Haiping; Tian, Qi
  • Author_Institution
    Sch. of Inf. Technol., Sydney Univ., NSW, Australia
  • Volume
    1
  • fYear
    2004
  • fDate
    24-27 Oct. 2004
  • Firstpage
    629
  • Abstract
    The paper presents a generic, mid-level representation for efficient semantic video analysis, which adopts a frame-by-frame scheme using P-frames rather than shot-based schemes. Each P-frame is partitioned into an m×n grid (row by column), and each cell is called a 'block'. The representation can bridge the semantic gap and build an intermediate description of video features across frames and blocks. Soccer video is used to showcase the potential of the framework for real video processing. Experiments with tennis video and news video have also been conducted. Results demonstrate the excellent performance of the framework in semantic analysis and also indicate its further potential for automatic video analysis.
  • Keywords
    feature extraction; image representation; video signal processing; automatic video analysis; frame-by-frame scheme; generic representation; mid-level representation; news video; semantic video analysis; soccer video; tennis video; video features; video processing; Australia; Bridges; Event detection; Information analysis; Information retrieval; Information technology; Large Hadron Collider; Performance analysis; Statistical learning; Streaming media
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Image Processing, 2004. ICIP '04. 2004 International Conference on
  • ISSN
    1522-4880
  • Print_ISBN
    0-7803-8554-3
  • Type
    conf
  • DOI
    10.1109/ICIP.2004.1418833
  • Filename
    1418833
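
The grid partition mentioned in the abstract (each P-frame split into an m×n grid of blocks, row by column) might be sketched roughly as below. This is only an illustration under stated assumptions, not the authors' implementation: the function names `partition_into_blocks` and `block_means`, the use of a plain 2-D list as a frame, and the per-block mean as a feature are all hypothetical choices made for the example. The record does not say which features the paper computes per block.

```python
from typing import List, Sequence, Tuple

Bounds = Tuple[int, int, int, int]  # (top, bottom, left, right), bottom/right exclusive


def partition_into_blocks(height: int, width: int, m: int, n: int) -> List[List[Bounds]]:
    """Split a height x width frame into an m x n grid (rows by columns).

    Hypothetical helper: returns pixel bounds for each block. Integer
    division spreads any remainder so the blocks tile the frame exactly.
    """
    if not (1 <= m <= height and 1 <= n <= width):
        raise ValueError("need 1 <= m <= height and 1 <= n <= width")
    row_edges = [r * height // m for r in range(m + 1)]
    col_edges = [c * width // n for c in range(n + 1)]
    return [
        [(row_edges[r], row_edges[r + 1], col_edges[c], col_edges[c + 1]) for c in range(n)]
        for r in range(m)
    ]


def block_means(frame: Sequence[Sequence[float]], m: int, n: int) -> List[List[float]]:
    """Compute one value per block: the mean of the per-pixel values.

    The mean is an assumed placeholder feature; `frame` is assumed to be a
    rectangular 2-D sequence of per-pixel values for a single frame.
    """
    height, width = len(frame), len(frame[0])
    grid = partition_into_blocks(height, width, m, n)
    means = []
    for row in grid:
        row_means = []
        for top, bottom, left, right in row:
            values = [frame[y][x] for y in range(top, bottom) for x in range(left, right)]
            row_means.append(sum(values) / len(values))
        means.append(row_means)
    return means
```

Calling `block_means` once per frame would give an m×n array per frame; stacking those arrays over time is one way to picture a description "across frames and blocks", though the paper's actual representation may differ.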