• DocumentCode
    2136369
  • Title

    Combining multiple visual processing streams for locating and classifying objects in video

  • Author

    Paiton, DM ; Brumby, SP ; Kenyon, GT ; Kunde, GJ ; Peterson, KD ; Ham, MI ; Schultz, PF ; George, JS

  • Author_Institution
    Los Alamos Nat. Lab., Los Alamos, NM, USA
  • fYear
    2012
  • fDate
    22-24 April 2012
  • Firstpage
    49
  • Lastpage
    52
  • Abstract
    Automated, invariant object detection has proven itself to be a substantial challenge for the artificial intelligence research community. In computer vision, many different benchmarks have been established using whole-image classification based on datasets that are too small to eliminate statistical artifacts. As an alternative, we used a new dataset consisting of ~62GB (on the order of 40,000 2Mpixel frames) of compressed high-definition aerial video, which we employed for both object classification and localization. Our algorithms mimic the processing pathways in primate visual cortex, exploiting color/texture, shape/form and motion. We then combine the data using a clustering technique to produce a final output in the form of labeled bounding boxes around objects of interest in the video. Localization adds additional complexity not generally found in whole-image classification problems. Our results are evaluated qualitatively and quantitatively using a scoring metric that assessed the overlap between our detections and ground-truth.
  • Keywords
    computer vision; image classification; image colour analysis; image motion analysis; image texture; object detection; pattern clustering; video coding; video streaming; artificial intelligence research community; automated invariant object detection; clustering technique; compressed high-definition aerial video; computer vision; image color; image motion; image shape; image texture; labeled bounding boxes; multiple visual processing streams; object classification; object localization; primate visual cortex; processing pathways; scoring metric; Brain models; Clustering algorithms; Computational modeling; Image edge detection; Kernel; Visualization; NeoVision2; clustering algorithms; object detection; optic flow; visual cortex;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Image Analysis and Interpretation (SSIAI), 2012 IEEE Southwest Symposium on
  • Conference_Location
    Santa Fe, NM
  • Print_ISBN
    978-1-4673-1831-0
  • Electronic_ISBN
    978-1-4673-1829-7
  • Type

    conf

  • DOI
    10.1109/SSIAI.2012.6202450
  • Filename
    6202450