• DocumentCode
    2174745
  • Title

    Recognizing action at a distance

  • Author

    Efros, Alexei A. ; Berg, Alexander C. ; Mori, Greg ; Malik, Jitendra

  • Author_Institution
    Comput. Sci. Div., California Univ., Berkeley, CA, USA
  • fYear
    2003
  • fDate
    13-16 Oct. 2003
  • Firstpage
    726
  • Abstract
    Our goal is to recognize human action at a distance, at resolutions where a whole person may be, say, 30 pixels tall. We introduce a novel motion descriptor based on optical flow measurements in a spatiotemporal volume for each stabilized human figure, and an associated similarity measure to be used in a nearest-neighbor framework. Making use of noisy optical flow measurements is the key challenge, which is addressed by treating optical flow not as precise pixel displacements, but rather as a spatial pattern of noisy measurements which are carefully smoothed and aggregated to form our spatiotemporal motion descriptor. To classify the action being performed by a human figure in a query sequence, we retrieve nearest neighbor(s) from a database of stored, annotated video sequences. We can also use these retrieved exemplars to transfer 2D/3D skeletons onto the figures in the query sequence, as well as two forms of data-based action synthesis "do as I do" and "do as I say". Results are demonstrated on ballet, tennis as well as football datasets.
  • Keywords
    computer vision; image motion analysis; image retrieval; image sequences; video signal processing; 2D skeletons; 3D skeletons; action classification; ballet dataset; data-based action synthesis; distant action recognition; football dataset; human action; motion descriptor; nearest neighbor retrieval; nearest-neighbor framework; noisy optical flow; optical flow measurements; query sequence; retrieved exemplars; similarity measure; spatial pattern; spatiotemporal volume; stabilized human figure; tennis dataset; video sequences database; Displacement measurement; Fluid flow measurement; Humans; Image motion analysis; Information retrieval; Motion measurement; Optical noise; Spatial databases; Spatiotemporal phenomena; Volume measurement;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Vision, 2003. Proceedings. Ninth IEEE International Conference on
  • Conference_Location
    Nice, France
  • Print_ISBN
    0-7695-1950-4
  • Type

    conf

  • DOI
    10.1109/ICCV.2003.1238420
  • Filename
    1238420