• DocumentCode
    2717821
  • Title

    Detecting activities of daily living in first-person camera views

  • Author

    Pirsiavash, Hamed ; Ramanan, Deva

  • Author_Institution
    Dept. of Comput. Sci., Univ. of California, Irvine, CA, USA
  • fYear
    2012
  • fDate
    16-21 June 2012
  • Firstpage
    2847
  • Lastpage
    2854
  • Abstract
    We present a novel dataset and novel algorithms for the problem of detecting activities of daily living (ADL) in firstperson camera views. We have collected a dataset of 1 million frames of dozens of people performing unscripted, everyday activities. The dataset is annotated with activities, object tracks, hand positions, and interaction events. ADLs differ from typical actions in that they can involve long-scale temporal structure (making tea can take a few minutes) and complex object interactions (a fridge looks different when its door is open). We develop novel representations including (1) temporal pyramids, which generalize the well-known spatial pyramid to approximate temporal correspondence when scoring a model and (2) composite object models that exploit the fact that objects look different when being interacted with. We perform an extensive empirical evaluation and demonstrate that our novel representations produce a two-fold improvement over traditional approaches. Our analysis suggests that real-world ADL recognition is “all about the objects,” and in particular, “all about the objects being interacted with.”
  • Keywords
    cameras; gesture recognition; image motion analysis; activities of daily living detection; approximate temporal correspondence; complex object interactions; composite object models; extensive empirical evaluation; first-person camera views; long-scale temporal structure; real-world ADL recognition; spatial pyramid; temporal pyramids; Biomedical monitoring; Cameras; Detectors; Face; Hidden Markov models; Taxonomy; Visualization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on
  • Conference_Location
    Providence, RI
  • ISSN
    1063-6919
  • Print_ISBN
    978-1-4673-1226-4
  • Electronic_ISBN
    1063-6919
  • Type

    conf

  • DOI
    10.1109/CVPR.2012.6248010
  • Filename
    6248010