• DocumentCode
    79225
  • Title

    Multiple subsequence combination in human action recognition

  • Author

    Onofri, Leonardo ; Soda, Paolo ; Iannello, Giulio

  • Author_Institution
    Comput. Sci. & Bioinf. Lab., Univ. Campus Bio-Medico di Roma, Rome, Italy
  • Volume
    8
  • Issue
    1
  • fYear
    2014
  • fDate
    Feb. 2014
  • Firstpage
    26
  • Lastpage
    34
  • Abstract
    Human action recognition is an active research area with applications in several domains such as visual surveillance, video retrieval and human-computer interaction. Current approaches assign action labels to video streams considering the whole video as a single sequence but, in some cases, the large variability between frames may lead to misclassifications. The authors propose a multiple subsequence combination (MSC) method that divides the video into several consecutive subsequences. It applies part-based and bag of visual words approaches to classify each subsequence. Then, it combines subsequence labels to assign an action label to the video. The proposed approach was tested on the KTH, UCF sports, Youtube and Robo-Kitchen datasets, which have large differences in terms of video length, object appearance and pose, object scale, viewpoint, background, as well as number, type and complexity of actions performed. Two main results were achieved. First, the MSC approach shows better performances compared to classify the video as a whole, even when few subsequences are used. Second, the approach is robust and stable since, for each dataset, its performances are comparable to the part-based approach at the state-of-the-art.
  • Keywords
    computational complexity; image sequences; object recognition; pose estimation; video signal processing; KTH; MSC; Robo-Kitchen datasets; UCF sports; Youtube; actions complexity; bag of visual words approaches; consecutive subsequences; human action recognition; multiple subsequence combination; object appearance; object pose; part-based approaches; video streams;
  • fLanguage
    English
  • Journal_Title
    Computer Vision, IET
  • Publisher
    iet
  • ISSN
    1751-9632
  • Type

    jour

  • DOI
    10.1049/iet-cvi.2013.0015
  • Filename
    6725836