• DocumentCode
    1798825
  • Title

    An expressive deep model for human action parsing from a single image

  • Author

    Zhujin Liang ; Xiaolong Wang ; Rui Huang ; Liang Lin

  • Author_Institution
    Sun Yat-Sen Univ., Guangzhou, China
  • fYear
    2014
  • fDate
    14-18 July 2014
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    This paper aims at one newly raising task in vision and multimedia research: recognizing human actions from still images. Its main challenges lie in the large variations in human poses and appearances, as well as the lack of temporal motion information. Addressing these problems, we propose to develop an expressive deep model to naturally integrate human layout and surrounding contexts for higher level action understanding from still images. In particular, a Deep Belief Net is trained to fuse information from different noisy sources such as body part detection and object detection. To bridge the semantic gap, we used manually labeled data to greatly improve the effectiveness and efficiency of the pre-training and fine-tuning stages of the DBN training. The resulting framework is shown to be robust to sometimes unreliable inputs (e.g., imprecise detections of human parts and objects), and outperforms the state-of-the-art approaches.
  • Keywords
    computer vision; object detection; pose estimation; DBN training; Deep Belief Net; body part detection; expressive deep model; fine-tuning stage; human action parsing; human action recognition; human appearance variation; human layout integration; human pose variation; information fuse; multimedia research; object detection; pretraining stage; semantic gap; still images; vision research; Context modeling; Detectors; Estimation; Head; Image recognition; Object detection; Training; Action Recognition; Deep Belief Net; Human Parsing; Image Understanding;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Multimedia and Expo (ICME), 2014 IEEE International Conference on
  • Conference_Location
    Chengdu
  • Type

    conf

  • DOI
    10.1109/ICME.2014.6890158
  • Filename
    6890158