• DocumentCode
    2277055
  • Title

    A distribution based video representation for human action recognition

  • Author

    Song, Yan ; Tang, Sheng ; Zheng, Yan-Tao ; Chua, Tat-Seng ; Zhang, Yongdong ; Lin, Shouxun

  • Author_Institution
    Lab. of Adv. Comput. Res., Chinese Acad. of Sci., Beijing, China
  • fYear
    2010
  • fDate
    19-23 July 2010
  • Firstpage
    772
  • Lastpage
    777
  • Abstract
    Most current research on human action recognition in videos uses the bag-of-words (BoW) representations based on vector quantization on local spatial temporal features, due to the simplicity and good performance of such representations. In contrast to the BoW schemes, this paper explores a localized, continuous and probabilistic video representation. Specifically, the proposed representation encodes the visual and motion information of an ensemble of local spatial temporal (ST) features of a video into a distribution estimated by a generative probabilistic model such as the Gaussian Mixture Model. Furthermore, this probabilistic video representation naturally gives rise to an information-theoretic distance metric of videos. This makes the representation readily applicable as input to most discriminative classifiers, such as the nearest neighbor schemes and the kernel methods. The experiments on two datasets, KTH and UCF sports, show that the proposed approach could deliver promising results.
  • Keywords
    Gaussian processes; image motion analysis; image representation; pattern classification; probability; vectors; video signal processing; Gaussian mixture model; KTH; UCF sports; bag-of-words representations; discriminative classifiers; distribution based video representation; generative probabilistic model; human action recognition; information theoretic distance metric; probabilistic video representation; spatial temporal features; vector quantization; Accuracy; Databases; Feature extraction; Humans; Measurement; Probabilistic logic; Vocabulary; human action recognition; information-theoretic video matching; probabilistic video representation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Multimedia and Expo (ICME), 2010 IEEE International Conference on
  • Conference_Location
    Suntec City
  • ISSN
    1945-7871
  • Print_ISBN
    978-1-4244-7491-2
  • Type

    conf

  • DOI
    10.1109/ICME.2010.5582550
  • Filename
    5582550