• DocumentCode
    2047106
  • Title

    A Framework for Evaluating Human Action Detection via Multidimensional Approach

  • Author

    Lili, N.A.

  • Author_Institution
    Dept of Multimedia, UPM, Serdang, Malaysia
  • fYear
    2009
  • fDate
    11-14 Aug. 2009
  • Firstpage
    186
  • Lastpage
    190
  • Abstract
    This work discusses the application of an Artificial Intelligence technique called data extraction, together with a process-based ontology, in constructing experimental qualitative models for video retrieval and detection. We present a framework architecture that uses multimodality features as the knowledge representation scheme to model the behaviors of a number of human actions in video scenes. The main focus of this paper is the design of the two main components (model classifier and inference engine) of a tool called VASD (Video Action Scene Detector), which retrieves and detects human actions in video scenes. The discussion starts by presenting the workflow of the retrieval and detection process and the logic for automated construction of the model classifiers. We then demonstrate how the constructed classifiers can be used with multimodality features to detect human actions. Finally, the manifestation of behavioral explanations is discussed. The simulator is implemented as a mixed-language system: Matlab and C++ form the back end, supplying data and theories, while Java handles the front-end GUI and action pattern updating.
  • Keywords
    C++ language; Java; graphical user interfaces; hidden Markov models; inference mechanisms; mathematics computing; ontologies (artificial intelligence); video retrieval; video signal processing; Matlab; Video Action Scene Detector tool; construction logic; data extraction; graphical user interface; human action detection evaluation; inference engine; knowledge representation scheme; model classifier; multidimensional approach; multimodality features; process-based ontology; video detection; Artificial intelligence; Data mining; Engines; Humans; Information retrieval; Knowledge representation; Layout; Mathematical model; Multidimensional systems; Ontologies; audio feature; hidden Markov model; human action detection; visual feature
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Computer Graphics, Imaging and Visualization, 2009. CGIV '09. Sixth International Conference on
  • Conference_Location
    Tianjin
  • Print_ISBN
    978-0-7695-3789-4
  • Type

    conf

  • DOI
    10.1109/CGIV.2009.48
  • Filename
    5298204