A Framework for Evaluating Human Action Detection via Multidimensional Approach

Author

Lili, N.A.

Author_Institution

Dept of Multimedia, UPM, Serdang, Malaysia

fYear

2009

fDate

11-14 Aug. 2009

Firstpage

186

Lastpage

190

Abstract

This work discusses the application of an Artificial Intelligence technique called data extraction and a process-based ontology in constructing experimental qualitative models for video retrieval and detection. We present a framework architecture that uses multimodality features as the knowledge representation scheme to model the behaviors of a number of human actions in the video scenes. The main focus of this paper placed on the design of two main components (model classifier and inference engine) for a tool abbreviated as VASD (Video Action Scene Detector) for retrieving and detecting human actions from video scenes. The discussion starts by presenting the workflow of the retrieving and detection process and the automated model classifier construction logic. We then move on to demonstrate how the constructed classifiers can be used with multimodality features for detecting human actions. Finally, behavioral explanation manifestation is discussed. The simulator is implemented in bilingual; Matlab and C++ are at the backend supplying data and theories while Java handles all front-end GUI and action pattern updating.

Keywords

C++ language; Java; graphical user interfaces; hidden Markov models; inference mechanisms; mathematics computing; ontologies (artificial intelligence); video retrieval; video signal processing; C++ language; Java; Matlab; Video Action Scene Detector tool; construction logic; data extraction; graphical user interface; human action detection evaluation; inference engine; knowledge representation scheme; model classifier; multidimensional approach; multimodality features; process-based ontology; video detection; video retrieval; Artificial intelligence; Data mining; Engines; Humans; Information retrieval; Knowledge representation; Layout; Mathematical model; Multidimensional systems; Ontologies; audio feature; hidden Markov model; human action detection; visual feature;

fLanguage

English

Publisher

ieee

Conference_Titel

Computer Graphics, Imaging and Visualization, 2009. CGIV '09. Sixth International Conference on

Conference_Location

Tianjin

Print_ISBN

978-0-7695-3789-4

Type

conf

DOI

10.1109/CGIV.2009.48

Filename

5298204