Title :
VASD: Video Action Scene Detector Using Audio Visual Data
Author_Institution :
Dept of Multimedia, UPM Serdang, Serdang, Malaysia
Abstract :
This paper presents a method which able to integrate audio and visual information for human action scene analysis. The approach is top-down for determining and extracting action scenes in video by analyzing both audio and video data. We proposed a framework for recognizing actions by measuring image and action-based information from video with the following characteristics: feature extraction is done automatically; the method deals with both visual and auditory information, and captures both spatial and temporal characteristics; and the extracted features are natural, in the sense that they are closely related to the human perceptual processing. Our effort was to implementing idea of action identification by extracting syntactic properties of a video such as edge feature extraction, colour distribution, audio and motion vectors. In this paper, we present a simple method for human activity recognition based on a Hidden Markov models (HMMs) for sensing, learning and training the actions. In addition, we used audio visual features to distinguish the human actions and to reach a decision. We describe the use of the model that diagnoses states of a human activity based on events from video. We reviewed the model, present an implementation, and report on experiments to demonstrate the robustness of the framework.
Keywords :
audio signal processing; hidden Markov models; video signal processing; action recognition; action-based information; audio information; audio vectors; audio visual data; audio visual features; auditory information; colour distribution; edge feature extraction; hidden Markov models; human action scene analysis; human activity recognition; motion vectors; video action scene detector; visual information; Character recognition; Data mining; Detectors; Feature extraction; Hidden Markov models; Humans; Image analysis; Image recognition; Layout; Robustness; HMM; audiovisual features; video action scene detection;
Conference_Titel :
Computer Technology and Development, 2009. ICCTD '09. International Conference on
Conference_Location :
Kota Kinabalu
Print_ISBN :
978-0-7695-3892-1
DOI :
10.1109/ICCTD.2009.21