DocumentCode :
615119
Title :
Video based activity recognition in trauma resuscitation
Author :
Chakraborty, Imon ; Elgammal, Ahmed ; Burd, Randall S.
Author_Institution :
Dept. of Comput. Sci., Rutgers Univ., Piscataway, NJ, USA
fYear :
2013
fDate :
22-26 April 2013
Firstpage :
1
Lastpage :
8
Abstract :
We present a system for automated transcription of trauma resuscitation in the emergency department (ED). Using a ceiling-mounted single camera video recording, our goal is to track and transcribe the medical procedures performed during resuscitation of a patient, the time instances of their initiation and their temporal durations. In this multi-agent, multitask setting, we represent procedures as high-level concepts composed of low-level features based on the patient´s pose, scene dynamics, clinician motions and device locations. In particular, the low-level features are transformed into intermediate action attributes (e.g., “hand grasping of an object of interest”) and are used as building blocks to describe procedures. Procedures are expressed as first-order logic statements that capture spatio-temporal attribute interactions compactly in an activity grammar. The probabilities from feature observations and the logical semantics are combined probabilistically in a Markov Logic Network (MLN). At runtime, a Markov Network is dynamically constructed representing hypothesized procedures, spatio-temporal relationships and attribute probabilities. Inference on this network determines the most consistent sequence of procedures over time. Our activity model is modular and extendible to a multitude of sensor inputs and detection methods. The method is thus adaptable to many activity recognition problems. In this paper, we show our approach using videos of simulated trauma simulations. The accuracy of the results confirms the suitability of our framework.
Keywords :
Markov processes; image motion analysis; injuries; medical image processing; multi-agent systems; object recognition; video signal processing; MLN; Markov logic network; action attribute; activity grammar; automated transcription; ceiling-mounted single camera video recording; clinician motion; emergency department; feature observation; first-order logic statement; hand grasping; logical semantics; low-level feature; medical procedure; multiagent; multitask setting; patient pose; scene dynamics; spatio-temporal attribute interaction; temporal duration; trauma resuscitation; video based activity recognition; Biomedical imaging; Feature extraction; Grammar; Hidden Markov models; Markov random fields; Tracking;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Automatic Face and Gesture Recognition (FG), 2013 10th IEEE International Conference and Workshops on
Conference_Location :
Shanghai
Print_ISBN :
978-1-4673-5545-2
Electronic_ISBN :
978-1-4673-5544-5
Type :
conf
DOI :
10.1109/FG.2013.6553758
Filename :
6553758
Link To Document :
بازگشت