DocumentCode :
3276088
Title :
Audio-Visual Feature Extraction for Semi-Automatic Annotation of Meetings
Author :
Kepesi, Mariain ; Neffe, Michael ; Van Pham, Thang ; Grabner, Michael ; Grabner, Helmut ; Juffinger, Andreas
Author_Institution :
SPSC Lab., Graz Univ. of Technol.
fYear :
2006
fDate :
3-6 Oct. 2006
Firstpage :
207
Lastpage :
211
Abstract :
In this paper we present the building blocks of our semi-automatic annotation tool which supports multi-modal and multi-level annotation of meetings. The main focus is on the proper design and functionality of the modules for recognizing meeting actions. The key features, identity and position of the speakers, are provided by different modalities (audio and video). Three audio algorithms (voice activity detection, speaker identification and direction of arrival) and three video algorithms (detection, tracking and identification) form the low-level feature extraction components. Low-level features are automatically merged and the recognized actions are proposed to the user by visualizing them. The annotation labels are related but not limited to events during meetings. The user can finally confirm or if necessary, modify the suggestion, and then store the actions into a database
Keywords :
audio databases; audio-visual systems; data visualisation; feature extraction; image recognition; video databases; audio database; audio-visual feature extraction; data visualization; meeting action recognition; semiautomatic annotation tool; video algorithm; Computer graphics; Computer networks; Computer science; Data mining; Feature extraction; Laboratories; Social network services; Spatial databases; Visual databases; Visualization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Multimedia Signal Processing, 2006 IEEE 8th Workshop on
Conference_Location :
Victoria, BC
Print_ISBN :
0-7803-9751-7
Electronic_ISBN :
0-7803-9752-5
Type :
conf
DOI :
10.1109/MMSP.2006.285298
Filename :
4064548
Link To Document :
بازگشت