Title :
Extracting High Level Semantics by Means of Speech, Audio, and Image Primitives in Surveillance Applications
Author :
Goldmann, L. ; Samour, A. ; Karaman, Mustafa ; Sikora, Thomas
Abstract :
Traditional surveillance systems are usually based on visual information only. With the emerging multimedia analysis techniques, interests are changing towards systems that incorporate multiple sensors and different modalities, which leads to new ways of analyzing this multimedia data and more sophisticated applications. This paper shortly reviews the ideas of traditional surveillance systems and explains actual research interests in this domain. Then, it focuses on the typical structure, goals, and applications of multimedia surveillance systems. These issues are supported by short descriptions of selected analysis steps of such a system currently under development. Some experimental results are given to illustrate the extracted semantics and to assess the performance of the individual steps.
Keywords :
audio signal processing; image recognition; multimedia systems; speaker recognition; surveillance; audio primitives; image primitives; multimedia data analysis; multimedia surveillance system; multimodal analysis; semantic extraction; smart room technology; speech primitives; video surveillance; Computer vision; Data analysis; Data mining; Information analysis; Multimedia systems; Pattern recognition; Smart cameras; Speech; Surveillance; Vehicles; multimedia surveillance; multimodal analysis; smart room technologies;
Conference_Titel :
Image Processing, 2006 IEEE International Conference on
Conference_Location :
Atlanta, GA
Print_ISBN :
1-4244-0480-0
DOI :
10.1109/ICIP.2006.312945