Multiple media cues for MPEG-7

Author

Brown, B.J. ; Derom, K. ; Lindsay, A. ; Saraceno, C.

Author_Institution

Starlab Brussels, Belgium

fYear

2000

fDate

4-8 Sept. 2000

Firstpage

Lastpage

Abstract

This work presents a methodology to extract and represent the semantic content of audio-visual documents. A collection of diverse tools is used to extract low-level, signal-based descriptions. Joint audio and visual analysis is used to automatically extract higher-level semantic features. High-level, hand-annotated descriptors are also used, both for retrieval and to enhance the results of the automatic procedure, i.e. to allow the system to learn, through the user's input, how high-level semantic information is linked to low-level, automatically extracted features. We draw upon MPEG-7's collection of Descriptors to provide some targets for our audio and visual analysis methods. Selected MPEG-7 Description Schemes, such as the textual description, the description of persons, and the description of the structural aspects of the content of the AV document [1], provide some of the larger containment structures for our features.

fLanguage

English

Publisher

IEEE

Conference_Titel

Signal Processing Conference, 2000 10th European

Conference_Location

Tampere, Finland

Print_ISBN

978-952-1504-43-3

Type

conf

Filename

7075804

Link To Document

https://search.isc.ac/dl/search/defaultta.aspx?DTC=49&DC=696958