مرکز منطقه ای اطلاع رساني علوم و فناوري - Audiovisual fusion with segment models for video structure analysis

DocumentCode :

448831

Title :

Audiovisual fusion with segment models for video structure analysis

Author :

Delakis, M. ; Gravier, G. ; Gros, P.

Author_Institution :

IRISA, Univ. of Rennes 1, Rennes, France

fYear :

2005

fDate :

Nov. 30 2005-Dec. 1 2005

Firstpage :

Lastpage :

Abstract :

Hidden Markov Models provide a powerful framework for bridging the semantic gap between low-level video features and high-level user needs by taking full advantage of our prior knowledge on the video structure. A serious flaw of HMMs is that they require all the modalities of a video document to be strictly synchronous before their fusion. Taking as a case study tennis broadcasts analysis, we introduce video indexing using Segment Models, a generalization of Hidden Markov Models, where the fusion of different modalities can be performed in a more flexible way. Operating essentially as a layered topology they allow the fusion of asynchronous modalities but do not rely on synchronization points fixed a priori. They also facilitate the fusion of audio models of high-level semantics, like the content of a complete scene, on top of the raw lowlevel audio frames. Segment Models provide encouraging experimental results.

Keywords :

audio-visual systems; hidden Markov models; image segmentation; indexing; sensor fusion; video signal processing; audiovisual fusion; hidden Markov models; segment models; tennis broadcasts analysis; video indexing; video structure analysis;

fLanguage :

English

Publisher :

iet

Conference_Titel :

Integration of Knowledge, Semantics and Digital Media Technology, 2005. EWIMT 2005. The 2nd European Workshop on the (Ref. No. 2005/11099)

Conference_Location :

London

ISSN :

0537-9989

Print_ISBN :

0-86341-595-4

Type :

conf

Filename :

1575950

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=448831