DocumentCode :
3201381
Title :
Undirected Graphical Models for Video Analysis and Classification
Author :
Liu, Yan ; Yang, Jun ; Hauptmann, Alexander G.
Author_Institution :
IBM, Yorktown Heights
fYear :
2007
fDate :
2-5 July 2007
Firstpage :
1495
Lastpage :
1498
Abstract :
Accurate and efficient video classification and retrieval demand the fusion of multimodal information and the use of intermediate representations. This paper describes an undirected graphical model based on the exponential-family harmonium, which derives intermediate semantic representations of video data by jointly modeling the textual and image information in the video. We propose an extension of the model that derives category-specific video representations and integrates video classification into the modeling process. We report satisfactory classification performance on a set of 15 video categories from the TRECVID collection, as well as a comparison of the effectiveness of different inference algorithms.
Keywords :
directed graphs; image classification; image fusion; image representation; video retrieval; video signal processing; TRECVID collection; category-specific video representation; exponential-family harmonium; intermediate semantic representation; multimodal information fusion; undirected graphical models; video analysis; video classification; video retrieval demand; Computer science; Data mining; Ear; Graphical models; Indexing; Inference algorithms; Information retrieval; Linear discriminant analysis; Principal component analysis; Speech analysis;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Multimedia and Expo, 2007 IEEE International Conference on
Conference_Location :
Beijing
Print_ISBN :
1-4244-1016-9
Electronic_ISBN :
1-4244-1017-7
Type :
conf
DOI :
10.1109/ICME.2007.4284945
Filename :
4284945