Title :
Undirected Graphical Models for Video Analysis and Classification
Author :
Liu, Yan ; Yang, Jun ; Hauptmann, Alexander G.
Author_Institution :
IBM, Yorktown Heights
Abstract :
Accurate and efficient video classification and retrieval demands the fusion of multimodal information and the use of intermediate representations. This paper describes an undirected graphical model based on exponential-family harmonium, which derives intermediate semantic representations of video data by jointly modeling the textual and image information in the video. We propose an extension of the model to derive category-specific video representation and integrate video classification as a part of the modeling process. We report satisfactory classification performance on a set of 15 video categories from TRECVID collection as well as comparison on the effectiveness of different inference algorithms.
Keywords :
directed graphs; image classification; image fusion; image representation; video retrieval; video signal processing; TRECVID collection; category-specific video representation; exponential-family harmonium; intermediate semantic representation; multimodal information fusion; undirected graphical models; video analysis; video classification; video retrieval demand; Computer science; Data mining; Ear; Graphical models; Indexing; Inference algorithms; Information retrieval; Linear discriminant analysis; Principal component analysis; Speech analysis;
Conference_Titel :
Multimedia and Expo, 2007 IEEE International Conference on
Conference_Location :
Beijing
Print_ISBN :
1-4244-1016-9
Electronic_ISBN :
1-4244-1017-7
DOI :
10.1109/ICME.2007.4284945