DocumentCode
799088
Title
Tensor-Based Transductive Learning for Multimodality Video Semantic Concept Detection
Author
Wu, Fei ; Liu, Yanan ; Zhuang, Yueting
Author_Institution
Coll. of Comput. Sci. & Technol., Zhejiang Univ., Hangzhou, China
Volume
11
Issue
5
fYear
2009
Firstpage
868
Lastpage
878
Abstract
Interaction and integration of multimodality media types such as visual, audio, and textual data in video are the essence of video semantic analysis. Contextual information propagation is useful for both intra- and inter-shot correlations. However, the traditional concatenated vector representation of videos weakens the power of propagation and compensation among the multiple modalities. In this paper, we introduce a higher-order tensor framework for video analysis. We represent the image frames, audio, and text in video shots as data points in the form of 3rd-order tensors. We then propose a novel dimension reduction algorithm that explicitly considers the manifold structure of the tensor space from contextual temporal associated cooccurring multimodal media data. Our algorithm inherently preserves the intrinsic structure of the submanifold from which tensorshots are sampled and is also able to map out-of-sample data points directly. We propose a new transductive support tensor machines algorithm to train an effective classifier using a large amount of unlabeled data together with the labeled data. Experimental results on the TRECVID 2005 data set show that our method improves the performance of video semantic concept detection.
Keywords
learning (artificial intelligence); tensors; video signal processing; 3rd-order tensor; TRECVID 2005 data set; concatenated vector representation; contextual information propagation; higher-order tensor framework; inter-shot correlations; intra-shot correlations; multimodal media data; multimodality media types; multimodality video semantic concept detection; tensor-based transductive learning; transductive support tensor machines algorithm; video analysis; video semantic concept detection; Contextual temporal associated cooccurrence (CTAC); TensorShot; dimensionality reduction; higher-order SVD (HOSVD); multimodality video semantic concept detection; transductive support tensor machines (TSTM);
fLanguage
English
Journal_Title
Multimedia, IEEE Transactions on
Publisher
ieee
ISSN
1520-9210
Type
jour
DOI
10.1109/TMM.2009.2021724
Filename
4907041