DocumentCode
635481
Title
Correlation-based Feature Analysis and Multi-Modality Fusion framework for multimedia semantic retrieval
Author
Hsin-Yu Ha ; Yimin Yang ; Fleites, Fausto C. ; Shu-Ching Chen
Author_Institution
Sch. of Comput. & Inf. Sci., Florida Int. Univ., Miami, FL, USA
fYear
2013
fDate
15-19 July 2013
Firstpage
1
Lastpage
6
Abstract
In this paper, we propose a Correlation based Feature Analysis (CFA) and Multi-Modality Fusion (CFA-MMF) framework for multimedia semantic concept retrieval. The CFA method is able to reduce the feature space and capture the correlation between features, separating the feature set into different feature groups, called Hidden Coherent Feature Groups (HCFGs), based on Maximum Spanning Tree (MaxST) algorithm. A correlation matrix is built upon feature pair correlations, and then a MaxST is constructed based on the correlation matrix. By performing a graph cut procedure on the MaxST, a set of feature groups are obtained, where the intra-group correlation is maximized and the inter-group correlation is minimized. Finally, one classifier is trained for each of the feature groups, and the generated scores from different classifiers are fused for the final retrieval. The proposed framework is effective because it reduces the dimensionality of the feature space. The experimental results on the NUSWIDE-Lite data set demonstrate the effectiveness of the proposed CFA-MMF framework.
Keywords
correlation methods; information retrieval; matrix algebra; multimedia computing; semantic Web; trees (mathematics); CFA method; CFA-MMF framework; HCFG; MaxST algorithm; NUSWIDE-lite data set; correlation matrix; correlation-based feature analysis; feature pair correlations; feature space; graph cut procedure; hidden coherent feature groups; inter-group correlation; intra-group correlation; maximum spanning tree algorithm; multimedia semantic concept retrieval; multimedia semantic retrieval; multimodality fusion framework; Algorithm design and analysis; Correlation; Feature extraction; Multimedia communication; Semantics; Training; Visualization; feature correlation; fusion; maximum spanning tree; multi-modality; multimedia semantic retrieval;
fLanguage
English
Publisher
ieee
Conference_Titel
Multimedia and Expo (ICME), 2013 IEEE International Conference on
Conference_Location
San Jose, CA
ISSN
1945-7871
Type
conf
DOI
10.1109/ICME.2013.6607639
Filename
6607639
Link To Document