DocumentCode
2967014
Title
Feature Selection Using Correlation and Reliability Based Scoring Metric for Video Semantic Detection
Author
Zhu, Qiusha ; Lin, Lin ; Shyu, Mei-Ling ; Chen, Shu-Ching
Author_Institution
Dept. of Electr. & Comput. Eng., Univ. of Miami, Coral Gables, FL, USA
fYear
2010
fDate
22-24 Sept. 2010
Firstpage
462
Lastpage
469
Abstract
Content-based multimedia retrieval faces many challenges such as semantic gap, imbalanced data, and varied qualities of the media. Feature selection as a component of the retrieval process plays an important role. The aim of feature selection is to identify a subset of features by removing irrelevant or redundant features. An effective subset of features can not only improve model performance and reduce computational complexity, but also enhance semantic interpretability. To achieve these objectives, in this paper, a novel metric that integrates the correlation and reliability information between each feature and each class obtained from Multiple Correspondence Analysis (MCA) is proposed to score the features for feature selection. Based on these scores, a ranked list of features can be generated and different selection criteria can be adopted to select a subset of features. To evaluate the proposed framework, four other well-known feature selection methods, namely information gain, chi-square measure, correlation-based feature selection, and relief are compared with the proposed method over five popular classifiers using the benchmark data from TRECVID 2009 high-level feature extraction task. The results show that the proposed method outperforms the other methods in terms of classification accuracy, the size of feature subspace, and the ability to capture the semantic information.
Keywords
computational complexity; content-based retrieval; feature extraction; multimedia computing; reliability; video retrieval; chi-square measure; computational complexity; content-based multimedia retrieval; correlation based scoring metric; correlation-based feature selection; imbalanced data; information gain; multiple correspondence analysis; reliability based scoring metric; semantic gap; semantic interpretability; video semantic detection; Correlation; Equations; Feature extraction; Mathematical model; Measurement; Reliability; Semantics; Feature selection; correlation; reliability; video semantic detection;
fLanguage
English
Publisher
ieee
Conference_Titel
Semantic Computing (ICSC), 2010 IEEE Fourth International Conference on
Conference_Location
Pittsburgh, PA
Print_ISBN
978-1-4244-7912-2
Electronic_ISBN
978-0-7695-4154-9
Type
conf
DOI
10.1109/ICSC.2010.65
Filename
5629038
Link To Document