DocumentCode :
2815608
Title :
Enhancing Concept Detection by Pruning Data with MCA-Based Transaction Weights
Author :
Lin, Lin ; Shyu, Mei-Ling ; Chen, Shu-Ching
Author_Institution :
Dept. of Electr. & Comput. Eng., Univ. of Miami, Coral Gables, FL, USA
fYear :
2009
fDate :
14-16 Dec. 2009
Firstpage :
304
Lastpage :
311
Abstract :
With the rapid growth of multimedia data, research on semantic information retrieval faces a challenging problem: the number of positive data instances that contain the target concept/object/event is much smaller than the number of negative data instances that do not, which is known as the data imbalance issue. One popular topic in multimedia information processing and retrieval is therefore data pruning, a technique that automatically identifies and prunes data instances from the training data set so that the pruned data set enhances the performance of model learning, classification, and concept detection. In this paper, a novel data pruning framework is proposed that assigns each transaction a weight based on multiple correspondence analysis (MCA). These transaction weights serve as the measure for pruning the training data set. The testing data set can be weighted and pruned as well, so that the computational cost is reduced not only when building the model but also when applying the classifiers. In experiments with 18 high-level concepts and benchmark data sets (both balanced and imbalanced) from TRECVID, the proposed framework achieves promising results in enhancing the concept detection performance of three well-known classifiers commonly used for concept detection.
Keywords :
information retrieval; multimedia databases; concept detection enhancement; multimedia data; multiple correspondence analysis; pruning data; semantic information retrieval; target concept; target event; target object; transaction weights; Collaboration; Content based retrieval; Information filtering; Information filters; Information retrieval; Multimedia computing; Object detection; Support vector machines; Training data; USA Councils; concept detection; data pruning; multiple correspondence analysis; transaction weight;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Multimedia, 2009. ISM '09. 11th IEEE International Symposium on
Conference_Location :
San Diego, CA
Print_ISBN :
978-1-4244-5231-6
Electronic_ISBN :
978-0-7695-3890-7
Type :
conf
DOI :
10.1109/ISM.2009.125
Filename :
5363259