Title :
Web Multimedia Object Classification Using Cross-Domain Correlation Knowledge
Author :
Wenting Lu ; Jingxuan Li ; Tao Li ; Weidong Guo ; Honggang Zhang ; Jun Guo
Author_Institution :
Capital Univ. of Econ. & Bus., Beijing, China
Abstract :
Given a collection of web images with the corresponding textual descriptions, in this paper, we propose a novel cross-domain learning method to classify these web multimedia objects by transferring the correlation knowledge among different information sources. Here, the knowledge is extracted from unlabeled objects through unsupervised learning and applied to perform supervised classification tasks. To mine more meaningful correlation knowledge, instead of using commonly used visual words in the traditional bag-of-visual-words (BoW) model, we discover higher level visual components (words and phrases) to incorporate the spatial and semantic information into our image representation model, i.e., bag-of-visual-phrases (BoP). By combining the enriched visual components with the textual words, we calculate the frequently co-occurring pairs among them to construct a cross-domain correlated graph in which the correlation knowledge is mined. After that, we investigate two different strategies to apply such knowledge to enrich the feature space where the supervised classification is performed. By transferring such knowledge, our cross-domain transfer learning method can not only handle large scale web multimedia objects, but also deal with the situation that the textual descriptions of a small portion of web images are missing. Empirical experiments on two different datasets of web multimedia objects are conducted to demonstrate the efficacy and effectiveness of our proposed cross-domain transfer learning method.
Keywords :
Internet; correlation methods; graph theory; image classification; image representation; knowledge acquisition; multimedia computing; text analysis; unsupervised learning; BoP; BoW; Web image collection; Web multimedia object classification; bag-of-visual-phrases; bag-of-visual-words model; cross-domain correlated graph; cross-domain correlation knowledge; cross-domain transfer learning method; feature space; image representation model; information sources; knowledge extraction; supervised classification tasks; textual descriptions; unsupervised learning; visual components; Correlation; Feature extraction; Knowledge engineering; Learning systems; Multimedia communication; Semantics; Visualization; Bag-of-Visual-Phrases Model; Correlation Knowledge; Cross-Domain; Multimedia Object Classification; Transfer Learning;
Journal_Title :
Multimedia, IEEE Transactions on
DOI :
10.1109/TMM.2013.2280895