Author_Institution :
Sch. of Comput., Nat. Univ. of Singapore, Singapore, Singapore
Abstract :
In computer vision and multimedia analysis, it is common to use multiple features (or multimodal features) to represent an object. For example, to well characterize a natural scene image, we typically extract a set of visual features to represent its color, texture, and shape. However, it is challenging to integrate multimodal features optimally. Since they are usually high-order correlated, e.g., the histogram of gradient (HOG), bag of scale invariant feature transform descriptors, and wavelets are closely related because they collaboratively reflect the image texture. Nevertheless, the existing algorithms fail to capture the high-order correlation among multimodal features. To solve this problem, we present a new multimodal feature integration framework. Particularly, we first define a new measure to capture the high-order correlation among the multimodal features, which can be deemed as a direct extension of the previous binary correlation. Therefore, we construct a feature correlation hypergraph (FCH) to model the high-order relations among multimodal features. Finally, a clustering algorithm is performed on FCH to group the original multimodal features into a set of partitions. Moreover, a multiclass boosting strategy is developed to obtain a strong classifier by combining the weak classifiers learned from each partition. The experimental results on seven popular datasets show the effectiveness of our approach.
Keywords :
computer vision; correlation methods; feature extraction; graph theory; image classification; image representation; image texture; multimedia systems; pattern clustering; FCH; HOG; clustering algorithm; computer vision; feature correlation hypergraph; high-order correlation; high-order potentials; high-order relations; histogram of gradient; image texture; multiclass boosting strategy; multimedia analysis; multimodal feature integration framework; multimodal recognition; object representation; scale invariant feature transform descriptor bag; strong classifier; wavelets; weak classifiers; Boosting; Correlation; Entropy; Joints; Kernel; Support vector machines; Vectors; Feature correlation hypergraph; high-order relations; multimodal features;