Integrating Visual and Textual Features for Web Image Clustering

Author

Xia, D.S. ; Xiang, Z.Q. ; Zou, Y.X.

Author_Institution

ADSPLAB, Peking Univ., Shenzhen, China

fYear

2015

fDate

20-22 April 2015

Firstpage

116

Lastpage

123

Abstract

With the explosive growth of Web and tremendous development of digital image processing technologies, the applications of Web image have attracted much attention, such as the Web image retrieval. Since the Web images are often with some related text tags, making use of both visual and textual features of Web image will help improving the accuracy of the Web image clustering. Researches show that Web image clustering methods, such as graph partitioning models and hyper graph partitioning models, didn´t make use the relations between texts and image simultaneously. In this paper, we explore to take both visual and textual features into account for Web image clustering by building a graph model and develop a novel iterative clustering method. With K clusters initialized, we calculate the occurrence frequency of each visual/textual feature over the j-th cluster (j = 1, 2,⋯, K), which is used to measure the significance of the feature for the j-th cluster. Then the likelihood of each image, which belongs to the j-th cluster, can be determined accordingly. Furthermore, a mixture model is built for the predicted feature linked to each image and the EM algorithm is adopted to get K component parameters which describe posterior probabilities of all clusters for each image. Then two K-dimensional vectors consisting of component parameters will be used to describe the image and adjust the cluster index of it. Several experiments have been performed with MIR-Flickr25K and IAPR TC-12 Benchmark datasets and the performance of the proposed Web image clustering algorithm is superior to that of the compared algorithm.

Keywords

graph theory; image texture; iterative methods; pattern clustering; probability; text analysis; EM algorithm; IAPR TC-12 benchmark dataset; K-component parameters; K-dimensional vectors; MIR-Flickr25K benchmark dataset; Web image clustering; Web image retrieval; accuracy improvement; cluster index; component parameters; digital image processing technologies; hypergraph partitioning models; image likelihood; iterative clustering method; occurrence frequency; posterior probabilities; text tags; textual feature; textual feature integration; visual feature; visual feature integration; Algorithm design and analysis; Clustering algorithms; Feature extraction; Multimedia communication; Semantics; Time complexity; Visualization; Web image clustering; mixtual model; ranking functions; visual and textual features;

fLanguage

English

Publisher

ieee

Conference_Titel

Multimedia Big Data (BigMM), 2015 IEEE International Conference on

Conference_Location

Beijing

Print_ISBN

978-1-4799-8687-3

Type

conf

DOI

10.1109/BigMM.2015.35

Filename

7153864