DocumentCode :
3093711
Title :
Integrating Visual and Textual Features for Web Image Clustering
Author :
Xia, D.S. ; Xiang, Z.Q. ; Zou, Y.X.
Author_Institution :
ADSPLAB, Peking Univ., Shenzhen, China
fYear :
2015
fDate :
20-22 April 2015
Firstpage :
116
Lastpage :
123
Abstract :
With the explosive growth of Web and tremendous development of digital image processing technologies, the applications of Web image have attracted much attention, such as the Web image retrieval. Since the Web images are often with some related text tags, making use of both visual and textual features of Web image will help improving the accuracy of the Web image clustering. Researches show that Web image clustering methods, such as graph partitioning models and hyper graph partitioning models, didn´t make use the relations between texts and image simultaneously. In this paper, we explore to take both visual and textual features into account for Web image clustering by building a graph model and develop a novel iterative clustering method. With K clusters initialized, we calculate the occurrence frequency of each visual/textual feature over the j-th cluster (j = 1, 2,⋯, K), which is used to measure the significance of the feature for the j-th cluster. Then the likelihood of each image, which belongs to the j-th cluster, can be determined accordingly. Furthermore, a mixture model is built for the predicted feature linked to each image and the EM algorithm is adopted to get K component parameters which describe posterior probabilities of all clusters for each image. Then two K-dimensional vectors consisting of component parameters will be used to describe the image and adjust the cluster index of it. Several experiments have been performed with MIR-Flickr25K and IAPR TC-12 Benchmark datasets and the performance of the proposed Web image clustering algorithm is superior to that of the compared algorithm.
Keywords :
graph theory; image texture; iterative methods; pattern clustering; probability; text analysis; EM algorithm; IAPR TC-12 benchmark dataset; K-component parameters; K-dimensional vectors; MIR-Flickr25K benchmark dataset; Web image clustering; Web image retrieval; accuracy improvement; cluster index; component parameters; digital image processing technologies; hypergraph partitioning models; image likelihood; iterative clustering method; occurrence frequency; posterior probabilities; text tags; textual feature; textual feature integration; visual feature; visual feature integration; Algorithm design and analysis; Clustering algorithms; Feature extraction; Multimedia communication; Semantics; Time complexity; Visualization; Web image clustering; mixtual model; ranking functions; visual and textual features;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Multimedia Big Data (BigMM), 2015 IEEE International Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4799-8687-3
Type :
conf
DOI :
10.1109/BigMM.2015.35
Filename :
7153864
Link To Document :
بازگشت