Title :
Comparison of clustering approaches for summarizing large populations of images
Author :
Jing, Yushi ; Covell, Michele ; Rowley, Henry A.
Author_Institution :
Google Inc., Mountain View, CA, USA
Abstract :
This paper compares the efficacy and efficiency of different clustering approaches for selecting a set of exemplar images, to present in the context of a semantic concept. We evaluate these approaches using 900 diverse queries, each associated with 1000 web images, and comparing the exemplars chosen by clustering to the top 20 images for that search term. Our results suggest that Affinity Propagation is effective in selecting exemplars that match the top search images but at high computational cost. We improve on these early results using a simple distribution-based selection filter on incomplete clustering results. This improvement allows us to use more computationally efficient approaches to clustering, such as Hierarchical Agglomerative Clustering (HAC) and Partitioning Around Medoids (PAM), while still reaching the same (or better) quality of results as were given by Affinity Propagation in the original study. The computational savings is significant since these alternatives are 7-27 times faster than Affinity Propagation.
Keywords :
Internet; image processing; information filtering; pattern clustering; Web images; affinity propagation; clustering approaches; distribution-based selection filter; hierarchical agglomerative clustering; large image populations; partitioning around medoids; Bipartite graph; Clustering algorithms; Computational efficiency; Google; Search engines; Semantics; Visualization; Web image summarization; clustering; k-medoids;
Conference_Titel :
Multimedia and Expo (ICME), 2010 IEEE International Conference on
Conference_Location :
Suntec City
Print_ISBN :
978-1-4244-7491-2
DOI :
10.1109/ICME.2010.5583276