Title :
A k-means clustering algorithm based on the distribution of SIFT
Author :
Hui Lv ; Xianglin Huang ; Lifang Yang ; Tao Liu ; Ping Wang
Author_Institution :
Sch. of Comput. Sci., Commun. Univ. of China, Beijing, China
Abstract :
Bag-of-Words based Image retrieval recently became the research hotspot. To improve the performance of visual word training in Bag-of-Words based image retrieval system, a k-means clustering algorithm based on the distribution of SIFT (Scale Invariant Feature Transform) feature data on each dimension is proposed. The initial clustering centers are obtained by analyzing the distribution of SIFT feature data on each dimension, and combing the iDistance method which is used to partition the data space in high-dimensional indexing according to the data distribution adaptively. Then the AKM (Approximate k-means) is used to do cluster on the sample feature data, train the visual words and get the visual vocabulary finally. In AKM, the k-d tree is built on the cluster centers at the beginning of each iteration to increase speed. The image retrieval system is constructed to verify the performance of our proposed method. Experiments are carried out on the oxford buildings 5k datasets which have 11 landmarks and the mAP (mean Average Precision) is used to evaluate the performance of image retrieval. Our proposed method achieves 31.9% compared to the AKM´s 29.8%, so it is clear that our proposed method optimizes the visual words training process and finally improves the bag-of-words based image retrieval performance.
Keywords :
document image processing; feature extraction; image retrieval; indexing; pattern clustering; trees (mathematics); AKM; SIFT feature data distribution; approximate k-means clustering algorithm; bag-of-words based image retrieval system; data space partition; high-dimensional indexing; iDistance method; k-d tree; mAP; mean average precision; scale invariant feature transform; visual vocabulary; visual word training; Clustering algorithms; Feature extraction; Image retrieval; Quantization (signal); Vectors; Visualization; Vocabulary;
Conference_Titel :
Information Science and Technology (ICIST), 2013 International Conference on
Conference_Location :
Yangzhou
Print_ISBN :
978-1-4673-5137-9
DOI :
10.1109/ICIST.2013.6747776