DocumentCode :
3385872
Title :
A k-means clustering algorithm based on the distribution of SIFT
Author :
Hui Lv ; Xianglin Huang ; Lifang Yang ; Tao Liu ; Ping Wang
Author_Institution :
Sch. of Comput. Sci., Commun. Univ. of China, Beijing, China
fYear :
2013
fDate :
23-25 March 2013
Firstpage :
1301
Lastpage :
1304
Abstract :
Bag-of-Words based Image retrieval recently became the research hotspot. To improve the performance of visual word training in Bag-of-Words based image retrieval system, a k-means clustering algorithm based on the distribution of SIFT (Scale Invariant Feature Transform) feature data on each dimension is proposed. The initial clustering centers are obtained by analyzing the distribution of SIFT feature data on each dimension, and combing the iDistance method which is used to partition the data space in high-dimensional indexing according to the data distribution adaptively. Then the AKM (Approximate k-means) is used to do cluster on the sample feature data, train the visual words and get the visual vocabulary finally. In AKM, the k-d tree is built on the cluster centers at the beginning of each iteration to increase speed. The image retrieval system is constructed to verify the performance of our proposed method. Experiments are carried out on the oxford buildings 5k datasets which have 11 landmarks and the mAP (mean Average Precision) is used to evaluate the performance of image retrieval. Our proposed method achieves 31.9% compared to the AKM´s 29.8%, so it is clear that our proposed method optimizes the visual words training process and finally improves the bag-of-words based image retrieval performance.
Keywords :
document image processing; feature extraction; image retrieval; indexing; pattern clustering; trees (mathematics); AKM; SIFT feature data distribution; approximate k-means clustering algorithm; bag-of-words based image retrieval system; data space partition; high-dimensional indexing; iDistance method; k-d tree; mAP; mean average precision; scale invariant feature transform; visual vocabulary; visual word training; Clustering algorithms; Feature extraction; Image retrieval; Quantization (signal); Vectors; Visualization; Vocabulary;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information Science and Technology (ICIST), 2013 International Conference on
Conference_Location :
Yangzhou
Print_ISBN :
978-1-4673-5137-9
Type :
conf
DOI :
10.1109/ICIST.2013.6747776
Filename :
6747776
Link To Document :
بازگشت