DocumentCode :
3079017
Title :
PCAH: A PCA-Based Hierarchical Clustering Method for Visual Words Construction
Author :
Ying He ; Lin Mei ; Jian Wang ; Zhi-zong Wu ; Xue-xia Zhong
Author_Institution :
Cyber Phys. Syst. R&D Center, Third Res. Inst. of Minist. of Public Security, Shanghai, China
fYear :
2015
fDate :
4-7 May 2015
Firstpage :
1009
Lastpage :
1018
Abstract :
Most of the existing methods for generating a visual dictionary SIFT based on local characteristics, and adopt the common K-means clustering method to get the visual dictionary. But when the image vector dimension of the local feature is growing higher, the vector distribution of the local characteristics becomes sparse, resulting in the high correlation distance between the image vectors and reducing the comparability and universality of the visual patterns. According to the above problem, based on the local SIFT features, this paper introduced a Principal Component Analysis Hierarchical clustering method (PCAH) for generating the visual dictionary. This method can effectively ease the feature dimension disaster and obtain better stability. In addition, this method can solve the problem because of high dimension and structure complexity in the feature space of the images efficiently, and can get better performance in generating the visual dictionary. The experiment is executed on the pedestrians dataset Test_dataset1(our own dataset), pos, the scene classification dataset Upright vs Inverted, and the behavior classification dataset Stanford40_JPEGImages. And the datasets are divided into two groups based on the number of the SIFT features (one is less than 300 and the other is more than 5000). We adopt the Silhouette index and the computation time as the evaluation index. The experiment results indicate that comparing with the K-means clustering algorithm, the proposed PCA-based Hierarchical clustering method (PCAH) can reach higher quality visual words. At the same time, the computation speed of the PCAH clustering method is faster.
Keywords :
pattern clustering; principal component analysis; text detection; K-means clustering algorithm; PCA-based hierarchical clustering method; PCAH clustering method; SIFT features; evaluation index; principal component analysis hierarchical clustering method; silhouette index; visual dictionary; visual words construction; Clustering algorithms; Clustering methods; Dictionaries; Feature extraction; Principal component analysis; Visualization; Vocabulary; Clustering methods; K-means; PCAH; Visual words;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Cluster, Cloud and Grid Computing (CCGrid), 2015 15th IEEE/ACM International Symposium on
Conference_Location :
Shenzhen
Type :
conf
DOI :
10.1109/CCGrid.2015.33
Filename :
7152587
Link To Document :
بازگشت