DocumentCode
1658465
Title
Automatic image annotation via local sparse coding
Author
Wenbo Zhang ; Dongping Tian ; Hong Hu ; Xiaofei Zhao ; Zhongzhi Shi
Author_Institution
Key Lab. of Intell. Inf. Process., Inst. of Comput. Technol., Beijing, China
fYear
2013
Firstpage
1661
Lastpage
1665
Abstract
Sparse coding is an active research topic in machine learning and signal processing community. In this paper, we propose a novel local sparse model for multi-label image annotation. Existing feature descriptors and extraction algorithms pay less attention to semantic information and extracted feature dimension usually is high, which leads to heavy computation. Noise and redundant information often reduce the performance of sparse model. To address these issues, we combine label and visual information for feature selection while most previous work only utilizes labels and ignores visual information itself. First of all, we make use of label sets to seek images neighbor relations and generate the Gaussian kernel matrix over these neighbor images, then use LLP(Local Learning Projection) algorithm to get minimal local estimation error. After that, for each query image, we find its K nearest neighbors in the transformed space and use these neighbors to reconstruct it via sparse coding. Moreover, during coding, we penalize the corresponding reconstruction coefficients to implicitly reflect the neighbor relations. Finally, propagating tags from training data to test data. Image annotation experiments on the Corel5k dataset show the performance of our approach is comparable to several state-of-the-art algorithms.
Keywords
Gaussian channels; feature extraction; image reconstruction; image retrieval; Gaussian kernel matrix; LLP; automatic image annotation; feature descriptor algorithm; feature extraction algorithm; feature selection; local learning projection algorithm; local sparse coding; machine learning; minimal local estimation error; multilabel image annotation; reconstruction coefficient; semantic information; signal processing community; sparse coding; sparse model; Encoding; Hidden Markov models; Image coding; Image reconstruction; Semantics; Training; Visualization; KN-N; feature selection; image annotation; local; sparse coding;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location
Vancouver, BC
ISSN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2013.6637934
Filename
6637934
Link To Document