DocumentCode
3517801
Title
Effective image representation based on bi-layer visual codebook
Author
Song, Yan ; Tang, Jinhui ; Li, Xia ; Tian, Qi ; Dai, Lirong
Author_Institution
Dept. of EEIS, Univ. of Sci & Tech. of China, Hefei, China
fYear
2011
fDate
28-28 Nov. 2011
Firstpage
224
Lastpage
228
Abstract
Recently, the Bag-of-visual Words (BoW) based image representation has drawn much attention in image categorization and retrieval applications. It is known that the visual codebook construction and the related quantization methods play the important roles in BoW model. Traditionally, visual codebook is generated by clustering local features into groups, and the original feature is hard quantized to its nearest centers. It is known that the quantization error may degrade the effectiveness of the BoW representation. To address this problem, several soft quantization based methods have been proposed in literature. However, the effectiveness of these methods is still unsatisfactory. In this paper, we propose a novel and effective image representation method based on a bi-layer codebook. In this method, we first construct the bi-layer codebook to explicitly reduce the quantization error. And then, inspired by the locality-constrained linear coding method[18], we propose a ridge regression based quantization to assign multiple visual words to the local feature. Furthermore, the k nearest neighbor strategy is integrated to improve the efficiency of quantization. To evaluate the proposed image representation, we compare it with the existing image representations on two benchmark datasets in the image classification experiments. The experimental results demonstrate the superiority over the state-of-the-art techniques.
Keywords
image classification; image coding; image representation; linear codes; pattern clustering; regression analysis; bag-of-visual words; bilayer visual codebook; effective image representation method; image categorization; image classification; image retrieval applications; k nearest neighbor strategy; local feature clustering; locality-constrained linear coding method; multiple visual words; quantization error; ridge regression based quantization; soft quantization based methods; Feature extraction; Image classification; Image coding; Image representation; Quantization; Vectors; Visualization; Image Classification; Image Representation; visual codebook;
fLanguage
English
Publisher
ieee
Conference_Titel
Pattern Recognition (ACPR), 2011 First Asian Conference on
Conference_Location
Beijing
Print_ISBN
978-1-4577-0122-1
Type
conf
DOI
10.1109/ACPR.2011.6166534
Filename
6166534
Link To Document