Regional Information Center for Science and Technology (RICeST) - Creating efficient codebooks for visual recognition

DocumentCode :

2541891

Title :

Creating efficient codebooks for visual recognition

Author :

Jurie, Frederic ; Triggs, Bill

Author_Institution :

GRAVIR-INRIA-CNRS, Montbonnot, France

Volume :

fYear :

2005

fDate :

17-21 Oct. 2005

Firstpage :

604

Abstract :

Visual codebook based quantization of robust appearance descriptors extracted from local image patches is an effective means of capturing image statistics for texture analysis and scene classification. Codebooks are usually constructed by using a method such as k-means to cluster the descriptor vectors of patches sampled either densely ('textons') or sparsely ('bags of features' based on key-points or salience measures) from a set of training images. This works well for texture analysis in homogeneous images, but the images that arise in natural object recognition tasks have far less uniform statistics. We show that for dense sampling, k-means over-adapts to this, clustering centres almost exclusively around the densest few regions in descriptor space and thus failing to code other informative regions. This gives suboptimal codes that are no better than using randomly selected centres. We describe a scalable acceptance-radius based clusterer that generates better codebooks and study its performance on several image classification tasks. We also show that dense representations outperform equivalent keypoint based ones on these tasks and that SVM or mutual information based feature selection starting from a dense codebook further improves the performance.
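Illustration (not part of the original record): the abstract describes the conventional pipeline in which descriptor vectors are clustered with k-means and each image is then coded by the codebook centres nearest to its descriptors. The following is a minimal sketch of that baseline only, assuming descriptors are rows of a NumPy array. The function names `build_codebook`, `quantize`, and `codeword_histogram` are hypothetical, and this is not the acceptance-radius clusterer proposed in the paper.

```python
import numpy as np

def quantize(descriptors, centres):
    """Return the index of the nearest centre for each descriptor."""
    # Squared Euclidean distance from every descriptor to every centre.
    d2 = ((descriptors[:, None, :] - centres[None, :, :]) ** 2).sum(axis=2)
    return d2.argmin(axis=1)

def build_codebook(descriptors, k, n_iter=20, seed=0):
    """Plain Lloyd's k-means; the k centres form the codebook.

    Assumption: a fixed iteration count and random initialisation,
    chosen here only to keep the sketch short.
    """
    rng = np.random.default_rng(seed)
    # Initialise centres with k distinct training descriptors.
    idx = rng.choice(len(descriptors), size=k, replace=False)
    centres = descriptors[idx].astype(float)
    for _ in range(n_iter):
        labels = quantize(descriptors, centres)
        for j in range(k):
            members = descriptors[labels == j]
            if len(members) > 0:  # keep the old centre if a cluster empties
                centres[j] = members.mean(axis=0)
    return centres

def codeword_histogram(descriptors, centres):
    """Normalised histogram of codeword assignments for one image."""
    labels = quantize(descriptors, centres)
    hist = np.bincount(labels, minlength=len(centres)).astype(float)
    return hist / hist.sum()
```

With training descriptors `X` of shape `(n, d)`, `build_codebook(X, k)` returns a `(k, d)` array, and `codeword_histogram` turns the descriptors of a single image into a length-`k` vector that a classifier such as an SVM could consume.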

Keywords :

feature extraction; image classification; image coding; pattern clustering; dense codebook; dense sampling; feature selection; image patch; mutual information; robust appearance descriptor; scalable acceptance-radius based clusterer; support vector machine; visual codebook based quantization; visual recognition; Image analysis; Image sampling; Image texture analysis; Layout; Object recognition; Quantization; Robustness; Statistical analysis; Support vector machines

fLanguage :

English

Publisher :

IEEE

Conference_Title :

Computer Vision, 2005. ICCV 2005. Tenth IEEE International Conference on

ISSN :

1550-5499

Print_ISBN :

0-7695-2334-X

Type :

conf

DOI :

10.1109/ICCV.2005.66

Filename :

1541309

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2541891