مرکز منطقه ای اطلاع رساني علوم و فناوري - Contextual Bag-of-Words for Visual Categorization

DocumentCode :

1398862

Title :

Contextual Bag-of-Words for Visual Categorization

Author :

Li, Teng ; Mei, Tao ; Kweon, In-So ; Hua, Xian-Sheng

Author_Institution :

Dept. of Electr. Eng., Korea Adv. Inst. of Sci. & Technol., Daejeon, South Korea

Volume :

Issue :

fYear :

2011

fDate :

4/1/2011 12:00:00 AM

Firstpage :

381

Lastpage :

392

Abstract :

Bag-of-words (BOW), which represents an image by the histogram of local patches on the basis of a visual vocabulary, has attracted intensive attention in visual categorization due to its good performance and flexibility. Conventional BOW neglects the contextual relations between local patches due to its Naïve Bayesian assumption. However, it is well known that contextual relations play an important role for human beings to recognize visual categories from their local appearance. This paper proposes a novel contextual bag-of-words (CBOW) representation to model two kinds of typical contextual relations between local patches, i.e., a semantic conceptual relation and a spatial neighboring relation. To model the semantic conceptual relation, visual words are grouped on multiple semantic levels according to the similarity of class distribution induced by them, accordingly local patches are encoded and images are represented. To explore the spatial neighboring relation, an automatic term extraction technique is adopted to measure the confidence that neighboring visual words are relevant. Word groups with high relevance are used and their statistics are incorporated into the BOW representation. Classification is taken using the support vector machine with an efficient kernel to incorporate the relational information. The proposed approach is extensively evaluated on two kinds of visual categorization tasks, i.e., video event and scene categorization. Experimental results demonstrate the importance of contextual relations of local patches and the CBOW shows superior performance to conventional BOW.

Keywords :

Bayes methods; feature extraction; image representation; support vector machines; vocabulary; Naïve Bayesian assumption; automatic term extraction technique; contextual bag-of-words representation; semantic conceptual relation; spatial neighboring relation; support vector machine; visual categorization; visual vocabulary; Bayesian methods; Context modeling; Data mining; Histograms; Humans; Kernel; Statistical distributions; Support vector machine classification; Support vector machines; Vocabulary; Bag-of-words; conceptual relation; local patches context; neighboring relation;

fLanguage :

English

Journal_Title :

Circuits and Systems for Video Technology, IEEE Transactions on

Publisher :

ieee

ISSN :

1051-8215

Type :

jour

DOI :

10.1109/TCSVT.2010.2041828

Filename :

5401099

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1398862