DocumentCode :
46239
Title :
Informative visual words construction to improve bag of words image representation
Author :
Farhangi, Mohammad Mehdi ; Soryani, Mohsen ; Fathy, Mahmood
Author_Institution :
Sch. of Comput. Eng., Iran Univ. of Sci. & Technol., Tehran, Iran
Volume :
8
Issue :
5
fYear :
2014
fDate :
May-14
Firstpage :
310
Lastpage :
318
Abstract :
Bag of visual words model has recently attracted much attention from computer vision society because of its notable success in analysing images and exploring their content. This study improves this model by utilising the adjacency information between words. To explore this information, a binary tree structure is constructed from the visual words in order to model the is - a relationships in the vocabulary. Informative nodes of this tree are extracted by using the χ2 criterion and are used to capture the adjacency information of visual words. This approach is a simple and computationally effective way for modelling the spatial relations of visual words, which improves the image classification performance. The authors evaluated our method for visual classification of three known datasets: 15 natural scenes, Caltech-101 and Graz-01.
Keywords :
computer vision; image classification; image representation; adjacency information; binary tree structure; computer vision; image classification; image representation; informative visual words construction;
fLanguage :
English
Journal_Title :
Image Processing, IET
Publisher :
iet
ISSN :
1751-9659
Type :
jour
DOI :
10.1049/iet-ipr.2013.0449
Filename :
6829932
Link To Document :
بازگشت