DocumentCode
63162
Title
Efficient Scale- and Rotation-Invariant Encoding of Visual Words for Image Classification
Author
Anwar, Hafeez ; Zambanini, Sebastian ; Kampel, Martin
Author_Institution
Comput. Vision Lab., Vienna Univ. of Technol., Vienna, Austria
Volume
22
Issue
10
fYear
2015
fDate
Oct. 2015
Firstpage
1762
Lastpage
1765
Abstract
The problem of incorporating spatial information to the bag-of-visual-words model for image classification is addressed in this letter. To incorporate such information, we propose to encode the global geometric relationships of the visual words in the 2D image plane in a scale- and rotation-invariant manner. This is established by measuring scale- and rotation-invariant geometrical properties given by triangles of identical visual words. Experimental results demonstrate that our proposed method is more robust to changes in scale and image rotations than the bag-of-visual words model on a butterfly and fish dataset.
Keywords
image classification; image coding; 2D image plane; bag-of-visual-word model; butterfly dataset; fish dataset; global geometric relationship; image classification; image rotation; rotation-invariant encoding; scale-invariant encoding; spatial information; Feature extraction; Histograms; Image color analysis; Lead; Robustness; Visualization; Vocabulary; Image classification; object recognition; support vector machines;
fLanguage
English
Journal_Title
Signal Processing Letters, IEEE
Publisher
ieee
ISSN
1070-9908
Type
jour
DOI
10.1109/LSP.2015.2432851
Filename
7106459
Link To Document