Efficient Scale- and Rotation-Invariant Encoding of Visual Words for Image Classification

Author

Anwar, Hafeez ; Zambanini, Sebastian ; Kampel, Martin

Author_Institution

Comput. Vision Lab., Vienna Univ. of Technol., Vienna, Austria

Volume

22

Issue

10

fYear

2015

fDate

Oct. 2015

Firstpage

1762

Lastpage

1765

Abstract

The problem of incorporating spatial information to the bag-of-visual-words model for image classification is addressed in this letter. To incorporate such information, we propose to encode the global geometric relationships of the visual words in the 2D image plane in a scale- and rotation-invariant manner. This is established by measuring scale- and rotation-invariant geometrical properties given by triangles of identical visual words. Experimental results demonstrate that our proposed method is more robust to changes in scale and image rotations than the bag-of-visual words model on a butterfly and fish dataset.

Keywords

image classification; image coding; 2D image plane; bag-of-visual-word model; butterfly dataset; fish dataset; global geometric relationship; image classification; image rotation; rotation-invariant encoding; scale-invariant encoding; spatial information; Feature extraction; Histograms; Image color analysis; Lead; Robustness; Visualization; Vocabulary; Image classification; object recognition; support vector machines;

fLanguage

English

Journal_Title

Signal Processing Letters, IEEE

Publisher

ieee

ISSN

1070-9908

Type

jour

DOI

10.1109/LSP.2015.2432851

Filename

7106459