• DocumentCode
    63162
  • Title

    Efficient Scale- and Rotation-Invariant Encoding of Visual Words for Image Classification

  • Author

    Anwar, Hafeez ; Zambanini, Sebastian ; Kampel, Martin

  • Author_Institution
    Comput. Vision Lab., Vienna Univ. of Technol., Vienna, Austria
  • Volume
    22
  • Issue
    10
  • fYear
    2015
  • fDate
    Oct. 2015
  • Firstpage
    1762
  • Lastpage
    1765
  • Abstract
    The problem of incorporating spatial information to the bag-of-visual-words model for image classification is addressed in this letter. To incorporate such information, we propose to encode the global geometric relationships of the visual words in the 2D image plane in a scale- and rotation-invariant manner. This is established by measuring scale- and rotation-invariant geometrical properties given by triangles of identical visual words. Experimental results demonstrate that our proposed method is more robust to changes in scale and image rotations than the bag-of-visual words model on a butterfly and fish dataset.
  • Keywords
    image classification; image coding; 2D image plane; bag-of-visual-word model; butterfly dataset; fish dataset; global geometric relationship; image classification; image rotation; rotation-invariant encoding; scale-invariant encoding; spatial information; Feature extraction; Histograms; Image color analysis; Lead; Robustness; Visualization; Vocabulary; Image classification; object recognition; support vector machines;
  • fLanguage
    English
  • Journal_Title
    Signal Processing Letters, IEEE
  • Publisher
    ieee
  • ISSN
    1070-9908
  • Type

    jour

  • DOI
    10.1109/LSP.2015.2432851
  • Filename
    7106459