• DocumentCode
    2958355
  • Title

    Discriminative learning of relaxed hierarchy for large-scale visual recognition

  • Author

    Gao, Tianshi ; Koller, Daphne

  • Author_Institution
    Dept. of Electr. Eng., Stanford Univ., Stanford, CA, USA
  • fYear
    2011
  • fDate
    6-13 Nov. 2011
  • Firstpage
    2072
  • Lastpage
    2079
  • Abstract
    In the real visual world, the number of categories a classifier needs to discriminate is on the order of hundreds or thousands. For example, the SUN dataset [24] contains 899 scene categories and ImageNet [6] has 15,589 synsets. Designing a multiclass classifier that is both accurate and fast at test time is an extremely important problem in both machine learning and computer vision communities. To achieve a good trade-off between accuracy and speed, we adopt the relaxed hierarchy structure from [15], where a set of binary classifiers are organized in a tree or DAG (directed acyclic graph) structure. At each node, classes are colored into positive and negative groups which are separated by a binary classifier while a subset of confusing classes is ignored. We color the classes and learn the induced binary classifier simultaneously using a unified and principled max-margin optimization. We provide an analysis on generalization error to justify our design. Our method has been tested on both Caltech-256 (object recognition) [9] and the SUN dataset (scene classification) [24], and shows significant improvement over existing methods.
  • Keywords
    computer vision; directed graphs; image classification; learning (artificial intelligence); object recognition; Caltech-256 dataset; ImageNet; SUN dataset; binary classifiers; computer vision communities; directed acyclic graph structure; discriminative learning; generalization error analysis; large-scale visual recognition; machine learning; max-margin optimization; multiclass classifier; object recognition; relaxed hierarchy structure; scene classification;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Vision (ICCV), 2011 IEEE International Conference on
  • Conference_Location
    Barcelona
  • ISSN
    1550-5499
  • Print_ISBN
    978-1-4577-1101-5
  • Type

    conf

  • DOI
    10.1109/ICCV.2011.6126481
  • Filename
    6126481