DocumentCode
2958355
Title
Discriminative learning of relaxed hierarchy for large-scale visual recognition
Author
Gao, Tianshi ; Koller, Daphne
Author_Institution
Dept. of Electr. Eng., Stanford Univ., Stanford, CA, USA
fYear
2011
fDate
6-13 Nov. 2011
Firstpage
2072
Lastpage
2079
Abstract
In the real visual world, the number of categories a classifier needs to discriminate is on the order of hundreds or thousands. For example, the SUN dataset [24] contains 899 scene categories and ImageNet [6] has 15,589 synsets. Designing a multiclass classifier that is both accurate and fast at test time is an extremely important problem in both machine learning and computer vision communities. To achieve a good trade-off between accuracy and speed, we adopt the relaxed hierarchy structure from [15], where a set of binary classifiers are organized in a tree or DAG (directed acyclic graph) structure. At each node, classes are colored into positive and negative groups which are separated by a binary classifier while a subset of confusing classes is ignored. We color the classes and learn the induced binary classifier simultaneously using a unified and principled max-margin optimization. We provide an analysis on generalization error to justify our design. Our method has been tested on both Caltech-256 (object recognition) [9] and the SUN dataset (scene classification) [24], and shows significant improvement over existing methods.
Keywords
computer vision; directed graphs; image classification; learning (artificial intelligence); object recognition; Caltech-256 dataset; ImageNet; SUN dataset; binary classifiers; computer vision communities; directed acyclic graph structure; discriminative learning; generalization error analysis; large-scale visual recognition; machine learning; max-margin optimization; multiclass classifier; object recognition; relaxed hierarchy structure; scene classification;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Vision (ICCV), 2011 IEEE International Conference on
Conference_Location
Barcelona
ISSN
1550-5499
Print_ISBN
978-1-4577-1101-5
Type
conf
DOI
10.1109/ICCV.2011.6126481
Filename
6126481
Link To Document