Title :
Sparse Output Coding for Large-Scale Visual Recognition
Author :
Bin Zhao ; Xing, Eric P.
Author_Institution :
Sch. of Comput. Sci., Carnegie Mellon Univ., Pittsburgh, PA, USA
Abstract :
Many vision tasks require a multi-class classifier to discriminate multiple categories, on the order of hundreds or thousands. In this paper, we propose sparse output coding, a principled way for large-scale multi-class classification, by turning high-cardinality multi-class categorization into a bit-by-bit decoding problem. Specifically, sparse output coding is composed of two steps: efficient coding matrix learning with scalability to thousands of classes, and probabilistic decoding. Empirical results on object recognition and scene classification demonstrate the effectiveness of our proposed approach.
Keywords :
decoding; image classification; learning (artificial intelligence); matrix algebra; object recognition; probability; bit-by-bit decoding problem; coding matrix learning; high-cardinality multiclass categorization; large-scale multiclass classification; large-scale visual recognition; multiclass classifier; object recognition; probabilistic decoding; scene classification; sparse output coding; Accuracy; Decoding; Encoding; Gradient methods; Training data; Visualization;
Conference_Titel :
Computer Vision and Pattern Recognition (CVPR), 2013 IEEE Conference on
Conference_Location :
Portland, OR
DOI :
10.1109/CVPR.2013.430