DocumentCode :
2915817
Title :
Large-scale image classification: Fast feature extraction and SVM training
Author :
Lin, Yuanqing ; Lv, Fengjun ; Zhu, Shenghuo ; Yang, Ming ; Cour, Timothee ; Yu, Kai ; Cao, Liangliang ; Huang, Thomas
Author_Institution :
NEC Labs. America, Cupertino, CA, USA
fYear :
2011
fDate :
20-25 June 2011
Firstpage :
1689
Lastpage :
1696
Abstract :
Most research efforts on image classification so far have been focused on medium-scale datasets, which are often defined as datasets that can fit into the memory of a desktop (typically 4G~48G). There are two main reasons for the limited effort on large-scale image classification. First, until the emergence of ImageNet dataset, there was almost no publicly available large-scale benchmark data for image classification. This is mostly because class labels are expensive to obtain. Second, large-scale classification is hard because it poses more challenges than its medium-scale counterparts. A key challenge is how to achieve efficiency in both feature extraction and classifier training without compromising performance. This paper is to show how we address this challenge using ImageNet dataset as an example. For feature extraction, we develop a Hadoop scheme that performs feature extraction in parallel using hundreds of mappers. This allows us to extract fairly sophisticated features (with dimensions being hundreds of thousands) on 1.2 million images within one day. For SVM training, we develop a parallel averaging stochastic gradient descent (ASGD) algorithm for training one-against-all 1000-class SVM classifiers. The ASGD algorithm is capable of dealing with terabytes of training data and converges very fast-typically 5 epochs are sufficient. As a result, we achieve state-of-the-art performance on the ImageNet 1000-class classification, i.e., 52.9% in classification accuracy and 71.8% in top 5 hit rate.
Keywords :
feature extraction; gradient methods; image classification; learning (artificial intelligence); stochastic processes; support vector machines; visual databases; Hadoop scheme; ImageNet dataset; SVM training; classifier training; fast feature extraction; large scale image classification; medium scale datasets; parallel averaging stochastic gradient descent algorithm; Encoding; Feature extraction; Image coding; Kernel; Support vector machines; Training; Training data;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Vision and Pattern Recognition (CVPR), 2011 IEEE Conference on
Conference_Location :
Providence, RI
ISSN :
1063-6919
Print_ISBN :
978-1-4577-0394-2
Type :
conf
DOI :
10.1109/CVPR.2011.5995477
Filename :
5995477
Link To Document :
بازگشت