DocumentCode :
3775990
Title :
How to initialize the CNN for small datasets: Extracting discriminative filters from pre-trained model
Author :
Guanwen Zhang;Jien Kato;Yu Wang;Kenji Mase
Author_Institution :
Graduate School of Information Science, Nagoya University, Nagoya, Japan
fYear :
2015
Firstpage :
479
Lastpage :
483
Abstract :
In this paper, we study how to initialize the convolutional neural network (CNN) model for training on a small dataset. Specially, we try to extract discriminative filters from the pre-trained model for a target task. On the basis of relative entropy and linear reconstruction, two methods, Minimum Entropy Loss (MEL) and Minimum Reconstruction Error (MRE), are proposed. The CNN models initialized by the proposed MEL and MRE methods are able to converge fast and achieve better accuracy. We evaluate MEL and MRE on the CIFAR10, CIFAR100, SVHN, and STL-10 public datasets. The consistent performances demonstrate the advantages of the proposed methods.
Keywords :
"Training","Entropy","Convergence","Testing","Kernel","Image reconstruction","Convolution"
Publisher :
ieee
Conference_Titel :
Pattern Recognition (ACPR), 2015 3rd IAPR Asian Conference on
Electronic_ISBN :
2327-0985
Type :
conf
DOI :
10.1109/ACPR.2015.7486549
Filename :
7486549
Link To Document :
بازگشت