DocumentCode
3775990
Title
How to initialize the CNN for small datasets: Extracting discriminative filters from pre-trained model
Author
Guanwen Zhang;Jien Kato;Yu Wang;Kenji Mase
Author_Institution
Graduate School of Information Science, Nagoya University, Nagoya, Japan
fYear
2015
Firstpage
479
Lastpage
483
Abstract
In this paper, we study how to initialize the convolutional neural network (CNN) model for training on a small dataset. Specially, we try to extract discriminative filters from the pre-trained model for a target task. On the basis of relative entropy and linear reconstruction, two methods, Minimum Entropy Loss (MEL) and Minimum Reconstruction Error (MRE), are proposed. The CNN models initialized by the proposed MEL and MRE methods are able to converge fast and achieve better accuracy. We evaluate MEL and MRE on the CIFAR10, CIFAR100, SVHN, and STL-10 public datasets. The consistent performances demonstrate the advantages of the proposed methods.
Keywords
"Training","Entropy","Convergence","Testing","Kernel","Image reconstruction","Convolution"
Publisher
ieee
Conference_Titel
Pattern Recognition (ACPR), 2015 3rd IAPR Asian Conference on
Electronic_ISBN
2327-0985
Type
conf
DOI
10.1109/ACPR.2015.7486549
Filename
7486549
Link To Document