Title :
Hyperparameter search for deep convolutional neural network using effect factors
Author :
Zhenzhen Li ; Lianwen Jin ; Chunlin Yang ; Zhuoyao Zhong
Author_Institution :
South China Univ. of Technol., Guangzhou, China
Abstract :
Learning a deep architecture involves a tough issue called hyperparameter search. This is especially the case for convolutional neural networks with a large number of hyperparameters. To solve this problem, we propose a tensor completion method to predict the best architecture configurations for convolutional neural networks. This method is based on a hypothesis that the generalization performance of a deep architecture is controlled by several effect factors, each of which is a function of hyperparameter of the deep architecture. Predicted generalization accuracy of the best configurations are checked by carrying out deep learning computation. Since generalization performance for a practical recognition task is always data- and code-dependent, we tried out our method on an open deep learning platform named Caffe, and we increased the generalization accuracy from 98.97% to around 99.25% on MNIST by replacing only five numbers.
Keywords :
learning (artificial intelligence); neural nets; tensors; Caffe open deep learning platform; MNIST; deep architecture learning; deep convolutional neural network; hyperparameter search; tensor completion method; Accuracy; Computer architecture; Convolutional codes; Machine learning; Neural networks; Noise; Tensile stress; Architecture hyperparameter search; artificial neural network; convolutional neural network; deep learning; machine learning;
Conference_Titel :
Signal and Information Processing (ChinaSIP), 2015 IEEE China Summit and International Conference on
Conference_Location :
Chengdu
DOI :
10.1109/ChinaSIP.2015.7230511