Title :
Training a neural network with conjugate gradient methods
Author :
Towsey, Michael ; Alpsan, Dogan ; Sztriha, Laszlo
Author_Institution :
Dept. of Biophys., United Arab Emirates Univ., Al-Ain, United Arab Emirates
Abstract :
This study investigates the use of several variants of conjugate gradient (CG) optimisation and line search methods to accelerate the convergence of an MLP neural network learning two medical signal classification problems. Much of the previous work has been done with artificial problems which have little relevance to real world problems and results on real world problems have been variable. The effectiveness of CG compared to standard backpropagation (BP) depended on the degree to which the learning task required finding a global minimum. If learning was stopped when the training set had been learned to an acceptable degree of error tolerance (the typical pattern classification problem), standard BP was faster than CG and did not display the convergence difficulties usually attributed to it. If learning required finding a global minimum (as in function minimisation or function estimation tasks), CG methods were faster but performance was very much dependent on careful selection of `tuning´ parameters and line search. This requirement for meta-optimisation was more difficult for CG than for BP because of the larger number of parameters
Keywords :
conjugate gradient methods; convergence of numerical methods; learning (artificial intelligence); medical signal processing; multilayer perceptrons; optimisation; pattern classification; search problems; conjugate gradient methods; convergence; function estimation; function minimisation; global minimum; line search methods; medical signal classification problems; meta-optimisation; multilayer perceptron; neural network training; pattern classification; tuning parameters; Acceleration; Artificial neural networks; Backpropagation; Character generation; Convergence; Gradient methods; Neural networks; Optimization methods; Pattern classification; Search methods;
Conference_Titel :
Neural Networks, 1995. Proceedings., IEEE International Conference on
Conference_Location :
Perth, WA
Print_ISBN :
0-7803-2768-3
DOI :
10.1109/ICNN.1995.488128