DocumentCode :
1842945
Title :
Activation functions with learnable amplitude
Author :
Trentin, Edmondo
Author_Institution :
ITC-irst, Trento, Italy
Volume :
3
fYear :
1999
fDate :
1999
Firstpage :
1794
Abstract :
Network training algorithms have heavily concentrated on the learning of connection weights. Little effort has been made to learn the amplitude of the activation functions, which defines the range of values that the function can take. This paper introduces novel algorithms to learn the amplitudes of nonlinear activations in layered networks, without any assumption on their analytical form. Three instances of the algorithms are developed: (i) a common amplitude is shared among all the nonlinear units; (ii) each layer has its own amplitude; (iii) neuron-specific amplitudes are allowed. Experimental results validate the approach to a large extent, showing a dramatic improvement in performance over the nets with fixed amplitudes
Keywords :
learning (artificial intelligence); multilayer perceptrons; transfer functions; activation function amplitude; connection weight learning; learnable amplitude; multilayered neural networks; network training algorithms; neuron-specific amplitudes; nonlinear activations; Backpropagation algorithms; Cost function; Kernel; Shape control; Shape measurement; Training data;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Neural Networks, 1999. IJCNN '99. International Joint Conference on
Conference_Location :
Washington, DC
ISSN :
1098-7576
Print_ISBN :
0-7803-5529-6
Type :
conf
DOI :
10.1109/IJCNN.1999.832650
Filename :
832650
Link To Document :
بازگشت