Title :
Statistical active learning in multilayer perceptrons
Author_Institution :
RIKEN, Inst. of Phys. & Chem. Res., Saitama, Japan
fDate :
1/1/2000 12:00:00 AM
Abstract :
Proposes methods for generating input locations actively in gathering training data, aiming at solving problems unique to muitilayer perceptrons. One of the problems is that optimum input locations, which are calculated deterministically, sometimes distribute densely around the same point and cause local minima in backpropagation training. Two probabilistic active learning methods, which utilize the statistical variance of locations, are proposed to solve this problem. One is parametric active learning and the other is multipoint-search active learning. Another serious problem in applying active learning to multilayer perceptrons is that a Fisher information matrix can be singular, while many methods, including the proposed ones, assume its regularity. A technique of pruning redundant hidden units is proposed to keep the Fisher information matrix regular. Combined with this technique, active learning can be applied stably to multilayer perceptrons. The effectiveness of the proposed methods is demonstrated through computer simulations on simple artificial problems and a real-world problem of color conversion
Keywords :
covariance matrices; information theory; learning (artificial intelligence); multilayer perceptrons; search problems; statistical analysis; Fisher information matrix; input locations; multipoint-search active learning; parametric active learning; probabilistic active learning methods; pruning; redundant hidden units; statistical active learning; statistical variance; training data; Backpropagation; Computer simulation; Design for experiments; Learning systems; Machine learning; Mean square error methods; Multilayer perceptrons; Nonhomogeneous media; Response surface methodology; Training data;
Journal_Title :
Neural Networks, IEEE Transactions on