مرکز منطقه ای اطلاع رساني علوم و فناوري - Statistical active learning in multilayer perceptrons

DocumentCode :

1299635

Title :

Statistical active learning in multilayer perceptrons

Author :

Fukumizu, Kenji

Author_Institution :

RIKEN, Inst. of Phys. & Chem. Res., Saitama, Japan

Volume :

Issue :

fYear :

2000

fDate :

1/1/2000 12:00:00 AM

Firstpage :

Lastpage :

Abstract :

Proposes methods for generating input locations actively in gathering training data, aiming at solving problems unique to muitilayer perceptrons. One of the problems is that optimum input locations, which are calculated deterministically, sometimes distribute densely around the same point and cause local minima in backpropagation training. Two probabilistic active learning methods, which utilize the statistical variance of locations, are proposed to solve this problem. One is parametric active learning and the other is multipoint-search active learning. Another serious problem in applying active learning to multilayer perceptrons is that a Fisher information matrix can be singular, while many methods, including the proposed ones, assume its regularity. A technique of pruning redundant hidden units is proposed to keep the Fisher information matrix regular. Combined with this technique, active learning can be applied stably to multilayer perceptrons. The effectiveness of the proposed methods is demonstrated through computer simulations on simple artificial problems and a real-world problem of color conversion

Keywords :

covariance matrices; information theory; learning (artificial intelligence); multilayer perceptrons; search problems; statistical analysis; Fisher information matrix; input locations; multipoint-search active learning; parametric active learning; probabilistic active learning methods; pruning; redundant hidden units; statistical active learning; statistical variance; training data; Backpropagation; Computer simulation; Design for experiments; Learning systems; Machine learning; Mean square error methods; Multilayer perceptrons; Nonhomogeneous media; Response surface methodology; Training data;

fLanguage :

English

Journal_Title :

Neural Networks, IEEE Transactions on

Publisher :

ieee

ISSN :

1045-9227

Type :

jour

DOI :

10.1109/72.822506

Filename :

822506

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1299635