DocumentCode
2018431
Title
Data selection based on Bayesian error bar
Author
Cho, S. ; Choi, S. ; Wong, P.M.
Author_Institution
Dept. of Ind. Eng., Seoul Nat. Univ., South Korea
Volume
1
fYear
1999
fDate
1999
Firstpage
418
Abstract
Outliers, noise and data density imbalance, present in most real world data, render it difficult to properly train neural networks. Conventionally residual analysis was used to detect outliers. When used with neural networks, however, the procedure is computationally costly. The authors propose an efficient heuristic data selection method that is based on Bayesian error bars. After a neural network is trained, the residual and error bar are computed for each data. The data that correspond to large residual or large error bars are removed from the training data set. The remaining data are then used to further train the network. The proposed approach was applied to two real world problems: rock porosity and permeability prediction problems in reservoir engineering, with a significant generalization performance improvement of 30-55%. This preliminary result suggests that the approach deserves further investigation
Keywords
Bayes methods; civil engineering computing; data handling; errors; learning (artificial intelligence); neural nets; water supply; Bayesian error bar; data density imbalance; data selection; generalization performance improvement; heuristic data selection method; neural network training; outliers; permeability prediction problems; real world data; real world problems; reservoir engineering; residual analysis; rock porosity; training data set; Australia; Bars; Bayesian methods; Computer networks; Data security; Industrial engineering; National security; Neural networks; Petroleum; Training data;
fLanguage
English
Publisher
ieee
Conference_Titel
Neural Information Processing, 1999. Proceedings. ICONIP '99. 6th International Conference on
Conference_Location
Perth, WA
Print_ISBN
0-7803-5871-6
Type
conf
DOI
10.1109/ICONIP.1999.844025
Filename
844025
Link To Document