DocumentCode :
2922676
Title :
Learning to Predict Salient Regions from Disjoint and Skewed Training Sets
Author :
Shoemaker, L. ; Banfield, R.E. ; Hall, L.O. ; Bowyer, K.W. ; Kegelmeyer, W.P.
Author_Institution :
Dept. of Comput. Sci. & Eng., Univ. of South Florida, Tampa, FL
fYear :
2006
fDate :
13-15 Nov. 2006
Firstpage :
116
Lastpage :
126
Abstract :
We present an ensemble learning approach that achieves accurate predictions from arbitrarily partitioned data. The partitions come from the distributed processing requirements of a large scale simulation where the volume of the data is such that classifiers can train only on data local to a given partition. As a result of the partition reflecting the need for efficient simulation analysis, rather than the needs of data mining, the class statistics vary across partitions; indeed some classes will likely be absent from some partitions. We combine a fast ensemble learning algorithm with majority voting to generate an accurate working model of the simulation. Results from several simulations show that regions of interest are successfully identified in spite of training set class imbalances. Accuracy is analyzed both at the level of nodes in the simulation data structure, and in terms of higher-level regions of interest. It is shown that over 98% of salient regions are found in independent test sets. Hence, this approach will be a significant time saver for simulation users and developers
Keywords :
learning (artificial intelligence); class statistics; distributed processing; ensemble learning; large scale simulation; Analytical models; Data mining; Data structures; Distributed processing; Large-scale systems; Partitioning algorithms; Statistical analysis; Statistical distributions; Testing; Voting;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Tools with Artificial Intelligence, 2006. ICTAI '06. 18th IEEE International Conference on
Conference_Location :
Arlington, VA
ISSN :
1082-3409
Print_ISBN :
0-7695-2728-0
Type :
conf
DOI :
10.1109/ICTAI.2006.75
Filename :
4031888
Link To Document :
بازگشت