DocumentCode :
1738453
Title :
A parallel decision tree builder for mining very large visualization datasets
Author :
Bowyer, K.W. ; Hall, L.O. ; Moore, T. ; Chawla, N. ; Kegelmeyer, W.P.
Author_Institution :
Univ. of South Florida, Tampa, FL, USA
Volume :
3
fYear :
2000
fDate :
8-11 Oct. 2000
Firstpage :
1888
Abstract :
Simulation problems in the DOE ASCI program generate visualization datasets more than a terabyte in size. The practical difficulties in visualizing such datasets motivate the desire for automatic recognition of salient events. We have developed a parallel decision tree classifier for use in this context. Comparisons to ScalParC, a previous attempt to build a fast parallelization of a decision tree classifier, are provided. Our parallel classifier executes on the "ASCI Red" supercomputer. Experiments demonstrate that datasets too large to be processed on a single processor can be efficiently handled in parallel, and suggest that there need not be any decrease in accuracy relative to a monolithic classifier constructed on a single processor.
Keywords :
data mining; data visualisation; decision trees; digital simulation; parallel processing; pattern classification; physics computing; ASCI Red supercomputer; DOE ASCI program; ScalParC; automatic event recognition; decision tree classifier; fast parallelization; parallel decision tree builder; simulation problems; very large visualization dataset mining; Acceleration; Classification tree analysis; Concurrent computing; Data visualization; Decision trees; Physics computing; Supercomputers; Testing; Training data; US Department of Energy;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Systems, Man, and Cybernetics, 2000 IEEE International Conference on
Conference_Location :
Nashville, TN
ISSN :
1062-922X
Print_ISBN :
0-7803-6583-6
Type :
conf
DOI :
10.1109/ICSMC.2000.886388
Filename :
886388
Link To Document :
بازگشت