• DocumentCode
    1738453
  • Title

    A parallel decision tree builder for mining very large visualization datasets

  • Author

    Bowyer, K.W. ; Hall, L.O. ; Moore, T. ; Chawla, N. ; Kegelmeyer, W.P.

  • Author_Institution
    Univ. of South Florida, Tampa, FL, USA
  • Volume
    3
  • fYear
    2000
  • fDate
    8-11 Oct. 2000
  • Firstpage
    1888
  • Abstract
    Simulation problems in the DOE ASCI program generate visualization datasets more than a terabyte in size. The practical difficulties in visualizing such datasets motivate the desire for automatic recognition of salient events. We have developed a parallel decision tree classifier for use in this context. Comparisons to ScalParC, a previous attempt to build a fast parallelization of a decision tree classifier, are provided. Our parallel classifier executes on the "ASCI Red" supercomputer. Experiments demonstrate that datasets too large to be processed on a single processor can be efficiently handled in parallel, and suggest that there need not be any decrease in accuracy relative to a monolithic classifier constructed on a single processor.
  • Keywords
    data mining; data visualisation; decision trees; digital simulation; parallel processing; pattern classification; physics computing; ASCI Red supercomputer; DOE ASCI program; ScalParC; automatic event recognition; decision tree classifier; fast parallelization; parallel decision tree builder; simulation problems; very large visualization dataset mining; Acceleration; Classification tree analysis; Concurrent computing; Data visualization; Decision trees; Physics computing; Supercomputers; Testing; Training data; US Department of Energy;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Systems, Man, and Cybernetics, 2000 IEEE International Conference on
  • Conference_Location
    Nashville, TN
  • ISSN
    1062-922X
  • Print_ISBN
    0-7803-6583-6
  • Type

    conf

  • DOI
    10.1109/ICSMC.2000.886388
  • Filename
    886388