DocumentCode :
2724002
Title :
The Parallel Classification of Very Large Collections of Data on Multi-core Platforms
Author :
Andrada, Dodut Aurora ; Lemnaru, Camelia ; Potolea, Rodica
Author_Institution :
Tech. Univ. of Cluj-Napoca, Cluj-Napoca, Romania
fYear :
2011
fDate :
6-8 July 2011
Firstpage :
57
Lastpage :
62
Abstract :
Perhaps the most utilized and demanded task in data mining is classification. Most existing classification algorithms require all the data used for constructing the model for classification, or at least a good part of it, to be stored in the memory. This makes them limited by the availability of the memory. We present a parallel algorithm, based on the SPRINT decision tree, which eliminates the dependency on the memory available by storing the data to be processed in a database.
Keywords :
data mining; decision trees; multiprocessing systems; parallel algorithms; very large databases; SPRINT decision tree; data mining; multicore platforms; parallel algorithm; parallel classification; very large collections; Buildings; Data mining; Decision trees; Indexes; Instruction sets; Training; classification; large datasets; multi-thread; parallel computation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel and Distributed Computing (ISPDC), 2011 10th International Symposium on
Conference_Location :
Cluj Napoca
Print_ISBN :
978-1-4577-1536-5
Type :
conf
DOI :
10.1109/ISPDC.2011.18
Filename :
6108256
Link To Document :
بازگشت