Title :
Anthill: a scalable run-time environment for data mining applications
Author :
R.A. Ferreira;W. Meira;D. Guedes;L.M.A. Drummond;B. Coutinho;G. Teodoro;T. Tavares;R. Araujo;G.T. Ferreira
Author_Institution :
Dept. of Comput. Sci., Univ. Fed. de Minas Gerais, Belo Horizonte, Brazil
fDate :
2005
Abstract :
Data mining techniques are becoming increasingly popular as a means of extracting summaries from the rapidly growing datasets found in many areas. However, as the size of the raw data increases, parallel data mining algorithms are becoming a necessity. In this paper, we present a run-time support system designed to allow the efficient implementation of data mining algorithms in heterogeneous distributed environments. We believe that the run-time framework is suitable for a broader class of applications, beyond data mining. We also present a parallelization strategy that is supported by the run-time system. We show scalability results for three different data mining algorithms that were parallelized using our approach and our run-time support. All applications scale almost linearly up to a large number of nodes.
Keywords :
"Runtime environment","Data mining","Application software","Clustering algorithms","Algorithm design and analysis","Scalability","Computer science","Costs","Memory","Data analysis"
Conference_Titel :
Computer Architecture and High Performance Computing, 2005. SBAC-PAD 2005. 17th International Symposium on
Print_ISBN :
0-7695-2446-X
DOI :
10.1109/CAHPC.2005.12