DocumentCode
3621001
Title
Anthill: a scalable run-time environment for data mining applications
Author
R.A. Ferreira;W. Meira;D. Guedes;L.M.A. Drummond;B. Coutinho;G. Teodoro;T. Tavares;R. Araujo;G.T. Ferreira
Author_Institution
Dept. of Comput. Sci., Univ. Fed. de Minas Gerais, Belo Horizonte, Brazil
fYear
2005
fDate
2005
Firstpage
159
Lastpage
166
Abstract
Data mining techniques are becoming increasingly popular as a reasonable means to collect summaries from the rapidly growing datasets in many areas. However, as the size of the raw data increases, parallel data mining algorithms are becoming a necessity. In this paper, we present a run-time support system designed to allow the efficient implementation of data mining algorithms in heterogeneous distributed environments. We believe that the run-time framework is suitable for a broader class of applications, beyond data mining. We also present a parallelization strategy that is supported by the run-time system. We show scalability results for three different data mining algorithms that were parallelized using our approach and our run-time support. All applications scale almost linearly up to a large number of nodes.
Keywords
"Runtime environment","Data mining","Application software","Clustering algorithms","Algorithm design and analysis","Scalability","Computer science","Costs","Memory","Data analysis"
Publisher
ieee
Conference_Titel
Computer Architecture and High Performance Computing, 2005. SBAC-PAD 2005. 17th International Symposium on
ISSN
1550-6533
Print_ISBN
0-7695-2446-X
Type
conf
DOI
10.1109/CAHPC.2005.12
Filename
1592569