DocumentCode :
2191169
Title :
Parallelizing an Information Theoretic Co-clustering Algorithm Using a Cloud Middleware
Author :
Ramanathan, Venkatram ; Ma, Wenjing ; Ravi, Vignesh T. ; Liu, Tantan ; Agrawal, Gagan
Author_Institution :
Dept. of Comput. Sci. & Eng., Ohio State Univ., Columbus, OH, USA
fYear :
2010
fDate :
13-13 Dec. 2010
Firstpage :
186
Lastpage :
193
Abstract :
The emerging cloud environments are well suited for storage and analysis of large datasets, since they can allow on-demand access to resources. However, developing high-performance implementations of data analysis tasks is a challenging problem. In our prior work, we have developed a middleware called FREERIDE (FRamework for Rapid Implementation of Data mining Engines). FREERIDE is based upon the observation that the processing structure of a large number of data mining algorithms involves generalized reductions. FREERIDE offers a high-level interface and implements both distributed memory and shared memory parallelization. In this paper, we consider a challenging new data mining algorithm, information theoretic co-clustering, and parallelize it using the FREERIDE middleware. We show how the main processing loops of row clustering and column clustering of the Co-clustering algorithm can essentially be fit into a generalized reduction structure. We achieve good parallel efficiency, with a speedup of nearly 21 on 32 cores.
Keywords :
cloud computing; data analysis; data mining; information retrieval; middleware; parallel algorithms; pattern clustering; cloud middleware; data analysis; data mining algorithm; data storage; generalized reduction structure; information theoretic co-clustering algorithm; on-demand resource access; parallel algorithm; Co-clustering; Parallel Data Mining;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining Workshops (ICDMW), 2010 IEEE International Conference on
Conference_Location :
Sydney, NSW
Print_ISBN :
978-1-4244-9244-2
Electronic_ISBN :
978-0-7695-4257-7
Type :
conf
DOI :
10.1109/ICDMW.2010.100
Filename :
5693299
Link To Document :
بازگشت