Title :
Parallel Large Average Submatrices Biclustering Based on MapReduce
Author :
Qin Lin;Yun Xue;Wensheng Chen;Shuqun Ye;Wanli Li;Jingjing Liu
Author_Institution :
Sch. of Inf. Eng., Guangdong Med. Univ., Dongguan, China
Abstract :
Large Average Sub matrices (LAS) is one of the biclustering algorithms, which can capture large average sub matrices within a high dimensional data matrix. It has gained increasing popularity in many fields such as biological data analysis and financial forecasting. However, due to urgent requirements for high performance in large scale data processing applications, high performance parallel solutions for LAS biclustering are highly desirable. In this paper, we propose an efficient parallel large average submatrices biclustering based on MapReduce. Furthermore, we boost the search efficiency of LAS by using heap sort. Experimental results demonstrate that the presented parallel algorithm has advantages of both high speedup and good scalability.
Keywords :
"Biology","Electronic mail","Data analysis","Distributed databases","Yttrium","Time complexity","Security"
Conference_Titel :
Computational Intelligence and Security (CIS), 2015 11th International Conference on
DOI :
10.1109/CIS.2015.40