DocumentCode :
3523532
Title :
MapReduce-based efficient algorithm for finding large patterns
Author :
Junqiang Liu ; Yongsheng Wu ; Shijian Xu ; Qingfeng Zhou ; Mengtao Xu
Author_Institution :
Sch. of Inf. & Electron. Eng., Zhejiang Gongshang Univ., Hangzhou, China
fYear :
2015
fDate :
27-29 March 2015
Firstpage :
164
Lastpage :
169
Abstract :
Finding large patterns is an objective of computational intelligence and a key step in many data mining applications, in particular in Big Data applications, where the scalability of mining algorithms is a great issue. This paper proposes an efficient algorithm Pampas that takes full advantage of the MapReduce framework in addressing the scalability issue. The novelty lies in two aspects: Pampas is the first parallel algorithm that integrates a breadth-first search strategy with a vertical mining approach, and Pampas proposes to employ different vertical formats in combination to represent the data, which improves not only scalability but also efficiency. Extensive experimental results demonstrate that the proposed algorithm outperforms the existing algorithms and scales out well with respect to database size and cluster size.
Keywords :
Big Data; data handling; data mining; parallel algorithms; tree searching; Big Data applications; MapReduce-based efficient algorithm; Pampas algorithm; breadth-first search strategy; cluster size; computational intelligence; data mining applications; data representation; database size; large-pattern finding; mining algorithm scalability; parallel algorithm; vertical mining approach;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Advanced Computational Intelligence (ICACI), 2015 Seventh International Conference on
Conference_Location :
Wuyi
Print_ISBN :
978-1-4799-7257-9
Type :
conf
DOI :
10.1109/ICACI.2015.7184769
Filename :
7184769
Link To Document :
بازگشت