DocumentCode :
3664392
Title :
Study on GSP algorithm based on Hadoop
Author :
Huanhuan Li;Xiaofeng Zhou;Chaojun Pan
Author_Institution :
Hohai University, Institute of computer and information, Nanjing, Jiangsu Province, China
fYear :
2015
fDate :
5/1/2015 12:00:00 AM
Firstpage :
321
Lastpage :
324
Abstract :
According to the traditional sequential pattern mining algorithm with high complexity and not suitable for massive data, this paper puts forward Ha-GSP, which is a sequential pattern mining algorithm based on cloud computing platform Hadoop. On the basis of GSP algorithm, this paper research on implementation of GSP algorithm on Hadoop platform, and uses MapReduce model to solve the problem of massive data mining, and then finds frequent sequences. The experimental results show that: improved algorithm with less time, high efficiency.
Keywords :
"Algorithm design and analysis","Data mining","Computers","Cloud computing","Distributed databases","Computational modeling"
Publisher :
ieee
Conference_Titel :
Electronics Information and Emergency Communication (ICEIEC), 2015 5th International Conference on
Print_ISBN :
978-1-4799-7283-8
Type :
conf
DOI :
10.1109/ICEIEC.2015.7284549
Filename :
7284549
Link To Document :
بازگشت