DocumentCode
3664392
Title
Study on GSP algorithm based on Hadoop
Author
Huanhuan Li;Xiaofeng Zhou;Chaojun Pan
Author_Institution
Hohai University, Institute of computer and information, Nanjing, Jiangsu Province, China
fYear
2015
fDate
5/1/2015 12:00:00 AM
Firstpage
321
Lastpage
324
Abstract
According to the traditional sequential pattern mining algorithm with high complexity and not suitable for massive data, this paper puts forward Ha-GSP, which is a sequential pattern mining algorithm based on cloud computing platform Hadoop. On the basis of GSP algorithm, this paper research on implementation of GSP algorithm on Hadoop platform, and uses MapReduce model to solve the problem of massive data mining, and then finds frequent sequences. The experimental results show that: improved algorithm with less time, high efficiency.
Keywords
"Algorithm design and analysis","Data mining","Computers","Cloud computing","Distributed databases","Computational modeling"
Publisher
ieee
Conference_Titel
Electronics Information and Emergency Communication (ICEIEC), 2015 5th International Conference on
Print_ISBN
978-1-4799-7283-8
Type
conf
DOI
10.1109/ICEIEC.2015.7284549
Filename
7284549
Link To Document