Title :
Study on GSP algorithm based on Hadoop
Author :
Huanhuan Li;Xiaofeng Zhou;Chaojun Pan
Author_Institution :
Hohai University, Institute of computer and information, Nanjing, Jiangsu Province, China
fDate :
5/1/2015 12:00:00 AM
Abstract :
According to the traditional sequential pattern mining algorithm with high complexity and not suitable for massive data, this paper puts forward Ha-GSP, which is a sequential pattern mining algorithm based on cloud computing platform Hadoop. On the basis of GSP algorithm, this paper research on implementation of GSP algorithm on Hadoop platform, and uses MapReduce model to solve the problem of massive data mining, and then finds frequent sequences. The experimental results show that: improved algorithm with less time, high efficiency.
Keywords :
"Algorithm design and analysis","Data mining","Computers","Cloud computing","Distributed databases","Computational modeling"
Conference_Titel :
Electronics Information and Emergency Communication (ICEIEC), 2015 5th International Conference on
Print_ISBN :
978-1-4799-7283-8
DOI :
10.1109/ICEIEC.2015.7284549