• DocumentCode
    3664392
  • Title

    Study on GSP algorithm based on Hadoop

  • Author

    Huanhuan Li;Xiaofeng Zhou;Chaojun Pan

  • Author_Institution
    Hohai University, Institute of Computer and Information, Nanjing, Jiangsu Province, China
  • fYear
    2015
  • fDate
    5/1/2015 12:00:00 AM
  • Firstpage
    321
  • Lastpage
    324
  • Abstract
    Traditional sequential pattern mining algorithms have high complexity and are not suitable for massive data. To address this, the paper puts forward Ha-GSP, a sequential pattern mining algorithm based on the cloud computing platform Hadoop. Building on the GSP algorithm, the paper studies its implementation on the Hadoop platform and uses the MapReduce model to mine massive data and find frequent sequences. The experimental results show that the improved algorithm takes less time and achieves high efficiency.
  • Keywords
    "Algorithm design and analysis","Data mining","Computers","Cloud computing","Distributed databases","Computational modeling"
  • Publisher
    IEEE
  • Conference_Titel
    Electronics Information and Emergency Communication (ICEIEC), 2015 5th International Conference on
  • Print_ISBN
    978-1-4799-7283-8
  • Type

    conf

  • DOI
    10.1109/ICEIEC.2015.7284549
  • Filename
    7284549