• DocumentCode
    2247338
  • Title

    Massive data mining based on item sequence set grid space

  • Author

    Zhou, Lijuan ; Zhang, Zhang ; Xu, Mingsheng

  • Author_Institution
    Inf. Eng. Coll., Capital Normal Univ., Beijing, China
  • Volume
    3
  • fYear
    2010
  • fDate
    6-7 March 2010
  • Firstpage
    208
  • Lastpage
    211
  • Abstract
    According to the stored mode of massive data in the relational database, this paper proposed a fast mining algorithm to find maximum frequent item sets based on item sequence set grid space. The traditional methods for mining association rules generate frequent item sets from small to large. These approaches are either time consuming or computationally expensive, and often generate a large number of redundant candidates or frequent item sets, which is fatal for controlling mining speed as data to mass-level. The goal of this paper is first to use a self-defined structure linked list to storage item sequence then to find the frequent item sets from large to small. Several applications of association rules mining using item sequence set grid space has a good performance but it demonstrated inefficiency in massive data mining. The problem involves time spent on sub item sets finding. Experimental results will be presented to show that the fast mining algorithm ISSDL-DM proposed in this paper use much less time than the similar existing algorithm ISS-DM for achieving the same outcomes.
  • Keywords
    data mining; relational databases; ISSDL-DM algorithm; association rules; item sequence set grid space; massive data mining; relational database; self-defined structure; Asia; Association rules; Automatic control; Data engineering; Data mining; Educational institutions; Informatics; Mesh generation; Robotics and automation; Transaction databases; data structure; item sequence set grid space; massive data mining; maximum frequent item;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Informatics in Control, Automation and Robotics (CAR), 2010 2nd International Asia Conference on
  • Conference_Location
    Wuhan
  • ISSN
    1948-3414
  • Print_ISBN
    978-1-4244-5192-0
  • Electronic_ISBN
    1948-3414
  • Type

    conf

  • DOI
    10.1109/CAR.2010.5456666
  • Filename
    5456666