• DocumentCode
    3363096
  • Title

    Ascending frequency ordered prefix-tree: efficient mining of frequent patterns

  • Author

    Liu, Guimei ; Lu, Hongjun ; Xu, Yabo ; Yu, Jeffrey Xu

  • Author_Institution
    Dept. of Comput. Sci., Hong Kong Univ. of Sci. & Technol., China
  • fYear
    2003
  • fDate
    26-28 March 2003
  • Firstpage
    65
  • Lastpage
    72
  • Abstract
    Mining frequent patterns is a fundamental and important problem in many data mining applications. Many of the algorithms adopt the pattern growth approach, which is shown to be superior to the candidate generate-and-test approach significantly. We identify the key factors that influence the performance of the pattern growth approach, and optimize them to further improve the performance. Our algorithm uses a simple while compact data structure-ascending frequency ordered prefixtree (AFOPT) to organize the conditional databases, in which we use arrays to store single branches to further save space. We traverse our prefix-tree structure using a top-down strategy. Our experiment results show that the combination of the top-down traversal strategy and the ascending frequency item ordering method achieves significant performance improvement over previous works.
  • Keywords
    data mining; software performance evaluation; tree data structures; tree searching; very large databases; arrays; ascending frequency ordered prefix-tree; candidate generate-and-test approach; conditional databases; data mining; data mining applications; experiment results; frequent pattern mining; pattern growth approach; performance improvement; top-down strategy; tree data structure; very large databases; Computer science; Costs; Councils; Data mining; Frequency; Itemsets; Space exploration; Spatial databases; Transaction databases;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Database Systems for Advanced Applications, 2003. (DASFAA 2003). Proceedings. Eighth International Conference on
  • Conference_Location
    Kyoto, Japan
  • Print_ISBN
    0-7695-1895-8
  • Type

    conf

  • DOI
    10.1109/DASFAA.2003.1192369
  • Filename
    1192369