• DocumentCode
    3024809
  • Title

    A Mining Maximal Frequent Itemsets over the Entire History of Data Streams

  • Author

    Mao, Yinmin ; Li, Hong ; Yang, Lumin ; Chen, Zhigang ; Liu, Lixin

  • Author_Institution
    Sch. of Inf. Sci. & Eng., Central South Univ., Changsha, China
  • fYear
    2009
  • fDate
    25-26 April 2009
  • Firstpage
    413
  • Lastpage
    417
  • Abstract
    Mining maximal frequent itemsets has been widely concerned. However, mining data streams is more difficult than mining static databases because of the huge, high-speed and continuous characteristics of streaming data. This paper presents an algorithm, called IDSM-MFI. The algorithm uses a synopsis data structure to store the items of transactions embedded data streams so far. It adopts a top-bottom and bottom-top method to mine the set of all maximal frequent itemsets in landmark windows over data stream, which can be output in real time based on users´ specified thresholds. Theoretical analysis and experimental results show that our algorithm is efficient and scalable for mining the set of all maximal frequent itemsets over the entire history of data stream.
  • Keywords
    data mining; data structures; IDSM-MFI algorithm; bottom-top method; data streams mining; landmark windows; maximal frequent itemset mining; synopsis data structure; top-bottom method; Algorithm design and analysis; Data engineering; Data mining; Data structures; Frequency; History; Information science; Itemsets; Sensor phenomena and characterization; Transaction databases; data mining; data streams; maximal frequent itemsets;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Database Technology and Applications, 2009 First International Workshop on
  • Conference_Location
    Wuhan, Hubei
  • Print_ISBN
    978-0-7695-3604-0
  • Type

    conf

  • DOI
    10.1109/DBTA.2009.125
  • Filename
    5207728