DocumentCode :
2171179
Title :
Mining frequent itemsets in data streams based on genetic algorithm
Author :
Chong Han ; Lijuan Sun ; Jian Guo ; Xiaodong Chen
Author_Institution :
Coll. of Comput., Nanjing Univ. of Posts & Telecommun., Nanjing, China
fYear :
2013
fDate :
17-19 Nov. 2013
Firstpage :
748
Lastpage :
753
Abstract :
Stream data is a very common data type in big data and in many data streams applications, users tend to pay more attention to the mode information of the data streams. So mining frequent patterns in data streams is a significative work. Meanwhile, finding frequent itemests in a data set with predefined fixed support threshold could be seen as an optimization problem. In this paper, the problem of frequent itemsets mining is derived as a non-linear optimization problem, then genetic algorithm is adopted to solve it. Through the formal and bitmap representation of frequent itemsets, the non-linear optimization problem is transformed to 0-1 programming. A set of experimental results show that unlike typical Apriori algorithm, the complexity of time and memory space grows exponentially as the support decrease, our proposed algorithm has a high time and space efficiency even with a very low support.
Keywords :
Big Data; computational complexity; data mining; genetic algorithms; nonlinear programming; 0-1 programming; Big Data; apriori algorithm; bitmap representation; data set; data streams applications; data type; frequent itemsets mining; frequent pattern mining; genetic algorithm; memory space; nonlinear optimization problem; time complexity; Biological cells; Data mining; Genetic algorithms; Itemsets; Sociology; Statistics; Big data; data streams; frequent itemsets; genetic algorithm;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Communication Technology (ICCT), 2013 15th IEEE International Conference on
Conference_Location :
Guilin
Type :
conf
DOI :
10.1109/ICCT.2013.6820474
Filename :
6820474
Link To Document :
بازگشت