DocumentCode :
2247338
Title :
Massive data mining based on item sequence set grid space
Author :
Zhou, Lijuan ; Zhang, Zhang ; Xu, Mingsheng
Author_Institution :
Inf. Eng. Coll., Capital Normal Univ., Beijing, China
Volume :
3
fYear :
2010
fDate :
6-7 March 2010
Firstpage :
208
Lastpage :
211
Abstract :
According to the stored mode of massive data in the relational database, this paper proposed a fast mining algorithm to find maximum frequent item sets based on item sequence set grid space. The traditional methods for mining association rules generate frequent item sets from small to large. These approaches are either time consuming or computationally expensive, and often generate a large number of redundant candidates or frequent item sets, which is fatal for controlling mining speed as data to mass-level. The goal of this paper is first to use a self-defined structure linked list to storage item sequence then to find the frequent item sets from large to small. Several applications of association rules mining using item sequence set grid space has a good performance but it demonstrated inefficiency in massive data mining. The problem involves time spent on sub item sets finding. Experimental results will be presented to show that the fast mining algorithm ISSDL-DM proposed in this paper use much less time than the similar existing algorithm ISS-DM for achieving the same outcomes.
Keywords :
data mining; relational databases; ISSDL-DM algorithm; association rules; item sequence set grid space; massive data mining; relational database; self-defined structure; Asia; Association rules; Automatic control; Data engineering; Data mining; Educational institutions; Informatics; Mesh generation; Robotics and automation; Transaction databases; data structure; item sequence set grid space; massive data mining; maximum frequent item;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Informatics in Control, Automation and Robotics (CAR), 2010 2nd International Asia Conference on
Conference_Location :
Wuhan
ISSN :
1948-3414
Print_ISBN :
978-1-4244-5192-0
Electronic_ISBN :
1948-3414
Type :
conf
DOI :
10.1109/CAR.2010.5456666
Filename :
5456666
Link To Document :
بازگشت