DocumentCode
502828
Title
A research on extraction method of distributed heterogeneous dataset in multi-support association rule mining
Author
Wang, Bing
Author_Institution
Zhejiang GongShang Univ., Hangzhou, China
Volume
2
fYear
2009
fDate
8-9 Aug. 2009
Firstpage
17
Lastpage
20
Abstract
This paper proposes a method of EMD (extraction method of distributed heterogeneous dataset in multi-support association rule mining) which can be applied into filtering, abstraction, analysis and transformation of data feature record set in multi-support association rule mining. In order to ensure the efficient implementation of multi-support association rule mining, the format of data feature record set should be uniform, especially in the mining of distributed heterogeneous and mass data. Firstly, EMD utilizes XML to define extraction semantic, then extracts and filters data from corresponding data sources according to the semantic. Finally after analyzing the data and transforming it into a standard format, the data is saved in XML for multi-support association rule mining next step.
Keywords
XML; data analysis; data mining; data structures; information filtering; pattern classification; XML; data abstraction; data feature record set transformation; data filtering; dataset analysis; dataset extraction method; distributed heterogeneous dataset; multisupport association rule mining; Association rules; Communication system control; Dairy products; Data mining; Distributed computing; Feature extraction; Filtering; Information systems; Marketing and sales; XML; Feature record set; association rule mining; defining semantic; distributed heterogeneous;
fLanguage
English
Publisher
ieee
Conference_Titel
Computing, Communication, Control, and Management, 2009. CCCM 2009. ISECS International Colloquium on
Conference_Location
Sanya
Print_ISBN
978-1-4244-4247-8
Type
conf
DOI
10.1109/CCCM.2009.5268007
Filename
5268007
Link To Document