DocumentCode :
3137507
Title :
Quality Data for Data Mining and Data Mining for Quality Data: A Constraint Based Approach in XML
Author :
Shahriar, Md Sumon ; Anam, Sarawat
Author_Institution :
Univ. of South Australia, Adelaide, SA
Volume :
2
fYear :
2008
fDate :
13-15 Dec. 2008
Firstpage :
46
Lastpage :
49
Abstract :
As quality data is important for data mining, reversely data mining is necessary to measure the quality of data. Specifically, in XML, the issue of quality data for mining purposes and also using data mining techniques for quality measures is becoming more necessary as a massive amount of data is being stored and represented over the Web. We propose two important interrelated issues: how quality XML data is useful for data mining in XML and how data mining in XML is used to measure the quality data for XML. When we address both issues, we consider XML constraints because constraints in XML can be used for quality measurement in XML data and also for finding some important patterns and association rules in XML data mining. We note that XML constraints can play an important role for data quality and data mining in XML. We address the theoretical framework rather than solutions. Our research framework is towards the broader task of data mining and data quality for XML data integrations.
Keywords :
XML; data mining; Web; XML; constraint-based approach; data mining; quality data; Association rules; Australia; Conferences; Data engineering; Data mining; Databases; Proposals; XML; CONSTRAINTS IN XML; DATA MINING; DATA QUALITY; XML;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Future Generation Communication and Networking Symposia, 2008. FGCNS '08. Second International Conference on
Conference_Location :
Sanya
Print_ISBN :
978-1-4244-3430-5
Electronic_ISBN :
978-0-7695-3546-3
Type :
conf
DOI :
10.1109/FGCNS.2008.74
Filename :
4813519
Link To Document :
بازگشت