• DocumentCode
    3137507
  • Title

    Quality Data for Data Mining and Data Mining for Quality Data: A Constraint Based Approach in XML

  • Author

    Shahriar, Md Sumon ; Anam, Sarawat

  • Author_Institution
    Univ. of South Australia, Adelaide, SA
  • Volume
    2
  • fYear
    2008
  • fDate
    13-15 Dec. 2008
  • Firstpage
    46
  • Lastpage
    49
  • Abstract
    As quality data is important for data mining, reversely data mining is necessary to measure the quality of data. Specifically, in XML, the issue of quality data for mining purposes and also using data mining techniques for quality measures is becoming more necessary as a massive amount of data is being stored and represented over the Web. We propose two important interrelated issues: how quality XML data is useful for data mining in XML and how data mining in XML is used to measure the quality data for XML. When we address both issues, we consider XML constraints because constraints in XML can be used for quality measurement in XML data and also for finding some important patterns and association rules in XML data mining. We note that XML constraints can play an important role for data quality and data mining in XML. We address the theoretical framework rather than solutions. Our research framework is towards the broader task of data mining and data quality for XML data integrations.
  • Keywords
    XML; data mining; Web; XML; constraint-based approach; data mining; quality data; Association rules; Australia; Conferences; Data engineering; Data mining; Databases; Proposals; XML; CONSTRAINTS IN XML; DATA MINING; DATA QUALITY; XML;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Future Generation Communication and Networking Symposia, 2008. FGCNS '08. Second International Conference on
  • Conference_Location
    Sanya
  • Print_ISBN
    978-1-4244-3430-5
  • Electronic_ISBN
    978-0-7695-3546-3
  • Type

    conf

  • DOI
    10.1109/FGCNS.2008.74
  • Filename
    4813519