DocumentCode :
3335188
Title :
DIInCX: An Approach to Discovery of Implicit Integrity Constraints from XML Data
Author :
Rodrigues, Khaue Rezende ; Mello, Ronaldo Dos Santos
Author_Institution :
Univ. Fed. de Santa Catarina-UFSC, Santa Catarina
fYear :
2007
fDate :
13-15 Aug. 2007
Firstpage :
606
Lastpage :
611
Abstract :
We propose an approach for discovery of implicit semantic integrity constraints (SIC) from XML instances called DIInCX. DIInCX is a process composed by three phases: preprocessing, discovering and conversion. Our motivation with this work is to improve the activity of XML semantic data integration or XML information extraction systems, complementing their resulting XML schemata with SIC rules that cannot be explicitly perceived by a human user. Our approach is validated through experiments that show that the discovered SIC rules are valid, human readable and not complex to be implemented because they are based on simple restrict conditions.
Keywords :
XML; data integrity; data mining; programming language semantics; DIInCX; XML information extraction systems; XML schemata; XML semantic data integration; semantic integrity constraints; Association rules; Data mining; Data models; Delta modulation; Humans; Integrated circuit modeling; Itemsets; Silicon carbide; Terminology; XML;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information Reuse and Integration, 2007. IRI 2007. IEEE International Conference on
Conference_Location :
Las Vegas, IL
Print_ISBN :
1-4244-1500-4
Electronic_ISBN :
1-4244-1500-4
Type :
conf
DOI :
10.1109/IRI.2007.4296687
Filename :
4296687
Link To Document :
بازگشت