Title :
Integration of semistructured data with partial and inconsistent information
Author :
Liu, Mengchi ; Ling, Tok Wang ; Guan, Tao
Author_Institution :
Dept. of Comput. Sci., Regina Univ., Sask., Canada
Abstract :
Data integration from several sources has gained considerable attention with the recent popularity of the World Wide Web. In the real world, some information may be missing (i.e. partial) and some may be inconsistent from several sources. How to obtain information that is as complete as possible and how to detect inconsistency from these sources is thus an interesting question. Most existing work uses a simple graph-based or tree-based semistructured data model to represent heterogeneous data coming from various sites, which fails to account for the existence of partial and inconsistent information. In this paper, we redefine the notion of semistructured objects to reflect the existence of partial and inconsistent information and study how to integrate such objects spread over various sources and check their consistency in the meantime. We propose a new integration operator for this purpose and discuss its semantic properties
Keywords :
data integrity; data structures; database theory; distributed databases; information resources; World Wide Web; data integration; heterogeneous data; inconsistency detection; inconsistent information; information sources; integration operator; missing information; partial information; semantic properties; semistructured data; semistructured objects; Computer science; Data models; Database systems; Relational databases; Tree graphs; Web pages;
Conference_Titel :
Database Engineering and Applications, 1999. IDEAS '99. International Symposium Proceedings
Conference_Location :
Montreal, Que.
Print_ISBN :
0-7695-0265-2
DOI :
10.1109/IDEAS.1999.787250