Title :
Bridging the Gap between Heterogeneous and Semantically Diverse Content of Different Disciplines
Author :
Bykau, Siarhei ; Kiyavitskaya, Nadzeya ; Tsinaraki, Chrisa ; Velegrakis, Yannis
fDate :
Aug. 30 2010-Sept. 3 2010
Abstract :
The Web has been flooded with highly heterogeneous data sources that freely offer their data to the public. Careful design and compliance to standards is a way to cope with the heterogeneity. However, any agreement and compliance is practically hard to achieve across different communities. In this work we describe a framework that enables the exploitation of content across different scientific disciplines. Our approach combines several novel techniques at the syntactic, structural and semantic level. In particular, we advocate that integration should take place at the much higher level, factoring out any syntactic discrepancies, and facilitating the exchange of information. We show how a novel technique for data annotation using intentional attributes can cope with data associations in high data volumes, we present a way to overcome the multilingualism barrier, and describe a new kind of database that considers data evolution as first class citizen with the additional ability to annotate free text.
Keywords :
Internet; data analysis; distributed databases; World Wide Web; data annotation; data associations; data evolution; data volumes; heterogeneous data sources; intentional attributes; multilingualism barrier; semantically diverse content; syntactic discrepancy; Biotechnology; Context; Data models; Data structures; Databases; Internet; Semantics;
Conference_Titel :
Database and Expert Systems Applications (DEXA), 2010 Workshop on
Conference_Location :
Bilbao
Print_ISBN :
978-1-4244-8049-4
DOI :
10.1109/DEXA.2010.67