Title :
Using ontologies for interoperability of data cleaning operations
Author :
Almeida, Ricardo ; Oliveira, Paulo
Author_Institution :
Dept. de Eng. Inf., Inst. Politec. do Porto, Porto, Portugal
Abstract :
The emergence of new business models, namely the establishment of partnerships between organizations, the possibility of companies to add existing data on the web, especially in the semantic web, to their information increase some problems already existing in the databases, particularly related to data quality. Poor data can lead to loss of competitiveness of the organizations holding these data and may even lead to their disappearance, since many of their decision-making are based on them. This makes data cleaning an essential process. The currently existing approaches to solve these problems are closely related with database schemas and specific domains. In order to use this process in different repositories, it is necessary that machines understand these data, i.e., it is necessary an associated semantic. The solution presented includes the use of ontologies: (i) for the specification of data cleaning operations and, (ii) as a way of solving the semantic heterogeneity problems of data stored in different databases. With the cleaning operations defined at the conceptual level and existing mappings between domain ontologies and an ontology associated with a database, they may be instantiated and then proposed to the user to be executed over that database, thus enabling their interoperability.
Keywords :
data analysis; database management systems; ontologies (artificial intelligence); open systems; semantic Web; business models; data cleaning operations; data quality; database schemas; decision-making; domain ontologies; interoperability; semantic heterogeneity problems; semantic web; Cleaning; Data models; Databases; OWL; Ontologies; Quality management; Data Cleaning; Data Quality; Interoperability; Ontologies;
Conference_Titel :
Information Systems and Technologies (CISTI), 2012 7th Iberian Conference on
Print_ISBN :
978-1-4673-2843-2