DocumentCode :
2306123
Title :
An ontology based semantic heterogeneity measurement framework for optimization in distributed data mining
Author :
Liu, Bin ; Cao, Shu-Gui ; Cao, Dong-Fang ; Li, Qing-Chun ; Liu, Hai-Tao ; Shi, Shao-Nan
Author_Institution :
Dept. of Comput. Sci. & Technol., Tsinghua Univ., Beijing, China
Volume :
1
fYear :
2012
fDate :
15-17 July 2012
Firstpage :
118
Lastpage :
123
Abstract :
In distributed data mining (DDM) systems, the semantic heterogeneity between data sources has not got universal attentions, which may produce the potential risks of damaging the quality of the final result. This paper presents a semantic distance measurement framework to extract the essential semantic heterogeneity between data sources. In this framework, an ontology-matching based multi-strategy voting method is utilized to comprehensively synthesize the semantic distances between two data source ontologies in element level and structure level. The output of the framework can be leveraged as the foundation to group the data sources for optimizing the DDM result. Finally, the framework is integrated into a DDM architecture we have proposed.
Keywords :
data mining; distributed processing; ontologies (artificial intelligence); DDM system; data source ontologies; distributed data mining system; element level; ontology based semantic heterogeneity measurement framework; ontology-matching based multistrategy voting method; optimization; semantic distance measurement framework; structure level; Abstracts; Computer aided instruction; Ontologies; Organizations; Semantics; Syntactics; Thesauri; Semantic heterogeneity; distributed data mining; ontology;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Machine Learning and Cybernetics (ICMLC), 2012 International Conference on
Conference_Location :
Xian
ISSN :
2160-133X
Print_ISBN :
978-1-4673-1484-8
Type :
conf
DOI :
10.1109/ICMLC.2012.6358897
Filename :
6358897
Link To Document :
بازگشت