DocumentCode
2306123
Title
An ontology based semantic heterogeneity measurement framework for optimization in distributed data mining
Author
Liu, Bin ; Cao, Shu-Gui ; Cao, Dong-Fang ; Li, Qing-Chun ; Liu, Hai-Tao ; Shi, Shao-Nan
Author_Institution
Dept. of Comput. Sci. & Technol., Tsinghua Univ., Beijing, China
Volume
1
fYear
2012
fDate
15-17 July 2012
Firstpage
118
Lastpage
123
Abstract
In distributed data mining (DDM) systems, the semantic heterogeneity between data sources has not got universal attentions, which may produce the potential risks of damaging the quality of the final result. This paper presents a semantic distance measurement framework to extract the essential semantic heterogeneity between data sources. In this framework, an ontology-matching based multi-strategy voting method is utilized to comprehensively synthesize the semantic distances between two data source ontologies in element level and structure level. The output of the framework can be leveraged as the foundation to group the data sources for optimizing the DDM result. Finally, the framework is integrated into a DDM architecture we have proposed.
Keywords
data mining; distributed processing; ontologies (artificial intelligence); DDM system; data source ontologies; distributed data mining system; element level; ontology based semantic heterogeneity measurement framework; ontology-matching based multistrategy voting method; optimization; semantic distance measurement framework; structure level; Abstracts; Computer aided instruction; Ontologies; Organizations; Semantics; Syntactics; Thesauri; Semantic heterogeneity; distributed data mining; ontology;
fLanguage
English
Publisher
ieee
Conference_Titel
Machine Learning and Cybernetics (ICMLC), 2012 International Conference on
Conference_Location
Xian
ISSN
2160-133X
Print_ISBN
978-1-4673-1484-8
Type
conf
DOI
10.1109/ICMLC.2012.6358897
Filename
6358897
Link To Document