Title :
A methodology for using edges to measure structural and semantic similarity of XML documents
Author :
Qiu, Hong-jun ; Yu, Wen-jing
Author_Institution :
Sch. of Eng., Shantou Univ., Shantou, China
Abstract :
XML is a standard data representation format that comes with its own structure and semantics. The similarity measurement of XML should include data, structure and semantics, but the semantic measurement has not yet received strong attention. Aiming at the product structure domain, a methodology for using edges to measure structural and semantic similarity of XML is presented in this paper. Based on the semantics of product structure described in XML, the edge constraint is used to improve the structural similarity efficiency. An effective weight mechanism interrelated with XML model hierarchy is adopted to address the semantics problem, to enhance the similarity precision. The implement pseudocode is presented. The experimental tests demonstrate that the proposed method can efficiently measure the structural and semantic similarity of product structures described in XML.
Keywords :
XML; data structures; XML document; edge constraint; effective weight mechanism; extensible markup language; semantic similarity; standard data representation format; structural similarity efficiency; Assembly; Computational complexity; Cybernetics; Data engineering; Machine learning; Machine learning algorithms; Measurement standards; Software measurement; Software standards; XML; Measurement; Product structure; Structural and semantic similarity; XML;
Conference_Titel :
Machine Learning and Cybernetics, 2009 International Conference on
Conference_Location :
Baoding
Print_ISBN :
978-1-4244-3702-3
Electronic_ISBN :
978-1-4244-3703-0
DOI :
10.1109/ICMLC.2009.5212295