• DocumentCode
    3093360
  • Title

    A methodology for using edges to measure structural and semantic similarity of XML documents

  • Author

    Qiu, Hong-jun ; Yu, Wen-jing

  • Author_Institution
    Sch. of Eng., Shantou Univ., Shantou, China
  • Volume
    3
  • fYear
    2009
  • fDate
    12-15 July 2009
  • Firstpage
    1653
  • Lastpage
    1658
  • Abstract
    XML is a standard data representation format that comes with its own structure and semantics. The similarity measurement of XML should include data, structure and semantics, but the semantic measurement has not yet received strong attention. Aiming at the product structure domain, a methodology for using edges to measure structural and semantic similarity of XML is presented in this paper. Based on the semantics of product structure described in XML, the edge constraint is used to improve the structural similarity efficiency. An effective weight mechanism interrelated with XML model hierarchy is adopted to address the semantics problem, to enhance the similarity precision. The implement pseudocode is presented. The experimental tests demonstrate that the proposed method can efficiently measure the structural and semantic similarity of product structures described in XML.
  • Keywords
    XML; data structures; XML document; edge constraint; effective weight mechanism; extensible markup language; semantic similarity; standard data representation format; structural similarity efficiency; Assembly; Computational complexity; Cybernetics; Data engineering; Machine learning; Machine learning algorithms; Measurement standards; Software measurement; Software standards; XML; Measurement; Product structure; Structural and semantic similarity; XML;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Machine Learning and Cybernetics, 2009 International Conference on
  • Conference_Location
    Baoding
  • Print_ISBN
    978-1-4244-3702-3
  • Electronic_ISBN
    978-1-4244-3703-0
  • Type

    conf

  • DOI
    10.1109/ICMLC.2009.5212295
  • Filename
    5212295