• DocumentCode
    2125420
  • Title

    A Study on XML Path Similarity

  • Author

    Song Ling ; He Wei ; Yang Tongjiang ; Liu Zhendong

  • Author_Institution
    Sch. of Comput. Sci. & Technol., Shandong Jianzhu Univ., Jinan, China
  • fYear
    2009
  • fDate
    20-22 Sept. 2009
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    The data model of XML document can be labeled as a tag tree of element nodes. Such tree model can be represented by the set of paths from the root node to leaf nodes, which describes the structure of XML document. This paper presents an approach for measuring similarity between two XML paths that consists of (1) ElementSim, a similarity function specifically designed for measuring linguistic similarity between two elements in two different paths, which take into account both semantic and syntactical information of elements. (2) NPathSim, a similarity function specifically designed for measuring similarity between two paths, which combines both the linguistic similarity between elements and the context descriptions of paths. Path retrieval was performed to evaluate the quality of NPathSim. The experiments show the proposed similarity approach can achieve higher quality on XML data set.
  • Keywords
    XML; data models; ElementSim; NPathSim; XML path similarity; data model; linguistic similarity; path retrieval; tree model; Algorithm design and analysis; Computer science; Content based retrieval; Context-aware services; Data mining; Data models; Electronic mail; Information retrieval; Performance evaluation; XML;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Management and Service Science, 2009. MASS '09. International Conference on
  • Conference_Location
    Wuhan
  • Print_ISBN
    978-1-4244-4638-4
  • Electronic_ISBN
    978-1-4244-4639-1
  • Type

    conf

  • DOI
    10.1109/ICMSS.2009.5302990
  • Filename
    5302990