Title :
Similarity Algorithm Based on Weighted Hierarchical Structure of XML Document
Author :
Sun, Xia ; Cheng, Hong-Bin ; Wang, Hai-Jun
Author_Institution :
Sch. of Comput. Sci., Changshu Inst. of Technol., Changshu, China
Abstract :
A similarity algorithm based on weighted hierarchical structure of XML document is brought forward. The algorithm can calculate the similarity among XML documents efficiently according to hierarchical structure. It can be powerful enough to distinguish the similar structural documents. Experimental results prove that the algorithm reduces the complexity and has fairly high performance. The approach presented in this paper can be used in many applications, such as clustering, structural extracting and change checking of XML documents, etc.
Keywords :
XML; XML document; similarity algorithm; weighted hierarchical structure; Application software; Clustering algorithms; Computer science; Computer science education; Data mining; Information retrieval; Internet; Sun; Web sites; XML; XML; hierarchical structure; similarity;
Conference_Titel :
Information Engineering, 2009. ICIE '09. WASE International Conference on
Conference_Location :
Taiyuan, Chanxi
Print_ISBN :
978-0-7695-3679-8
DOI :
10.1109/ICIE.2009.78