DocumentCode :
1968582
Title :
On XML structural similarity
Author :
Piao Yong ; Liu Chen ; Wang Xiu-Kun
Author_Institution :
EI Sch., Dalian Univ. of Technol., Dalian, China
Volume :
1
fYear :
2010
fDate :
10-11 July 2010
Firstpage :
448
Lastpage :
451
Abstract :
A model of XML document is extended by considering both path and frequency information, namely the frequency-path model. Based on this model, a structural similarity calculation algorithm with position and frequency weight by longest common subsequence (PFWLCS) is proposed, which is fast and has high precision. Furthermore the selection of the position and frequency factors are discussed in depth. Experiments show that the PFWLCS has higher recall ratio and accuracy than existing similarity calculation methods, especially on XML with different Structures.
Keywords :
XML; PFWLCS; XML document model; XML structural similarity; frequency-path model; position and frequency weight by longest common subsequence; structural similarity calculation algorithm; Algorithm design and analysis; Lead; XML; frequency weight; position weight; structure similarity; the longest common subsequence;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Industrial and Information Systems (IIS), 2010 2nd International Conference on
Conference_Location :
Dalian
Print_ISBN :
978-1-4244-7860-6
Type :
conf
DOI :
10.1109/INDUSIS.2010.5565813
Filename :
5565813
Link To Document :
بازگشت