DocumentCode :
3172855
Title :
XML Node Semantic Weight Model Based on VSM
Author :
Xinfeng Lai ; Changxuan Wan ; Dexi Liu
Author_Institution :
Sch. of Inf. Technol., Jiangxi Univ. of Finance & Econ., Nanchang
fYear :
2008
fDate :
17-19 Oct. 2008
Firstpage :
247
Lastpage :
251
Abstract :
XML has become an important format for exchange data. Ranking of XML search results directly relates to XML information retrieval performance. Most of the existing ranking models consider words statistical characteristics in the XML document, but they do not consider position of the node a word belongs to. That is to say, all of nodes in XML document have the equal importance. However, different node plays different role in the entire XML document. So, the same content in different node should have different weight. It means different nodes should have different node semantic weight. In this paper, we present a VSM based method for XML node semantic weight (XNSW-VSM), which is scaled by the similarity between the node and the whole document. Experiment data were selected from Wiki data sets. The Pearson correlation coefficient between semantic results given by experts and the model results is 0.827. It shows the node semantic weight model can analyze the importance of node in XML document and it will be helpful for improving ranking results.
Keywords :
XML; information retrieval; semantic Web; tree data structures; Pearson correlation coefficient; XML document; XML node semantic weight model; information retrieval; vector space model; Conference management; Electronic government; Engineering management; Financial management; Information retrieval; Information technology; Knowledge management; Sorting; Technology management; XML; Node Semantic Weight; VSM; XML Node;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Management of e-Commerce and e-Government, 2008. ICMECG '08. International Conference on
Conference_Location :
Jiangxi
Print_ISBN :
978-0-7695-3366-7
Type :
conf
DOI :
10.1109/ICMECG.2008.62
Filename :
4656634
Link To Document :
بازگشت