DocumentCode :
2108888
Title :
XPC: A novel method for retrieving massive smallscale XML documents via path constraints
Author :
Yuan, Xiaojie ; Zhang, Ying ; Wen, Yanlong ; Zhang, Haiwei ; Wei, JinMao
Author_Institution :
Department of Computer Science and Technology, Nankai University, Tianjin, China, 300071
fYear :
2010
fDate :
4-6 Dec. 2010
Firstpage :
5542
Lastpage :
5546
Abstract :
This paper proposes a novel method for searching massive and small-scale XML documents via path constraints, referred to as XPC, to overcome drawbacks of conventional approaches. Firstly, we propose employing keywords with simple path constraints to retrieve XML data, which provides a user-friendly way without need of understanding complex knowledge and could express user demand accurately. This paper further proposes a novel method for computing term weight in documents via path constraints, called rtf-idf. It measures the similarity of path constraints by N-Gram and other factors according to the structure of the XML documents. Then we rank the relevant documents by an extension of the vector space model. The experimental results show that XPC indeed outperforms the baseline methods such as VSM in plain text and JuruXML.
Keywords :
Automata; Computational modeling; Educational institutions; Intrusion detection; Real time systems; Time factors; N-Gram; Path Constraint; VSM; XML Search;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information Science and Engineering (ICISE), 2010 2nd International Conference on
Conference_Location :
Hangzhou, China
Print_ISBN :
978-1-4244-7616-9
Type :
conf
DOI :
10.1109/ICISE.2010.5689672
Filename :
5689672
Link To Document :
بازگشت