DocumentCode :
2583041
Title :
An Extended Vector Space Model for XML Information Retrieval
Author :
Guo Yongming ; Chen Dehua ; Le Jiajin
Author_Institution :
Coll. of Inf. Sci. & Technol., Donghua Univ., Shanghai
fYear :
2009
fDate :
23-25 Jan. 2009
Firstpage :
797
Lastpage :
800
Abstract :
With the growing number of XML documents, retrieving information from them effectively and efficiently has become an active research area. Because XML documents lie between structured and unstructured data and describe both content and structure, effective and efficient retrieval from them is a major challenge. This paper develops a novel retrieval model, the extended vector space model, which combines XPath with the vector space model for XML information retrieval. A prototype XML information retrieval system based on this model has been implemented, and several corresponding algorithms are introduced. Experiments show that the model improves recall and precision.
Keywords :
XML; information retrieval; XML documents; XML information retrieval; XPath; extended vector space model; retrieval model; structured data; unstructured data; Books; Content based retrieval; Data mining; Database languages; Educational institutions; Prototypes; Q measurement; Space technology
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Knowledge Discovery and Data Mining, 2009. WKDD 2009. Second International Workshop on
Conference_Location :
Moscow
Print_ISBN :
978-0-7695-3543-2
Type :
conf
DOI :
10.1109/WKDD.2009.218
Filename :
4772056