DocumentCode
2034189
Title
A new indexing strategy for XML keyword search
Author
Xiang, Yongqing ; Deng, Zhihong ; Yu, Hang ; Wang, Sijing ; Gao, Ning
Author_Institution
Key Lab. of Machine Perception (Minist. of Educ.), Peking Univ., Beijing, China
Volume
5
fYear
2010
fDate
10-12 Aug. 2010
Firstpage
2412
Lastpage
2416
Abstract
With the rapid increase of XML documents on the web, how to index, store and retrieve these documents has become a very popular and valuable problem. At present, there are two normal ways of retrieving XML documents. One is structure-based retrieval; the other is keyword-based retrieval. However, XML keyword search is becoming more and more popular because it is easy to master and manipulate. In XML keyword search system, a key problem is how to store the structure information into XML indices efficiently. At present, Dewey numbers are often used to label XML nodes in XML indices. However, Dewey numbers may lead to redundancy in XML indices. In this paper, we propose a new labeling method called LAF numbers for XML indices and we device a new indexing structure called Two-Layer index for XML keyword retrieval systems. At last, we have conducted an extensive experimental study and the experimental results show that our indexing method achieves better space efficiency than prevailing Dewey-number-based indexing method.
Keywords
XML; digital arithmetic; indexing; information retrieval systems; redundancy; semantic Web; Dewey numbers; LAF numbers; XML documents; document retrieval; indexing; keyword retrieval systems; keyword search system; redundancy; structure based retrieval; two-layer index; web; Encoding; Indexing; Keyword search; Labeling; Redundancy; XML; Dewey numbers; Indexing; LAF numbers; Two-Layer; XML keyword Search;
fLanguage
English
Publisher
ieee
Conference_Titel
Fuzzy Systems and Knowledge Discovery (FSKD), 2010 Seventh International Conference on
Conference_Location
Yantai, Shandong
Print_ISBN
978-1-4244-5931-5
Type
conf
DOI
10.1109/FSKD.2010.5569522
Filename
5569522
Link To Document