• DocumentCode
    2034189
  • Title

    A new indexing strategy for XML keyword search

  • Author

    Xiang, Yongqing ; Deng, Zhihong ; Yu, Hang ; Wang, Sijing ; Gao, Ning

  • Author_Institution
    Key Lab. of Machine Perception (Minist. of Educ.), Peking Univ., Beijing, China
  • Volume
    5
  • fYear
    2010
  • fDate
    10-12 Aug. 2010
  • Firstpage
    2412
  • Lastpage
    2416
  • Abstract
    With the rapid increase of XML documents on the web, how to index, store and retrieve these documents has become a very popular and valuable problem. At present, there are two normal ways of retrieving XML documents. One is structure-based retrieval; the other is keyword-based retrieval. However, XML keyword search is becoming more and more popular because it is easy to master and manipulate. In XML keyword search system, a key problem is how to store the structure information into XML indices efficiently. At present, Dewey numbers are often used to label XML nodes in XML indices. However, Dewey numbers may lead to redundancy in XML indices. In this paper, we propose a new labeling method called LAF numbers for XML indices and we device a new indexing structure called Two-Layer index for XML keyword retrieval systems. At last, we have conducted an extensive experimental study and the experimental results show that our indexing method achieves better space efficiency than prevailing Dewey-number-based indexing method.
  • Keywords
    XML; digital arithmetic; indexing; information retrieval systems; redundancy; semantic Web; Dewey numbers; LAF numbers; XML documents; document retrieval; indexing; keyword retrieval systems; keyword search system; redundancy; structure based retrieval; two-layer index; web; Encoding; Indexing; Keyword search; Labeling; Redundancy; XML; Dewey numbers; Indexing; LAF numbers; Two-Layer; XML keyword Search;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Fuzzy Systems and Knowledge Discovery (FSKD), 2010 Seventh International Conference on
  • Conference_Location
    Yantai, Shandong
  • Print_ISBN
    978-1-4244-5931-5
  • Type

    conf

  • DOI
    10.1109/FSKD.2010.5569522
  • Filename
    5569522