• DocumentCode
    3143335
  • Title
    XClean: Providing valid spelling suggestions for XML keyword queries
  • Author
    Lu, Yifei ; Wang, Wei ; Li, Jianxin ; Liu, Chengfei
  • Author_Institution
    Univ. of New South Wales, Sydney, NSW, Australia
  • fYear
    2011
  • fDate
    11-16 April 2011
  • Firstpage
    661
  • Lastpage
    672
  • Abstract
    An important facility to aid keyword search on XML data is suggesting alternative queries when user queries contain typographical errors. Query suggestion can thus improve users' search experience by avoiding empty results or results of poor quality. In this paper, we study the problem of effectively and efficiently providing quality query suggestions for keyword queries on an XML document. We illustrate certain biases in previous work and propose a principled and general framework, XClean, based on the state-of-the-art language model. Compared with previous methods, XClean can accommodate different error models and XML keyword query semantics without losing rigor. Algorithms have been developed that compute the top-k suggestions efficiently. We performed an extensive experimental study using two large-scale real datasets. The experimental results demonstrate the effectiveness and efficiency of the proposed methods.
  • Keywords
    XML; query processing; XClean; XML document; XML keyword query semantics; keyword search; quality query suggestions; top-k suggestions; valid spelling suggestion; Algorithm design and analysis; Cleaning; Databases; Insurance; Probabilistic logic; Vocabulary
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Data Engineering (ICDE), 2011 IEEE 27th International Conference on
  • Conference_Location
    Hannover
  • ISSN
    1063-6382
  • Print_ISBN
    978-1-4244-8959-6
  • Electronic_ISBN
    1063-6382
  • Type
    conf
  • DOI
    10.1109/ICDE.2011.5767847
  • Filename
    5767847