DocumentCode
1300627
Title
Returning Clustered Results for Keyword Search on XML Documents
Author
Liu, Xiping ; Wan, Changxuan ; Chen, Lei
Author_Institution
Sch. of Inf. Technol., Jiangxi Univ. of Finance & Econ., Nanchang, China
Volume
23
Issue
12
fYear
2011
Firstpage
1811
Lastpage
1825
Abstract
Keyword search is an effective paradigm for information discovery and has been introduced recently to query XML documents. In this paper, we address the problem of returning clustered results for keyword search on XML documents. We first propose a novel semantics for answers to an XML keyword query. The core of the semantics is the conceptually related relationship between keyword matches, which is based on the conceptual relationship between nodes in XML trees. Then, we propose a new clustering methodology for XML search results, which clusters results according to the way they match the given query. Two approaches to implement the methodology are discussed. The first approach is a conventional one which does clustering after search results are retrieved; the second one clusters search results actively, which has characteristics of clustering on the fly. The generated clusters are then organized into a cluster hierarchy with different granularities to enable users locate the results of interest easily and precisely. Experimental results demonstrate the meaningfulness of the proposed semantics as well as the efficiency of the proposed methods.
Keywords
XML; query processing; XML document query; XML keyword query; XML trees; eXtensible Markup Language; information discovery; keyword search; Cloud computing; Clustering algorithms; Databases; Keyword search; Pattern matching; Search methods; Semantics; XML; information retrieval; XML keyword search; cluster hierarchy.; search results clustering;
fLanguage
English
Journal_Title
Knowledge and Data Engineering, IEEE Transactions on
Publisher
ieee
ISSN
1041-4347
Type
jour
DOI
10.1109/TKDE.2011.183
Filename
5989812
Link To Document