DocumentCode :
1305863
Title :
Efficient top-k algorithm for eXtensible Markup Language keyword search
Author :
Yu, Haoyong ; Deng, Zhi-Hong ; Gao, Nan
Author_Institution :
Key Lab. of Machine Perception (Minist. of Educ.), Peking Univ., Beijing, China
Volume :
6
Issue :
4
fYear :
2012
fDate :
8/1/2012 12:00:00 AM
Firstpage :
342
Lastpage :
349
Abstract :
The ability to compute top-k matches to eXtensible Markup Language (XML) queries is gaining importance owing to the increasing of large XML repositories. Current work on top-k match to XML queries mainly focuses on employing XPath, XQuery or NEXI as the query language, whereas little work has concerned on top-k match to XML keyword search. In this study, the authors propose a novel two-layer-based index construction and associated algorithm for efficiently computing top-k results for XML keyword search. Our core contribution, the two-layer-based inverted Index and associated algorithm for XML keyword search take both score-sorted-sequence and Dewey ID-sorted-sequence into consideration, and thus gain performance benefits during querying process. The authors have conducted expensive experiments and our experimental results show efficiency advantages compared with existing approaches.
Keywords :
XML; query languages; query processing; Dewey ID-sorted-sequence; NEXI; XML keyword search; XML queries; XML repositories; XPath; XQuery; extensible markup language keyword search; query language; querying process; score-sorted-sequence; top-k algorithm; two-layer-based index construction; two-layer-based inverted Index;
fLanguage :
English
Journal_Title :
Software, IET
Publisher :
iet
ISSN :
1751-8806
Type :
jour
DOI :
10.1049/iet-sen.2011.0082
Filename :
6322856
Link To Document :
بازگشت