DocumentCode :
2832463
Title :
Parallel processing the keyword search in uncertain environment
Author :
Ning, Bo ; Zhou, Xiaoping ; Shi, Yimin
Author_Institution :
Sch. of Inf. Sci. & Technol., Dalian Maritime Univ., Dalian, China
fYear :
2012
fDate :
June 30 2012-July 2 2012
Firstpage :
409
Lastpage :
414
Abstract :
XML is a natural way to express real-world uncertainty, so data in an uncertain environment can be stored in XML format. To improve the efficiency of keyword search in an uncertain environment, we index the XML elements with Dewey codes, a prefix-based encoding method. With big data, the elements' Dewey codes become quite long, which makes judging the relationships among elements inefficient and requires a large amount of storage space. Thus, large XML data and complicated XML schemas are the bottlenecks of keyword search. In this paper, we incorporate the map-reduce mechanism to manage the uncertain data in partitions, and design a parallel method for information retrieval. The XML fragments are stored across a distributed network and can be processed in parallel to retrieve the Smallest Lowest Common Ancestors (SLCAs) and return the k results with the largest probability values. Our experimental results show that the approach improves the efficiency of executing keyword search in parallel.
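The prefix property of Dewey codes mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the dotted string form of the codes, and the brute-force SLCA search are assumptions made for illustration, and both the probabilistic ranking and the map-reduce partitioning are left out.

```python
from itertools import product


def parse_dewey(code):
    """Turn a dotted Dewey code such as "1.2.1" into a tuple of ints."""
    return tuple(int(part) for part in code.split("."))


def is_ancestor(a, b):
    """True if node a is a proper ancestor of node b (a is a strict prefix of b)."""
    return len(a) < len(b) and b[:len(a)] == a


def lca(a, b):
    """Lowest common ancestor of two nodes: the longest common prefix of their codes."""
    prefix = []
    for x, y in zip(a, b):
        if x != y:
            break
        prefix.append(x)
    return tuple(prefix)


def slcas(keyword_lists):
    """Brute-force SLCA search (illustrative only, exponential in the number of keywords).

    keyword_lists holds one list of Dewey codes per keyword. Every combination
    that takes one node per keyword yields an LCA; the SLCAs are the LCAs that
    have no other LCA as a descendant.
    """
    candidates = set()
    for combo in product(*keyword_lists):
        node = combo[0]
        for other in combo[1:]:
            node = lca(node, other)
        candidates.add(node)
    return sorted(
        c for c in candidates
        if not any(is_ancestor(c, d) for d in candidates)
    )


# Hypothetical occurrences of two keywords in one XML tree rooted at "1".
xml_nodes = [parse_dewey(c) for c in ("1.1.2", "1.2.1")]
search_nodes = [parse_dewey(c) for c in ("1.1.3", "1.2.2.1", "1.3")]
result = slcas([xml_nodes, search_nodes])
print(result)
```

Each relationship test costs time proportional to the code length, which is why long Dewey codes in deep or large documents slow down the judgment of element relationships, as the abstract notes.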
Keywords :
XML; information retrieval; parallel processing; probability; search problems; SLCA; XML data; XML fragments; XML schema; dewey code; distributed network; information retrieval; keyword search; parallel processing; prefix based encoding method; probabilistic values; smallest lowest common ancestors; uncertain environment; Algorithm design and analysis; Data models; Indexes; Keyword search; Partitioning algorithms; Probabilistic logic; XML; Keyword Search; Parallel Processing; SLCA; Uncertain Data;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
System Science and Engineering (ICSSE), 2012 International Conference on
Conference_Location :
Dalian, Liaoning
Print_ISBN :
978-1-4673-0944-8
Electronic_ISBN :
978-1-4673-0943-1
Type :
conf
DOI :
10.1109/ICSSE.2012.6257218
Filename :
6257218