DocumentCode :
169178
Title :
TSIR: A Chinese Temporal semantics Information Retrieval system based on MapReduce
Author :
Zhongmei Shu ; Yayao Zuo ; Yong Tang
Author_Institution :
Sch. of Educ., Sun Yat-sen Univ., Guangzhou, China
fYear :
2014
fDate :
21-23 May 2014
Firstpage :
259
Lastpage :
264
Abstract :
The significance of time in information production and consumption has been recognised in information retrieval research. Temporal information plays an important role in the webpage retrieval. The webpage has both the temporal metadata and temporal semantics in the content. However, the existing search engines conduct the information retrieval based on text keywords rather than temporal semantics. To address this issue, a Temporal semantics Information Retrieval (TSIR) System is proposed to deal with the Chinese temporal information retrieval. The TSIR system is deployed on Hadoop and implemented by the means of MapReduce. Firstly, the Chinese temporal regular expression rule is introduced to extract the explicit and implicit temporal phrases in the query keywords and webpages. Secondly, the scores of webpages are re-evaluated by taking text relevance and temporal semantics relevance into account and the returned results are ranked according to re-evaluation. Experiment shows that TSIR system could precisely and effectively match the keywords queries related to temporal expression.
Keywords :
Internet; information retrieval systems; meta data; natural language processing; parallel programming; public domain software; query processing; text analysis; Chinese temporal regular expression rule; Chinese temporal semantic information retrieval system; Hadoop; MapReduce; TSIR system; Web page retrieval; explicit temporal phrase extraction; implicit temporal phrase extraction; information consumption; information production; information retrieval research; query keywords; temporal metadata; temporal semantics relevance; text relevance; Data mining; Earthquakes; Educational institutions; Search engines; Semantics; Standards; Chinese temporal semantics; Information retrieval; MapReduce; Temporal ranking;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Supported Cooperative Work in Design (CSCWD), Proceedings of the 2014 IEEE 18th International Conference on
Conference_Location :
Hsinchu
Type :
conf
DOI :
10.1109/CSCWD.2014.6846852
Filename :
6846852
Link To Document :
بازگشت