DocumentCode :
3106695
Title :
A Modified System for Weblog Topic Relevance Retrieval
Author :
Li, Si ; Du, Lei ; Xu, Weiran ; Guo, Jun
Author_Institution :
Pattern Recognition & Intell. Syst. Lab., Beijing Univ. of Posts & Telecommun., Beijing, China
fYear :
2009
fDate :
13-14 Dec. 2009
Firstpage :
392
Lastpage :
395
Abstract :
Weblog is widely used, and the number of users is increasing rapidly. Weblog reflects every aspect of the society, such as politics, economy and culture, so the topic relevance retrieval research on Weblog becomes necessary. Because of a lot of noise in the corpus and it is usually difficult to obtain the appropriate query, the common methods sometimes fail to reach an acceptable precision. We design a Modified Topic Relevance Retrieval System (MTRRS) containing query formulation and a combination model. To design the query, manual adjustment and machine learning are used. During the machine learning processing, we define a center word list which helps to generate a novel distance feature. The result can be improved 22.97% on MAP by query formulation. The results of document retrieval model and passage retrieval model are combined. 33.55% increase on MAP can be received. Also by using the combination model, the retrieval result of the semi-machine learning query is closely approaching the manually adjusted result.
Keywords :
Web sites; learning (artificial intelligence); query formulation; Weblog topic relevance retrieval; combination model; document retrieval model; machine learning; modified topic relevance retrieval system; passage retrieval model; query formulation; Conference management; Content based retrieval; Feedback; Information retrieval; Information services; Information technology; Internet; Machine learning; Search engines; Web sites; Combination Model; Query Formulation; Topic Relevance Retrieval;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Future Information Technology and Management Engineering, 2009. FITME '09. Second International Conference on
Conference_Location :
Sanya
Print_ISBN :
978-1-4244-5339-9
Type :
conf
DOI :
10.1109/FITME.2009.104
Filename :
5381010
Link To Document :
بازگشت