DocumentCode
3106695
Title
A Modified System for Weblog Topic Relevance Retrieval
Author
Li, Si ; Du, Lei ; Xu, Weiran ; Guo, Jun
Author_Institution
Pattern Recognition & Intell. Syst. Lab., Beijing Univ. of Posts & Telecommun., Beijing, China
fYear
2009
fDate
13-14 Dec. 2009
Firstpage
392
Lastpage
395
Abstract
Weblog is widely used, and the number of users is increasing rapidly. Weblog reflects every aspect of the society, such as politics, economy and culture, so the topic relevance retrieval research on Weblog becomes necessary. Because of a lot of noise in the corpus and it is usually difficult to obtain the appropriate query, the common methods sometimes fail to reach an acceptable precision. We design a Modified Topic Relevance Retrieval System (MTRRS) containing query formulation and a combination model. To design the query, manual adjustment and machine learning are used. During the machine learning processing, we define a center word list which helps to generate a novel distance feature. The result can be improved 22.97% on MAP by query formulation. The results of document retrieval model and passage retrieval model are combined. 33.55% increase on MAP can be received. Also by using the combination model, the retrieval result of the semi-machine learning query is closely approaching the manually adjusted result.
Keywords
Web sites; learning (artificial intelligence); query formulation; Weblog topic relevance retrieval; combination model; document retrieval model; machine learning; modified topic relevance retrieval system; passage retrieval model; query formulation; Conference management; Content based retrieval; Feedback; Information retrieval; Information services; Information technology; Internet; Machine learning; Search engines; Web sites; Combination Model; Query Formulation; Topic Relevance Retrieval;
fLanguage
English
Publisher
ieee
Conference_Titel
Future Information Technology and Management Engineering, 2009. FITME '09. Second International Conference on
Conference_Location
Sanya
Print_ISBN
978-1-4244-5339-9
Type
conf
DOI
10.1109/FITME.2009.104
Filename
5381010
Link To Document