• DocumentCode
    3106695
  • Title

    A Modified System for Weblog Topic Relevance Retrieval

  • Author

    Li, Si ; Du, Lei ; Xu, Weiran ; Guo, Jun

  • Author_Institution
    Pattern Recognition & Intell. Syst. Lab., Beijing Univ. of Posts & Telecommun., Beijing, China
  • fYear
    2009
  • fDate
    13-14 Dec. 2009
  • Firstpage
    392
  • Lastpage
    395
  • Abstract
    Weblog is widely used, and the number of users is increasing rapidly. Weblog reflects every aspect of the society, such as politics, economy and culture, so the topic relevance retrieval research on Weblog becomes necessary. Because of a lot of noise in the corpus and it is usually difficult to obtain the appropriate query, the common methods sometimes fail to reach an acceptable precision. We design a Modified Topic Relevance Retrieval System (MTRRS) containing query formulation and a combination model. To design the query, manual adjustment and machine learning are used. During the machine learning processing, we define a center word list which helps to generate a novel distance feature. The result can be improved 22.97% on MAP by query formulation. The results of document retrieval model and passage retrieval model are combined. 33.55% increase on MAP can be received. Also by using the combination model, the retrieval result of the semi-machine learning query is closely approaching the manually adjusted result.
  • Keywords
    Web sites; learning (artificial intelligence); query formulation; Weblog topic relevance retrieval; combination model; document retrieval model; machine learning; modified topic relevance retrieval system; passage retrieval model; query formulation; Conference management; Content based retrieval; Feedback; Information retrieval; Information services; Information technology; Internet; Machine learning; Search engines; Web sites; Combination Model; Query Formulation; Topic Relevance Retrieval;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Future Information Technology and Management Engineering, 2009. FITME '09. Second International Conference on
  • Conference_Location
    Sanya
  • Print_ISBN
    978-1-4244-5339-9
  • Type

    conf

  • DOI
    10.1109/FITME.2009.104
  • Filename
    5381010