• DocumentCode
    3043302
  • Title

    Learning to Rank Using Semantic Features in Document Retrieval

  • Author

    Weixin, Tian ; Fuxi, Zhu

  • Author_Institution
    Comput. Sch., Wuhan Univ., Wuhan, China
  • Volume
    3
  • fYear
    2009
  • fDate
    19-21 May 2009
  • Firstpage
    500
  • Lastpage
    504
  • Abstract
    This paper describes an approach to retrieve documents adopting machine learning method. The application of machine learning to document retrieval, which can be so called ldquolearning to rankrdquo, has been a hot research topic in the information retrieval and machine learning communities recently. One of the characters that discriminates this work from other studies is the use of semantic features while classifying the documents. Firstly, we extract the semantic structures from collection and index documents on these structures. Later, we construct a SVM classifier and generate it using pairwise training data. Lastly, we use this SVM to judge the relevance of documents according to given queries. In the extracting of semantic structures phrase, an analysis system that we developed previously is used. The analysis system is based on modifying relations (MR) and on the help of a knowledge base called MRKB. We give a general introduction about the system in this paper. The experiment is done on the benchmark dataset OHSUMED, and the experimental results shows that the proposed method outperforms other approaches based barely on string frequency.
  • Keywords
    classification; document handling; indexing; information retrieval; learning (artificial intelligence); support vector machines; SVM classifier; document classification; document indexing; document retrieval; information retrieval; knowledge base; machine learning; pairwise training data; semantic features; semantic structures phrase; string frequency; support vector machine; Data mining; Educational institutions; Frequency; Information retrieval; Information technology; Intelligent systems; Learning systems; Machine learning; Support vector machine classification; Support vector machines; Learning to Rank; Modifying Relation; Semantic Features;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Systems, 2009. GCIS '09. WRI Global Congress on
  • Conference_Location
    Xiamen
  • Print_ISBN
    978-0-7695-3571-5
  • Type

    conf

  • DOI
    10.1109/GCIS.2009.148
  • Filename
    5209100