• DocumentCode
    2477376
  • Title

    A New Method to Compute the Word Relevance in News Corpus

  • Author

    Liu Jinpan ; He Liang ; Lin Xin ; Xu Mingmin ; Lu Wei

  • Author_Institution
    Dept. of Comput. Sci. & Technol., East China Normal Univ., Shanghai, China
  • fYear
    2010
  • fDate
    22-23 May 2010
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    In this paper we propose a new method for computing the relevance of terms in a news corpus. Based on the characteristics of news corpora, we first propose dividing the corpus into different channels. Second, exploiting the structure of news documents, we divide term co-occurrence into two cases: co-occurrence in the news title and co-occurrence in the news text, and we compute each with a different method. Finally, we introduce the web corpus Wikipedia to overcome some shortcomings of the news corpus.
  • Keywords
    natural language processing; text analysis; word processing; Wikipedia; news corpus; term cooccurrence; word combination; Computer applications; Computer science; Data mining; Helium; Information retrieval; Lighting; Ontologies; Rockets; TV
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Intelligent Systems and Applications (ISA), 2010 2nd International Workshop on
  • Conference_Location
    Wuhan
  • Print_ISBN
    978-1-4244-5872-1
  • Electronic_ISBN
    978-1-4244-5874-5
  • Type

    conf

  • DOI
    10.1109/IWISA.2010.5473239
  • Filename
    5473239