• DocumentCode
    2025295
  • Title

    Analyzing Multiple News Sites by Contrasting Articles

  • Author

    Yoshioka, Masaharu

  • Author_Institution
    Grad. Sch. of Inf. Sci. & Technol., Hokkaido Univ., Sapporo
  • fYear
    2008
  • fDate
    Nov. 30 2008-Dec. 3 2008
  • Firstpage
    45
  • Lastpage
    51
  • Abstract
    Today, there is access to large numbers of news sites in different countries, and there are several experimental systems, such as Newsblaster and NewsExplorer, that integrate news articles about a particular event from multiple news sites. These systems enable a good understanding of particular events by using multiple news sites, but they ignore the characteristics of the various news sites. To characterize the differences between news sites, the News Site Contrast (NSContrast) system has been proposed. This system analyzes multiple news sites using the concept of contrast- set mining. However, NSContrast has only limited analysis functions and is not mature enough for evaluation via user experiments. Therefore, in this paper, we analyze contrast set mining results for NSContrast, aiming to understand the requirements for extracting useful information that also reflects the interests of different countries. Based on this analysis, a new NSContrast is introduced and applied to a news article database constructed from multiple news sites in different countries for user experimentation.
  • Keywords
    Web sites; data mining; NSContrast; contrast-set mining analysis; news Web sites; news article database; news site contrast system; Broadcasting; Data mining; Databases; Image analysis; Information analysis; Information science; Internet; Natural languages; Performance analysis; Signal analysis; Contrast set mining; Text mining;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Image Technology and Internet Based Systems, 2008. SITIS '08. IEEE International Conference on
  • Conference_Location
    Bali
  • Print_ISBN
    978-0-7695-3493-0
  • Type

    conf

  • DOI
    10.1109/SITIS.2008.42
  • Filename
    4725785