  • DocumentCode
    570182
  • Title
    Finding story chains in newswire articles
  • Author
    Zhu, Xianshu; Oates, Tim
  • Author_Institution
    CSEE Dept., Univ. of Maryland, Baltimore, MD, USA
  • fYear
    2012
  • fDate
    8-10 Aug. 2012
  • Firstpage
    93
  • Lastpage
    100
  • Abstract
    Massive amounts of information about news events are published on the Internet every day in online newspapers, blogs, and social network messages. While search engines like Google help retrieve information using keywords, the large volumes of unstructured search results returned by search engines make it hard to track the evolution of an event. A story chain is composed of a set of news articles that reveal hidden relationships among different events. Traditional keyword-based search engines provide limited support for finding story chains. In this paper, we propose a random walk based algorithm to find story chains. When breaking news happens, many media outlets report the same event. We have two pruning mechanisms in the algorithm to automatically exclude redundant articles from the story chain and to ensure efficiency of the algorithm. Experimental results show that our proposed algorithm can generate coherent story chains without redundancy.
  • Keywords
    information retrieval; search engines; social networking (online); Google; Internet; blogs; keyword-based search engines; newswire articles; online newspapers; pruning mechanisms; random walk based algorithm; social network messages; story chains; Bipartite graph; Coherence; Earthquakes; Hurricanes; Joining processes; Redundancy; Search engines
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Information Reuse and Integration (IRI), 2012 IEEE 13th International Conference on
  • Conference_Location
    Las Vegas, NV
  • Print_ISBN
    978-1-4673-2282-9
  • Electronic_ISBN
    978-1-4673-2283-6
  • Type
    conf
  • DOI
    10.1109/IRI.2012.6302996
  • Filename
    6302996