• DocumentCode
    651998
  • Title

    Multi-lingual Analysis of Future-Related Information on the Web

  • Author

    Jatowt, Adam ; Kawai, Hiroyuki ; Kanazawa, Kenji ; Tanaka, Kiyoshi ; Kunieda, Kazuo ; Yamada, Koji

  • Author_Institution
    Kyoto Univ., Kyoto, Japan
  • fYear
    2013
  • fDate
    16-18 Sept. 2013
  • Firstpage
    27
  • Lastpage
    32
  • Abstract
    Future prediction is one of the crucial activities of humans. In this paper, we report the results of exploratory analysis of future-related information on the Web in three different languages: English, Japanese and Polish. We focus on the future-related information which is grounded in time, that is, the information on events whose expected occurrence dates are already known. Our datasets are constructed by crawling search engine indices. We investigate multiple aspects of future-related information in web pages across different languages such as its amount, time span, topics, associated sentiment levels as well as the relation to the future-related content in news articles.
  • Keywords
    Web sites; information retrieval; natural language processing; search engines; text analysis; English; Japanese; Polish; Web page; crawling search engine indices; future related information prediction; multilingual analysis; news article; Forecasting; Market research; Meteorology; Search engines; Sociology; Statistics; Web pages; collective predictions; future-related information; multi-lingual analysis; opinion analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Culture and Computing (Culture Computing), 2013 International Conference on
  • Conference_Location
    Kyoto
  • Type

    conf

  • DOI
    10.1109/CultureComputing.2013.13
  • Filename
    6680326