• DocumentCode
    2500816
  • Title
    A simple approach for Monolingual Event Tracking system in Bengali
  • Author
    Kolya, Anup Kumar ; Ekbal, Asif ; Bandyopadhyay, Sivaji
  • Author_Institution
    Dept. of Comput. Sci. & Eng., Jadavpur Univ., Kolkata, India
  • fYear
    2009
  • fDate
    20-22 Oct. 2009
  • Firstpage
    48
  • Lastpage
    53
  • Abstract
    Real-world applications must consider both topics and sentiments to measure opinion precisely. Aligning topics with sentiments is crucial for opinion retrieval, extraction, categorization, and aggregation across issues. In this paper, we report a monolingual event (or topic) tracking system for Bengali. The system is built on a newspaper corpus collected from the Web archive of a leading Bengali newspaper. Its goal is to determine whether two news documents within a range of dates describe the same event. An event is a vector consisting of person, location, organization, title, and date, and a news document is described as a collection of such event vectors. Two news documents are judged to describe the same event when the number of matching event vectors reaches at least a chosen threshold. Any news document of a given date can be selected as the initial story; all news documents from the preceding 15 days and the following 15 days are treated as the target stories (or documents). Evaluation yields recall and precision of 58.93% and 84.62%, respectively. Future work will examine interactions between topics and their associated sentiments.
  • Keywords
    Internet; document handling; electronic publishing; information retrieval; natural languages; tracking; Bengali newspaper; Web archive; categorization; event vector; monolingual event tracking system; new document match; opinion extraction; opinion measurement; opinion retrieval; topic-sentiment alignment; Application software; Blogs; Computer science; Discussion forums; Ecosystems; Event detection; Information analysis; Motion pictures; Natural language processing; Thumb
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Natural Language Processing, 2009. SNLP '09. Eighth International Symposium on
  • Conference_Location
    Bangkok
  • Print_ISBN
    978-1-4244-4138-9
  • Electronic_ISBN
    978-1-4244-4139-6
  • Type
    conf
  • DOI
    10.1109/SNLP.2009.5340908
  • Filename
    5340908
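
The tracking rule summarized in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the names `EventVector`, `NewsDocument`, and `track` are hypothetical, two event vectors are assumed to match only when all five fields are equal (the abstract does not define the matching criterion), and the default threshold of 2 is an arbitrary placeholder, since the abstract does not report the value used.

```python
from dataclasses import dataclass
from datetime import date, timedelta


@dataclass(frozen=True)
class EventVector:
    """One event: the five fields listed in the abstract (hypothetical layout)."""
    person: str
    location: str
    organization: str
    title: str
    date: str


@dataclass
class NewsDocument:
    """A news document described as a collection of event vectors."""
    doc_id: str
    published: date
    events: frozenset  # frozenset of EventVector


def matching_events(a: NewsDocument, b: NewsDocument) -> int:
    # Assumption: vectors match only on exact equality of all five fields.
    return len(a.events & b.events)


def track(initial: NewsDocument, candidates, threshold: int = 2,
          window_days: int = 15):
    """Return ids of target stories judged to describe the same event.

    A candidate qualifies if it was published within `window_days` before
    or after the initial story and shares at least `threshold` event
    vectors with it. The threshold default is a placeholder value.
    """
    window = timedelta(days=window_days)
    return [
        doc.doc_id
        for doc in candidates
        if doc.doc_id != initial.doc_id
        and abs(doc.published - initial.published) <= window
        and matching_events(initial, doc) >= threshold
    ]
```

Under these assumptions, raising `threshold` would be expected to trade recall for precision, because fewer target stories pass the match test. A real system would likely need a softer field-level comparison than exact equality.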