• DocumentCode
    3155357
  • Title

    An Evolutionary-Based Method for Reconstructing Conversation Threads in Email Corpora

  • Author

    Dehghani, Mohammad ; Asadpour, Mahdi ; Shakery, A.

  • Author_Institution
    Intell. Inf. Syst. Lab., Univ. of Tehran, Tehran, Iran
  • fYear
    2012
  • fDate
    26-29 Aug. 2012
  • Firstpage
    1132
  • Lastpage
    1137
  • Abstract
    Email is a type of Web data which is produced in enormous quantities. It is beneficial to detect conversation threads contained in the email corpora for various applications, including discussion search, expert finding and even email clustering and classification. Conversation thread in email corpora can be defined as a cluster of exchanged emails among the same group of people by reply or forwarding on the same topic. According to this definition, we can define parent-child relation between emails, so email conversation threads seem to demonstrate tree structure. This paper presents a new approach based on genetic programming for reconstruction of conversation threads in emails data. This approach considers finding email conversation threads as an optimization problem, and exploits genetic programming to search intelligently in the space of possible solutions. Rather than several studies that have been conducted on this problem, this work concentrates on detecting accurate structure of conversation threads in high recall. This paper provides a comprehensive evaluation on the BC3 data set. Preliminary results suggest that our method provides acceptable precision and higher recall than existing methods.
  • Keywords
    Internet; electronic mail; genetic algorithms; pattern classification; pattern clustering; BC3 data set; Web data; conversation thread reconstruction; discussion search; email classification; email clustering; email corpora; evolutionary-based method; expert finding; genetic programming; optimization problem; parent-child relation; Biological cells; Educational institutions; Electronic mail; Social network services; Sociology; Statistics; conversation; email; emails thread; genetic programming;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Advances in Social Networks Analysis and Mining (ASONAM), 2012 IEEE/ACM International Conference on
  • Conference_Location
    Istanbul
  • Print_ISBN
    978-1-4673-2497-7
  • Type

    conf

  • DOI
    10.1109/ASONAM.2012.195
  • Filename
    6425605