Title :
An Evolutionary-Based Method for Reconstructing Conversation Threads in Email Corpora
Author :
Dehghani, Mohammad ; Asadpour, Mahdi ; Shakery, A.
Author_Institution :
Intell. Inf. Syst. Lab., Univ. of Tehran, Tehran, Iran
Abstract :
Email is a type of Web data that is produced in enormous quantities. Detecting the conversation threads in an email corpus benefits various applications, including discussion search, expert finding, and email clustering and classification. A conversation thread in an email corpus can be defined as a cluster of emails exchanged among the same group of people, by reply or forwarding, on the same topic. Under this definition, a parent-child relation can be defined between emails, so email conversation threads can be viewed as trees. This paper presents a new approach, based on genetic programming, for reconstructing conversation threads in email data. The approach treats finding email conversation threads as an optimization problem and uses genetic programming to search the space of possible solutions intelligently. Unlike several previous studies of this problem, this work concentrates on detecting the accurate structure of conversation threads with high recall. The paper provides a comprehensive evaluation on the BC3 data set. Preliminary results suggest that the method achieves acceptable precision and higher recall than existing methods.
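The idea of treating thread reconstruction as an optimization over parent-child links can be illustrated with a minimal sketch. It is not the authors' method: the abstract does not specify their representation or fitness function, so everything here is an assumption made for illustration. The sketch uses a plain genetic algorithm over parent vectors rather than genetic programming proper, and the names Email, normalize, fitness, mutate, crossover and evolve, along with the scoring rule (shared subject plus sender among the parent's recipients), are hypothetical.

```python
import random
from dataclasses import dataclass


@dataclass(frozen=True)
class Email:
    sender: str
    recipients: frozenset
    subject: str
    timestamp: int


def normalize(subject):
    """Lower-case a subject and strip leading reply/forward prefixes."""
    s = subject.lower().strip()
    while s.startswith(("re:", "fw:", "fwd:")):
        s = s.split(":", 1)[1].strip()
    return s


def fitness(parents, emails):
    """Hypothetical score (an assumption, not from the paper): each link
    earns one point for a matching subject and one point if the child's
    sender was a recipient of the parent."""
    score = 0.0
    for child, parent in enumerate(parents):
        if parent is None:
            continue
        c, p = emails[child], emails[parent]
        if normalize(c.subject) == normalize(p.subject):
            score += 1.0
        if c.sender in p.recipients:
            score += 1.0
    return score


def random_parents(n, rng):
    # Emails are sorted by time, so a parent index is always smaller than
    # the child index. This keeps every candidate a forest of trees.
    return [rng.choice([None] + list(range(i))) for i in range(n)]


def mutate(parents, rng):
    """Reassign the parent of one randomly chosen email."""
    child = rng.randrange(len(parents))
    new = list(parents)
    new[child] = rng.choice([None] + list(range(child)))
    return new


def crossover(a, b, rng):
    """Uniform crossover: take each email's parent from either candidate."""
    return [rng.choice(pair) for pair in zip(a, b)]


def evolve(emails, generations=200, pop_size=30, seed=0):
    """Return the time-sorted emails and the best parent vector found."""
    rng = random.Random(seed)
    emails = sorted(emails, key=lambda e: e.timestamp)
    pop = [random_parents(len(emails), rng) for _ in range(pop_size)]
    for _ in range(generations):
        pop.sort(key=lambda ind: fitness(ind, emails), reverse=True)
        elite = pop[: pop_size // 2]  # keep the better half unchanged
        children = [
            mutate(crossover(rng.choice(elite), rng.choice(elite), rng), rng)
            for _ in range(pop_size - len(elite))
        ]
        pop = elite + children
    best = max(pop, key=lambda ind: fitness(ind, emails))
    return emails, best


if __name__ == "__main__":
    corpus = [
        Email("alice", frozenset({"bob"}), "Budget", 1),
        Email("bob", frozenset({"alice"}), "Re: Budget", 2),
        Email("carol", frozenset({"dave"}), "Lunch", 3),
    ]
    ordered, parents = evolve(corpus)
    for i, p in enumerate(parents):
        print(ordered[i].subject, "<-", None if p is None else ordered[p].subject)
```

Each candidate is a list in which entry i holds the index of email i's parent, or None for a thread root. Restricting parents to earlier emails guarantees a cycle-free forest, so mutation and crossover never need a repair step.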
Keywords :
Internet; electronic mail; genetic algorithms; pattern classification; pattern clustering; BC3 data set; Web data; conversation thread reconstruction; discussion search; email classification; email clustering; email corpora; evolutionary-based method; expert finding; genetic programming; optimization problem; parent-child relation; Biological cells; Educational institutions; Social network services; Sociology; Statistics; conversation; email; emails thread
Conference_Title :
Advances in Social Networks Analysis and Mining (ASONAM), 2012 IEEE/ACM International Conference on
Conference_Location :
Istanbul
Print_ISBN :
978-1-4673-2497-7
DOI :
10.1109/ASONAM.2012.195