Title :
A Content-Oriented Framework for Online Discussion Analysis
Author :
Stavrianou, Anna ; Chauchat, Jean-Hugues ; Velcin, Julien
Author_Institution :
ERIC Lab., Univ. Lumiere Lyon 2, Lyon
Abstract :
Mining and extracting quality knowledge from online discussions is significant for the industrial and marketing sector, as well as for e-commerce applications. Most of the existing techniques model a discussion as a social network of users represented by a user-based graph. In this paper, we propose a new framework for discussion analysis. It is based on message-based graphs where each vertex represents a message object and each edge points out which message the specific node replies to. The edges can be weighted by the keywords that characterize the exchanged messages. This model allows a content-oriented representation of the discussion and it facilitates the identification of discussion chains. We compare the two representations (user-based and message-based graphs) and we analyze the different information that can be extracted from them. Our experiments with real data validate the proposed framework and show the additional information that can be extracted from a message-based graph.
Keywords :
data mining; graph theory; content-oriented framework; content-oriented representation; message-based graphs; online discussion analysis; quality knowledge extraction; social network; user-based graph; Data mining; Information analysis; Information services; Internet; Java; Mining industry; Social network services; Statistical analysis; Text mining; Web sites; discussion analysis; forums; message-based graphs; social networks;
Conference_Titel :
Advanced Information Networking and Applications Workshops, 2009. WAINA '09. International Conference on
Conference_Location :
Bradford
Print_ISBN :
978-1-4244-3999-7
Electronic_ISBN :
978-0-7695-3639-2
DOI :
10.1109/WAINA.2009.57