Title :
Computational analysis of thematic blog data for sociological inference mining
Author :
Singh, V.K. ; Waila, P. ; Sadat, R. ; Piryani, R. ; Uddin, Ahsan
Author_Institution :
Dept. of Comput. Sci., South Asian Univ., New Delhi, India
Abstract :
This paper describes our proposed approach for computational analysis of thematic blog data through a novel combination of sophisticated Information Retrieval and Language Processing techniques. We have implemented algorithms for Topic Modeling, Entity Extraction and Sentiment Classification with a view to drawing sociologically relevant inferences from freeform unstructured social media data. Our experimental data comprised more than 600 blog posts on the broader theme of 'Discrimination, Abuse and Sexual Crime against Women', collected during two discrete time periods. We have tried to extract important inferences from the data, such as the key persons and organizations mentioned, the key themes encountered across the entire collection, the sentiment orientation inherent in the texts, and the variation in topic trends between the two time periods. The results obtained validate the usefulness of our approach for computational analysis of social media data.
Keywords :
data mining; inference mechanisms; information retrieval; natural language processing; pattern classification; social networking (online); social sciences computing; abuse against women; computational analysis; discrete time periods; discrimination against women; entity extraction; freeform unstructured social media data; inference extraction; information retrieval techniques; language processing techniques; sentiment classification; sentiment orientation; sexual crime against women; social media data; sociological inference mining; sociologically relevant inferences; thematic blog data; topic modeling; Blogs; Computational modeling; Data mining; Educational institutions; Media; Organizations; Tag clouds; Information Extraction; Sentiment Classification; Social Computing; Social Media; Text Analytics;
Conference_Title :
Applied Computational Intelligence and Informatics (SACI), 2013 IEEE 8th International Symposium on
Conference_Location :
Timisoara
Print_ISBN :
978-1-4673-6397-6
DOI :
10.1109/SACI.2013.6608985