DocumentCode :
2315406
Title :
Web opinions analysis with scalable distance-based clustering
Author :
Yang, Christopher C. ; Ng, Tobun D.
Author_Institution :
Coll. of Inf. Sci. & Technol., Drexel Univ., Philadelphia, PA, USA
fYear :
2009
fDate :
8-11 June 2009
Firstpage :
65
Lastpage :
70
Abstract :
Due to the advance of Web 2.0 technologies, a large volume of Web opinions is available on computer-mediated communication sites such as forums and blogs. Many of these Web opinions involve terrorism- and crime-related issues. For instance, some terrorist groups may use Web forums to propagandize their ideology, some may post threatening messages, and some criminals may recruit members or identify victims through Web social networks. Analyzing and clustering Web opinions is extremely challenging. Unlike regular documents, Web opinions usually appear as short and sparse text messages. Using typical document clustering techniques on Web opinions produces unsatisfying results. In this work, we propose a scalable distance-based clustering technique for Web opinions clustering. We have conducted experiments benchmarking it against a density-based algorithm. The results show that the proposed technique obtains higher micro and macro accuracy. This Web opinions clustering technique is useful for identifying the themes of discussions in Web social networks and for studying their development as well as the interactions of active participants.
Keywords :
computer mediated communication; social networking (online); terrorism; text analysis; Web 2.0 technology; Web blog; Web forum; Web opinions clustering; Web social network; computer-mediated communication site; crime-related issues; density-based clustering algorithm; scalable distance-based document clustering technique; terrorism issues; text message; Blogs; Clustering algorithms; Computer mediated communication; Educational institutions; Information analysis; Information science; Laboratories; Social network services; Software libraries; Visualization; Web forum analysis; content analysis; document clustering; social media analytics; social networks;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Intelligence and Security Informatics, 2009. ISI '09. IEEE International Conference on
Conference_Location :
Dallas, TX
Print_ISBN :
978-1-4244-4171-6
Electronic_ISBN :
978-1-4244-4173-0
Type :
conf
DOI :
10.1109/ISI.2009.5137273
Filename :
5137273