DocumentCode :
2918707
Title :
Categorization of Blogs through Similarity Analysis
Author :
Choi, Hwan-Joon ; Krishnamoorthy, Mukkai S.
Author_Institution :
Rensselaer Polytech. Inst., Troy
fYear :
2007
fDate :
23-24 May 2007
Firstpage :
160
Lastpage :
165
Abstract :
We describe a new model for evaluating similarities among a large number of web logs, and compare several algorithms using the model. Possible uses of this include isolating and tracking like-minded networks for surveillance and improved categorization. Our model consists of similarity analysis combined with clustering. Experimental results show that our algorithm is able to separate blogs into categories, consistently achieving over 90% success rate.
Keywords :
Web sites; graph theory; information retrieval; Web browsing; Web log categorization; search engine; similarity analysis; social networking; weighted undirected graph; Algorithm design and analysis; Blogs; Clustering algorithms; Computer science; Joining processes; MySpace; Social network services; Surveillance; Traffic control; Uniform resource locators; clustering; social networking; weblog;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Intelligence and Security Informatics, 2007 IEEE
Conference_Location :
New Brunswick, NJ
Electronic_ISBN :
1-4244-1329-X
Type :
conf
DOI :
10.1109/ISI.2007.379549
Filename :
4258690
Link To Document :
بازگشت