Title :
Exploiting the Social Tagging Network for Web Clustering
Author :
Lu, Caimei ; Hu, Xiaohua ; Park, Jung-ran
Author_Institution :
Coll. of Inf. Sci. & Technol., Drexel Univ., Philadelphia, PA, USA
Abstract :
Social tagging is a major characteristic of Web 2.0. A social tagging system can be modeled with a tripartite network of users, resources, and tags. In this paper, we investigate how to enhance Web clustering by leveraging the tripartite network of social tagging systems. We propose a clustering method called “Tripartite Clustering” which clusters the three types of nodes (resources, users, and tags) simultaneously by only utilizing the links in the social tagging network. We also investigate two other approaches to exploit social tagging for clustering with K-means and Link K-means. All the clustering methods are experimented on a real-world social tagging data set sampled from del.icio.us. The clustering results are evaluated against a human-maintained Web directory. The experimental results show that the social tagging network is a very useful information source for document clustering. All social-annotation-based clustering methods can significantly improve the performance of content-based clustering. Compared to social-annotation-based K-means and Link K-means, Tripartite Clustering achieves equivalent or better performance and produces more useful information.
Keywords :
Internet; document handling; information resources; pattern clustering; Link K-means; Web 2.0; Web clustering; content based clustering; document clustering; human maintained Web directory; information source; social annotation based K-means; social annotation based clustering method; social tagging data set; social tagging network; tripartite clustering; tripartite network; Clustering algorithms; Clustering methods; Measurement; Semantics; Tagging; Vectors; Web pages; Clustering methods; social annotation; social tagging; tripartite network;
Journal_Title :
Systems, Man and Cybernetics, Part A: Systems and Humans, IEEE Transactions on
DOI :
10.1109/TSMCA.2011.2157128