DocumentCode :
2874648
Title :
Post-Level Spam Detection for Social Bookmarking Web Sites
Author :
Yang, Hsin-Chang ; Lee, Chung-Hong
Author_Institution :
Dept. of Inf. Manage., Nat. Univ. of Kaohsiung, Kaohsiung, Taiwan
fYear :
2011
fDate :
25-27 July 2011
Firstpage :
180
Lastpage :
185
Abstract :
Social book marking Web sites have emerged recently for collecting and sharing of interesting Web sites among users. People can add Web pages to such sites as bookmarks and allow themselves as well as others to manipulate them. One of the key features of the social book marking sites is the ability of annotating a Web page when it is being bookmarked. The annotation usually contains a set of words or phrases, which are collectively known as tags, that could reveal the semantics of the annotated Web page. Efficient and effective search of Web pages can then be achieved via such tags. However, spam tags that are irrelevant to the content of Web pages often appear to deceive other users for malicious or commercial purposes. Various techniques have been devised to tackle such tag spam detection problem. Most of these techniques were able to detect a user that always annotate spam tags. However, finer levels of detection are seldom discussed. In this work, we will propose a method based on a text mining approach to discover the relations between Web pages and there tag posts. These relations are then used to compute the similarity between a Web page and its tag post to decide if it is a spam post. Preliminary experiments show that the accuracy of the post-level spam detection task is 83%.
Keywords :
Web sites; semantic Web; text analysis; unsolicited e-mail; Web page; Website; post level spam detection; social bookmarking; text mining approach; Neurons; Phase change materials; Semantics; Text mining; Training; Vocabulary; Web pages;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Advances in Social Networks Analysis and Mining (ASONAM), 2011 International Conference on
Conference_Location :
Kaohsiung
Print_ISBN :
978-1-61284-758-0
Electronic_ISBN :
978-0-7695-4375-8
Type :
conf
DOI :
10.1109/ASONAM.2011.81
Filename :
5992578
Link To Document :
بازگشت