Title :
IdentifyingWeb Spam by Densely Connected Sites and its Statistics in a JapaneseWeb Snapshot
Author :
Ono, Hiroshi ; Toyoda, Masashi ; Kitsuregawa, Masaru
Author_Institution :
The University of Tokyo, Japan
Abstract :
Web spamming refers to actions intended to mislead search engines into ranking certain pages higher than they deserve. Recently, the amount of web spam has increased dramatically, leading to a degradation of search results. One of the most effective spamming techniques is link spamming. This is done by setting up an interconnected structure of pages for deceiving link-based ranking methods, such as PageRank. In this paper, we analyze distributions of link spam in our archive of Japanese web pages using link analysis techniques.
Keywords :
Data engineering; Data mining; Degradation; Information analysis; Optimization methods; Search engines; Statistics; Toy industry; Unsolicited electronic mail; Web pages;
Conference_Titel :
Data Engineering Workshops, 2006. Proceedings. 22nd International Conference on
Conference_Location :
Atlanta, GA, USA
Print_ISBN :
0-7695-2571-7
DOI :
10.1109/ICDEW.2006.64