Title :
Formalization of Link Farm Structure Using Graph Grammar
Author :
Chobtham, Kiattikun ; Surarerks, Athasit ; Rungsawang, Arnon
Author_Institution :
Chulalongkorn Univ., Bangkok
Abstract :
A link farm is a set of web pages constructed to mislead the importance of target pages in search engine results by boosting their link-based ranking scores. In this paper, we introduce a new graph grammar model for expressing the structure of a link farm. Supervised graph grammar induction created by an expert is modified to fit the training data to explain the behavior and the properties of link farms. In the experiments, graph grammar can effectively recognize link farms from Yahoo´s web spam dataset. The comparison among the number of applying production rules of spam and normal hosts indicates that graph grammar seem to be a good mechanism for detecting link spam.
Keywords :
graph grammars; search engines; unsolicited e-mail; Web pages; Web spam dataset; link farm structure; link spam detection; link-based ranking scores; search engine; supervised graph grammar induction; target pages; Application software; Boosting; Computer networks; Grid computing; Knowledge engineering; Laboratories; Production; Search engines; Training data; Web pages; GRAPH GRAMMAR; LINK FARM; PARSING; WEB GRAPH;
Conference_Titel :
Advanced Information Networking and Applications, 2008. AINA 2008. 22nd International Conference on
Conference_Location :
Okinawa
Print_ISBN :
978-0-7695-3095-6
DOI :
10.1109/AINA.2008.96