Title :
Designing a web spam classifier based on feature fusion in the Layered Multi-population Genetic Programming framework
Author :
Keyhanipour, Amir Hosein ; Moshiri, Behzad
Author_Institution :
Center of Excellence, Univ. of Tehran, Tehran, Iran
Abstract :
Nowadays, Web spam pages are a critical challenge for Web retrieval systems which have drastic influence on the performance of such systems. Although these systems try to combat the impact of spam pages on their final results list, spammers increasingly use more sophisticated techniques to increase the number of views for their intended pages in order to have more commercial success. This paper employs the recently proposed Layered Multi-population Genetic Programming model for Web spam detection task as well application of correlation coefficient analysis for feature space reduction. Based on our tentative results, the designed classifier, which is based on a combination of easy to compute features, has a very reasonable performance in comparison with similar methods.
Keywords :
Internet; Web sites; correlation methods; feature extraction; genetic algorithms; information retrieval; pattern classification; unsolicited e-mail; Web retrieval systems; Web spam classifier; Web spam detection task; Web spam pages; correlation coefficient analysis; feature fusion; feature space reduction; layered multipopulation genetic programming framework; layered multipopulation genetic programming model; Correlation coefficient; Feature extraction; Genetic programming; Sociology; Unsolicited electronic mail; Web pages; Classifier; Layered Multi-Population Genetic Programming; Spam; Web;
Conference_Titel :
Information Fusion (FUSION), 2013 16th International Conference on
Conference_Location :
Istanbul
Print_ISBN :
978-605-86311-1-3