Title :
Customizable Instance-Driven Webpage Filtering Based on Semi-Supervised Learning
Author :
Zhu, Mingliang ; Hu, Weiming ; Li, Xi ; Wu, Ou
Abstract :
The World Wide Web has been growing rapidly in recent years, along with increasing needs for content-based Webpage filtering. But most existing filtering systems cannot easily satisfy the personalized filtering demands from different users at the same time. In this paper, a customizable instance-driven Webpage filtering strategy is proposed. For different users, different Webpage filters are produced by our system through mining the certain Webpage classes they focus on. A semi-supervised learning (SSL) approach is applied for obtaining a precise description of the Webpage class which a user wants to filter based on the small sized user instance set he or she provided. Subsequently, a feature selection step is performed and a Bayes classifier is created over the enlarged training set. Experimental results show the great stability and high performance of our proposed method, and it outperforms existing methods.
Keywords :
Bayes methods; Web sites; information filtering; learning (artificial intelligence); Bayes classifier; Webpage class; World Wide Web; content-based Webpage filtering; customizable instance-driven Webpage filtering; feature selection step; personalized filtering; semi-supervised learning; Frequency; Information filtering; Information filters; Laboratories; Pattern recognition; Semisupervised learning; Support vector machine classification; Support vector machines; Text categorization; Uniform resource locators;
Conference_Titel :
Web Intelligence, IEEE/WIC/ACM International Conference on
Conference_Location :
Fremont, CA
Print_ISBN :
978-0-7695-3026-0