Title :
Research on web filtering technology based on the dual feature selection
Author :
Bin Zhang ; Miao Xu ; Minli Wu
Author_Institution :
Pattern Recognition & Intell. Syst. Lab., Beijing Univ. of Posts & Telecommun., Beijing, China
Abstract :
In the topic search system, some of web pages got by crawling are inconsistent with user demands. For this situation, this paper had a research on content-based web filtering technology. This paper proposed a dual feature selection method based on the CHI statistical method and N-gram, and then made binary text classification by SVM in order to achieve Web Filtering. The experiments showed that the proposed web filtering method has better results.
Keywords :
Internet; content-based retrieval; information filtering; pattern classification; query formulation; support vector machines; text analysis; SVM; Web pages; binary text classification; content-based Web filtering technology; dual feature selection; support vector machines; topic search system; Feature extraction; Filtering; Procurement; Statistical analysis; Support vector machines; Text categorization; Web pages; CHI statistical method; Feature selection; TF-IDF; Web filtering;
Conference_Titel :
Network Infrastructure and Digital Content (IC-NIDC), 2012 3rd IEEE International Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4673-2201-0
DOI :
10.1109/ICNIDC.2012.6418841