DocumentCode :
424240
Title :
Comparison of machine learning algorithms in Chinese Web filtering
Author :
Du, A-Ning ; Fang, Bin-Xing
Author_Institution :
Res. Center of Comput. Network & Inf. Security Technol., Harbin Inst. of Technol., China
Volume :
4
fYear :
2004
fDate :
26-29 Aug. 2004
Firstpage :
2526
Abstract :
Web filtering based on user´s demand has witnessed a booming interest due to the development of Internet In the research community the dominant approach to this problem is based on machine learning algorithms. Web filtering is an inductive process which automatically builds a filter by learning from a set of pre-assigned document and the description of user´s interest, and then uses it to assign unfiltered Web pages. This survey compares four main machine learning algorithms (decision tree, rule induction, Bayesian algorithm and support vector machines) on Chinese web pages set of their filtering effectiveness and computer resources consumed, focusing on the influence of feature set size and training set size. It induces that support vector machines earn high score in Chinese Web filtering applications.
Keywords :
Bayes methods; Internet; decision trees; information filters; learning (artificial intelligence); support vector machines; Bayesian algorithm; Chinese Web filtering; Internet; computer resource; decision tree; machine learning algorithm; rule induction; support vector machine; unfiltered Web page; Application software; Bayesian methods; Decision trees; Filtering algorithms; Information filtering; Information filters; Internet; Machine learning algorithms; Support vector machines; Web pages;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Machine Learning and Cybernetics, 2004. Proceedings of 2004 International Conference on
Print_ISBN :
0-7803-8403-2
Type :
conf
DOI :
10.1109/ICMLC.2004.1382229
Filename :
1382229
Link To Document :
بازگشت