DocumentCode
477797
Title
A Novel Approach to Naive Bayes Web Page Automatic Classification
Author
He, Zhongli ; Liu, Zhijing
Author_Institution
Sch. of Comput. Sci. & Technol., Xidian Univ., Xian
Volume
2
fYear
2008
fDate
18-20 Oct. 2008
Firstpage
361
Lastpage
365
Abstract
In this paper, a novel approach of Web page classification using Naive Bayes (NB) classifier based on independent component analysis (ICA) is proposed. In order to perform the classification, a Web page is firstly represented by a vector of features with different weights, and the weight calculated method is improved. As the number of the features is big, principal component analysis (PCA) which is to select the relevant features will perform in preprocessing section as input for improved ICA algorithm (MFICA). Finally, the output of MFICA is sent to NB classifier for classification to boost the classifierpsilas performance. The experimental evaluation demonstrates that the NB classifier based on ICA model provides acceptable classification accuracy.
Keywords
Bayes methods; Web sites; independent component analysis; information analysis; principal component analysis; Naive Bayes classifier; Web page automatic classification; independent component analysis; principal component analysis; Data mining; Fuzzy systems; Independent component analysis; Information retrieval; Internet; Neural networks; Niobium; Principal component analysis; Signal processing algorithms; Web pages;
fLanguage
English
Publisher
ieee
Conference_Titel
Fuzzy Systems and Knowledge Discovery, 2008. FSKD '08. Fifth International Conference on
Conference_Location
Shandong
Print_ISBN
978-0-7695-3305-6
Type
conf
DOI
10.1109/FSKD.2008.284
Filename
4666139
Link To Document