• DocumentCode
    477797
  • Title

    A Novel Approach to Naive Bayes Web Page Automatic Classification

  • Author

    He, Zhongli ; Liu, Zhijing

  • Author_Institution
    Sch. of Comput. Sci. & Technol., Xidian Univ., Xian
  • Volume
    2
  • fYear
    2008
  • fDate
    18-20 Oct. 2008
  • Firstpage
    361
  • Lastpage
    365
  • Abstract
    In this paper, a novel approach of Web page classification using Naive Bayes (NB) classifier based on independent component analysis (ICA) is proposed. In order to perform the classification, a Web page is firstly represented by a vector of features with different weights, and the weight calculated method is improved. As the number of the features is big, principal component analysis (PCA) which is to select the relevant features will perform in preprocessing section as input for improved ICA algorithm (MFICA). Finally, the output of MFICA is sent to NB classifier for classification to boost the classifierpsilas performance. The experimental evaluation demonstrates that the NB classifier based on ICA model provides acceptable classification accuracy.
  • Keywords
    Bayes methods; Web sites; independent component analysis; information analysis; principal component analysis; Naive Bayes classifier; Web page automatic classification; independent component analysis; principal component analysis; Data mining; Fuzzy systems; Independent component analysis; Information retrieval; Internet; Neural networks; Niobium; Principal component analysis; Signal processing algorithms; Web pages;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Fuzzy Systems and Knowledge Discovery, 2008. FSKD '08. Fifth International Conference on
  • Conference_Location
    Shandong
  • Print_ISBN
    978-0-7695-3305-6
  • Type

    conf

  • DOI
    10.1109/FSKD.2008.284
  • Filename
    4666139