• DocumentCode
    2860819
  • Title

    A Fuzzy Classification Based on Feature Selection for Web Pages

  • Author

    Mao-Yuan, Zhang ; Zheng-Ding, Lu

  • Author_Institution
    HuaZhong University of Science and Technology, Wuhan
  • fYear
    2004
  • fDate
    20-24 Sept. 2004
  • Firstpage
    469
  • Lastpage
    472
  • Abstract
    An automatic web page classification is needed for web information extraction, but the number of keywords of web pages is so giant that many classifications are not speedy or capable of self-learning. In this paper, a fuzzy classification method for web pages, which is based on fuzzy learning and parallel feature selection, is proposed. Fuzzy learning of parameter c{ik} is adopted to increase the accuracy, while parallel feature selection based on weighted similarity is used not only to decrease the dimension of the features but also to let parameter σ{ik} need no learning. The weights of features are deducted in theory, and to speed up the calculation of weights, a parallel sum algorithm of the matrix is proposed.
  • Keywords
    Computer science; Costs; Data mining; Decision trees; Design methodology; Euclidean distance; Information security; Organizing; Principal component analysis; Web pages;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Web Intelligence, 2004. WI 2004. Proceedings. IEEE/WIC/ACM International Conference on
  • Print_ISBN
    0-7695-2100-2
  • Type

    conf

  • DOI
    10.1109/WI.2004.10063
  • Filename
    1410846