• DocumentCode
    1700757
  • Title

    Applying a novel combined classifier for pornographic web filtering in a grid computing environment

  • Author

    Gao, Zhong ; Lu, Guanming ; Zhao, Xin ; Qin, Danni ; Qin, Mei

  • Author_Institution
    Coll. of Telecommun. & Inf. Eng., Nanjing Univ. of Posts & Telecommun., Nanjing
  • fYear
    2008
  • Firstpage
    513
  • Lastpage
    517
  • Abstract
    As the Web expands exponentially, there are a flood of pornographic Web sites on the Internet. Thus effective and fast web filtering systems are essential. Web filtering based on hypertext classification has become one of the important techniques to handle and filter inappropriate information on the Web. The task involved can be parallelized and distributed in a grid environment. However, how to improve the performance of the hypertext classification under the situation of noisy data is still a challenging problem. In this paper, we propose a new approach for hypertext classification in Web filtering, which uses a novel support vector machine and k-nearest neighbor (KNN-SVM) to remove noisy training examples. The task of text categorization is distributed in several computers. The experimental results show that the generalization performance in the accuracy of classification and the processing time are improved significantly compared to that of the traditional SVM classifier over the grid, and adapt to engineering applications.
  • Keywords
    Internet; grid computing; hypermedia; information filters; support vector machines; text analysis; Internet; grid computing; hypertext classification; k-nearest neighbor; pornographic Web filtering; pornographic Web sites; support vector machine; text categorization; Distributed computing; Floods; Grid computing; Information filtering; Information filters; Internet; Support vector machine classification; Support vector machines; Text categorization; Working environment noise; grid computing; hypertext Classification; k-nearest neighbor; support vector machine; web page filtering;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Supported Cooperative Work in Design, 2008. CSCWD 2008. 12th International Conference on
  • Conference_Location
    Xi´an
  • Print_ISBN
    978-1-4244-1650-9
  • Electronic_ISBN
    978-1-4244-1651-6
  • Type

    conf

  • DOI
    10.1109/CSCWD.2008.4537031
  • Filename
    4537031