• DocumentCode
    1831538
  • Title

    An efficient text filter for adult Web documents

  • Author

    Kim, Youngsoo ; Nam, Taekyong

  • Author_Institution
    Network Security Group, Electron. & Telecommun. Res. Inst.
  • Volume
    1
  • fYear
    2006
  • fDate
    20-22 Feb. 2006
  • Lastpage
    440
  • Abstract
    The openness of the Web allows any users to access almost any type of information. However, some information, such as adult content, is not appropriate for all users, notably children. Additionally for adults, some contents included in abnormal pornographic sites can do ordinary people´s mental health harm. In this paper, we propose a new criterion and divide contents of Web documents into 4 grades. We use a hierarchical way of filtering texts. At first, we filter off 0-grade texts contain no adult contents using a pattern matching algorithm, and classify 1-grade, 2-grade and 3-grade texts using a machine learning algorithm
  • Keywords
    Internet; information filtering; information filters; learning (artificial intelligence); pattern matching; text analysis; adult Web documents; adult content; contents filtering; machine learning algorithm; pattern matching algorithm; pornographic sites; text filter; Drugs; Educational products; Information filtering; Information filters; Information security; Internet; Machine learning algorithms; Matched filters; Pattern matching; Pediatrics; Contents Filtering; Contents Rating Services; Text Classification; adult contents; web documents;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Advanced Communication Technology, 2006. ICACT 2006. The 8th International Conference
  • Conference_Location
    Phoenix Park
  • Print_ISBN
    89-5519-129-4
  • Type

    conf

  • DOI
    10.1109/ICACT.2006.206003
  • Filename
    1625608