DocumentCode
1831538
Title
An efficient text filter for adult Web documents
Author
Kim, Youngsoo ; Nam, Taekyong
Author_Institution
Network Security Group, Electron. & Telecommun. Res. Inst.
Volume
1
fYear
2006
fDate
20-22 Feb. 2006
Lastpage
440
Abstract
The openness of the Web allows any users to access almost any type of information. However, some information, such as adult content, is not appropriate for all users, notably children. Additionally for adults, some contents included in abnormal pornographic sites can do ordinary people´s mental health harm. In this paper, we propose a new criterion and divide contents of Web documents into 4 grades. We use a hierarchical way of filtering texts. At first, we filter off 0-grade texts contain no adult contents using a pattern matching algorithm, and classify 1-grade, 2-grade and 3-grade texts using a machine learning algorithm
Keywords
Internet; information filtering; information filters; learning (artificial intelligence); pattern matching; text analysis; adult Web documents; adult content; contents filtering; machine learning algorithm; pattern matching algorithm; pornographic sites; text filter; Drugs; Educational products; Information filtering; Information filters; Information security; Internet; Machine learning algorithms; Matched filters; Pattern matching; Pediatrics; Contents Filtering; Contents Rating Services; Text Classification; adult contents; web documents;
fLanguage
English
Publisher
ieee
Conference_Titel
Advanced Communication Technology, 2006. ICACT 2006. The 8th International Conference
Conference_Location
Phoenix Park
Print_ISBN
89-5519-129-4
Type
conf
DOI
10.1109/ICACT.2006.206003
Filename
1625608
Link To Document