Title :
Identifying vulgar content in eMule network through text classification
Author :
Liu, Xiangtao ; Cheng, Xueqi ; Li, Jingyuan ; Zhai, Haijun ; Bai, Shuo
Author_Institution :
Inst. of Comput. Technol., Chinese Acad. of Sci., Beijing, China
Abstract :
Through years of development, the cyberspace has been dominated by traffic of peer-to-peer (P2P) file sharing applications. Among them, eMule is especially favored by millions of P2P users all over the world. However, it is very difficult to manage the content which is delivered through eMule due to its distributed property, thus a large number of vulgar content (e.g., pornographic and violent files) is existing in eMule. Since children and adolescents are the main force of eMule users, it is quite necessary to provide an efficient method to identify and filter the vulgar content for the sake of innocent children and adolescents. In this study, an automatic framework based on text classification is proposed to identify and filter vulgar content in eMule. Filename is used as the feature to carry out the elementary research on the effectiveness of our framework, although filename may be changed freely by eMule users. We aim to achieve high accuracy when identifying and filtering vulgar content, thus to raise the quality of the content delivered in eMule to a higher level.
Keywords :
Assembly; Computers; Content management; Crawlers; Filtering; Filters; Peer to peer computing; Search engines; Spatial databases; Text categorization;
Conference_Titel :
Intelligence and Security Informatics (ISI), 2010 IEEE International Conference on
Conference_Location :
Vancouver, BC, Canada
Print_ISBN :
978-1-4244-6444-9
DOI :
10.1109/ISI.2010.5484751