DocumentCode
64742
Title
ISC: An Iterative Social Based Classifier for Adult Account Detection on Twitter
Author
Hanqiang Cheng ; Xinyu Xing ; Xue Liu ; Qin Lv
Author_Institution
Sch. of Comput. Sci., McGill Univ., Montreal, QC, Canada
Volume
27
Issue
4
fYear
2015
fDate
April 1 2015
Firstpage
1045
Lastpage
1056
Abstract
The widespread of adult content on online social networks (e.g., Twitter) is becoming an emerging yet critical problem. An automatic method to identify accounts spreading sexually explicit content (i.e., adult account) is of significant values in protecting children and improving user experiences. Traditional adult content detection techniques are ill-suited for detecting adult accounts on Twitter due to the diversity and dynamics in Twitter content. In this paper, we formulate the adult account detection as a graph based classification problem and demonstrate our detection method on Twitter by using social links between Twitter accounts and entities in tweets. As adult Twitter accounts are mostly connected with normal accounts and post many normal entities, which makes the graph full of noisy links, existing graph based classification techniques cannot work well on such a graph. To address this problem, we propose an iterative social based classifier (ISC), a novel graph based classification technique resistant to the noisy links. Evaluations using large-scale real-world Twitter data show that, by labeling a small number of popular Twitter accounts, ISC can achieve satisfactory performance in adult account detection, significantly outperforming existing techniques.
Keywords
graph theory; iterative methods; pattern classification; social networking (online); ISC; Twitter accounts; adult account detection; graph based classification problem; iterative social based classifier; online social networks; social links; tweets; Correlation; Educational institutions; Feature extraction; Labeling; Noise measurement; Twitter; Twitter; adult content; graph based classification;
fLanguage
English
Journal_Title
Knowledge and Data Engineering, IEEE Transactions on
Publisher
ieee
ISSN
1041-4347
Type
jour
DOI
10.1109/TKDE.2014.2357012
Filename
6895278
Link To Document