• DocumentCode
    64742
  • Title

    ISC: An Iterative Social Based Classifier for Adult Account Detection on Twitter

  • Author

    Hanqiang Cheng ; Xinyu Xing ; Xue Liu ; Qin Lv

  • Author_Institution
    Sch. of Comput. Sci., McGill Univ., Montreal, QC, Canada
  • Volume
    27
  • Issue
    4
  • fYear
    2015
  • fDate
    April 1 2015
  • Firstpage
    1045
  • Lastpage
    1056
  • Abstract
    The widespread of adult content on online social networks (e.g., Twitter) is becoming an emerging yet critical problem. An automatic method to identify accounts spreading sexually explicit content (i.e., adult account) is of significant values in protecting children and improving user experiences. Traditional adult content detection techniques are ill-suited for detecting adult accounts on Twitter due to the diversity and dynamics in Twitter content. In this paper, we formulate the adult account detection as a graph based classification problem and demonstrate our detection method on Twitter by using social links between Twitter accounts and entities in tweets. As adult Twitter accounts are mostly connected with normal accounts and post many normal entities, which makes the graph full of noisy links, existing graph based classification techniques cannot work well on such a graph. To address this problem, we propose an iterative social based classifier (ISC), a novel graph based classification technique resistant to the noisy links. Evaluations using large-scale real-world Twitter data show that, by labeling a small number of popular Twitter accounts, ISC can achieve satisfactory performance in adult account detection, significantly outperforming existing techniques.
  • Keywords
    graph theory; iterative methods; pattern classification; social networking (online); ISC; Twitter accounts; adult account detection; graph based classification problem; iterative social based classifier; online social networks; social links; tweets; Correlation; Educational institutions; Feature extraction; Labeling; Noise measurement; Twitter; Twitter; adult content; graph based classification;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/TKDE.2014.2357012
  • Filename
    6895278