• DocumentCode
    1918215
  • Title
    Automatic Detection of Social Tag Spams Using a Text Mining Approach
  • Author
    Yang, Hsin-Chang; Lee, Chung-Hong
  • Author_Institution
    Dept. of Inf. Manage., Nat. Univ. of Kaohsiung, Kaohsiung, Taiwan
  • fYear
    2010
  • fDate
    9-11 Aug. 2010
  • Firstpage
    441
  • Lastpage
    445
  • Abstract
    Social tags are annotations that users collaboratively add to Web pages. Tags make it easier to understand the meaning of Web pages and to classify them, and they may also increase precision when retrieving Web pages. Today, social tags are mostly assigned manually by users through social bookmarking Web sites. This manual annotation process may produce diverse, redundant, and inconsistent tags. Moreover, many tags are inconsistent with the Web pages they annotate, which reduces the usefulness of social tags. In this work we develop an automatic scheme that discovers the associations between Web pages and social tags and applies them to social tag spam detection. We apply a text mining approach based on self-organizing maps to find the relationships between Web pages and social tags; these relationships remedy the disadvantages of manual annotation. The discovered associations are then used to identify social tag spam. Preliminary experiments show that the method improves the quality and usability of social tags.
  • Keywords
    Web sites; data mining; information retrieval; text analysis; Web pages; Web sites bookmarking; automatic detection; social tag spam detection; social tag spams; text mining approach; Labeling; Neurons; Phase change materials; Semantics; Training
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Advances in Social Networks Analysis and Mining (ASONAM), 2010 International Conference on
  • Conference_Location
    Odense
  • Print_ISBN
    978-1-4244-7787-6
  • Electronic_ISBN
    978-0-7695-4138-9
  • Type
    conf
  • DOI
    10.1109/ASONAM.2010.11
  • Filename
    5563061