• DocumentCode
    2513471
  • Title

    Automatic Detection of Phishing Target from Phishing Webpage

  • Author

    Liu, Gang ; Qiu, Bite ; Wenyin, Liu

  • Author_Institution
    Dept. of Comput. Sci., City Univ. of Hong Kong, Hong Kong, China
  • fYear
    2010
  • fDate
    23-26 Aug. 2010
  • Firstpage
    4153
  • Lastpage
    4156
  • Abstract
    An approach to identification of the phishing target of a given (suspicious) webpage is proposed by clustering the webpage set consisting of its all associated webpages and the given webpage itself. We first find its associated webpages, and then explore their relationships to the given webpage as their features for clustering. Such relationships include link relationship, ranking relationship, text similarity, and webpage layout similarity relationship. A DBSCAN clustering method is employed to find if there is a cluster around the given webpage. If such cluster exists, we claim the given webpage is a phishing webpage and then find its phishing target (i.e., the legitimate webpage it is attacking) from this cluster. Otherwise, we identify it as a legitimate webpage. Our test dataset consists of 8745 phishing pages (targeting at 76 well-known websites) selected from Phish Tank and preliminary experiments show that the approach can successfully identify 91.44% of their phishing targets. Another dataset of 1000 legitimate webpages is collected to test our method´s false alarm rate, which is 3.40%.
  • Keywords
    Internet; computer crime; pattern clustering; text analysis; DBSCAN clustering method; Webpage layout similarity relationship; Webpage set clustering; legitimate Webpage; link relationship; phishing Webpage; phishing target automatic detection; ranking relationship; text similarity; Accuracy; Clustering algorithms; Data mining; Feature extraction; Layout; Object detection; Visualization; Anti-Phishing; DBSCAN Clustering; Phishing; Web Document Analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Pattern Recognition (ICPR), 2010 20th International Conference on
  • Conference_Location
    Istanbul
  • ISSN
    1051-4651
  • Print_ISBN
    978-1-4244-7542-1
  • Type

    conf

  • DOI
    10.1109/ICPR.2010.1010
  • Filename
    5597725