DocumentCode
2513471
Title
Automatic Detection of Phishing Target from Phishing Webpage
Author
Liu, Gang ; Qiu, Bite ; Wenyin, Liu
Author_Institution
Dept. of Comput. Sci., City Univ. of Hong Kong, Hong Kong, China
fYear
2010
fDate
23-26 Aug. 2010
Firstpage
4153
Lastpage
4156
Abstract
An approach to identification of the phishing target of a given (suspicious) webpage is proposed by clustering the webpage set consisting of its all associated webpages and the given webpage itself. We first find its associated webpages, and then explore their relationships to the given webpage as their features for clustering. Such relationships include link relationship, ranking relationship, text similarity, and webpage layout similarity relationship. A DBSCAN clustering method is employed to find if there is a cluster around the given webpage. If such cluster exists, we claim the given webpage is a phishing webpage and then find its phishing target (i.e., the legitimate webpage it is attacking) from this cluster. Otherwise, we identify it as a legitimate webpage. Our test dataset consists of 8745 phishing pages (targeting at 76 well-known websites) selected from Phish Tank and preliminary experiments show that the approach can successfully identify 91.44% of their phishing targets. Another dataset of 1000 legitimate webpages is collected to test our method´s false alarm rate, which is 3.40%.
Keywords
Internet; computer crime; pattern clustering; text analysis; DBSCAN clustering method; Webpage layout similarity relationship; Webpage set clustering; legitimate Webpage; link relationship; phishing Webpage; phishing target automatic detection; ranking relationship; text similarity; Accuracy; Clustering algorithms; Data mining; Feature extraction; Layout; Object detection; Visualization; Anti-Phishing; DBSCAN Clustering; Phishing; Web Document Analysis;
fLanguage
English
Publisher
ieee
Conference_Titel
Pattern Recognition (ICPR), 2010 20th International Conference on
Conference_Location
Istanbul
ISSN
1051-4651
Print_ISBN
978-1-4244-7542-1
Type
conf
DOI
10.1109/ICPR.2010.1010
Filename
5597725
Link To Document