• DocumentCode
    3026166
  • Title

    Fine-Grained Mining and Classification of Malicious Web Pages

  • Author

    Tao Yue ; Jianhua Sun ; Hao Chen

  • Author_Institution
    Coll. of Inf. Sci. & Eng., Hunan Univ., Changsha, China
  • fYear
    2013
  • fDate
    29-30 June 2013
  • Firstpage
    616
  • Lastpage
    619
  • Abstract
    With the World Wide Web expanding continuously, more and more malicious web pages including phishing, malware and spamming spread rapidly, we are facing a great threat. The work of detecting malicious web pages and identifying their threat types has some shortcomings. In existing studies of malicious webpage detection, most are just for detecting a single attack type. In this paper, we extract a variety of webpage features and use machine learning algorithms to build an efficient classifier. By the classifier, we can detect malicious web pages and identify all the popular threat types. The features extracted in our method are derived from the HTML contents, the associated JavaScript code, and the corresponding URL. We collected 1000 benign web pages and 1500 malicious web pages as experimental data sets. The experimental results show that our method achieves a superior performance: the accuracy was over 95% in detecting malicious web pages and over 88% in identifying threat types.
  • Keywords
    Internet; Java; data mining; feature extraction; hypermedia markup languages; learning (artificial intelligence); pattern classification; security of data; HTML contents; JavaScript code; World Wide Web; features extraction; fine grained mining; machine learning algorithms; malicious Web page detection; malicious Web pages; single attack type; Automation; Manufacturing; Classifier; Decetion; Feature Extration; Malicious; Trainning;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Digital Manufacturing and Automation (ICDMA), 2013 Fourth International Conference on
  • Conference_Location
    Qingdao
  • Type

    conf

  • DOI
    10.1109/ICDMA.2013.145
  • Filename
    6598066