• DocumentCode
    2245530
  • Title

    Detecting Trojan horses based on system behavior using machine learning method

  • Author

    Liu, Yu-Feng ; Zhang, Li-Wei ; Liang, Jian ; Qu, Sheng ; Ni, Zhi-Qiang

  • Author_Institution
    Data Min. Group, Tsinghua Univ., Beijing, China
  • Volume
    2
  • fYear
    2010
  • fDate
    11-14 July 2010
  • Firstpage
    855
  • Lastpage
    860
  • Abstract
    The Research of detection malware using machine learning method attracts much attention recent years. However, most of research focused on code analysis which is signature-based or analysis of system call sequence in Linux environment. Obviously, all methods have their strengths and weaknesses. In this paper, we concentrate on detection Trojan horse by operation system information in Windows environment using data mining technology. Our main content and contribution contains as follows: First, we collect Trojan horse samples in true network environment and classify them by scanner. Secondly, we collect operation system behavior features under infected and clean circumstances separately by WMI manager tools. And then, several classic classification algorithms are applied and a performance comparison is given. Feature selection methods are applied to those features and we get a feature order list which reflects the relevance order of Trojan horse activities and the system feature. We believe the instructive meaning of the list is significant. Finally, a feature combination method is applied and features belongs different groups are combined according their characteristic for high classification performance. Results of experiments demonstrate the feasibility of our assumption that detecting Trojan horses by system behavior information is feasible and affective.
  • Keywords
    data mining; invasive software; learning (artificial intelligence); operating systems (computers); pattern classification; Linux environment; Trojan horse detection; WMI manager tools; Windows environment; classification algorithms; code analysis; data mining; feature combination; feature selection; machine learning; malware detection; operation system information; system behavior; Accuracy; Computers; Cybernetics; Learning systems; Machine learning; Trojan horses; Classification; Feature selection; System behavior; Trojan horse;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Machine Learning and Cybernetics (ICMLC), 2010 International Conference on
  • Conference_Location
    Qingdao
  • Print_ISBN
    978-1-4244-6526-2
  • Type

    conf

  • DOI
    10.1109/ICMLC.2010.5580591
  • Filename
    5580591