J4 ›› 2013, Vol. 40 ›› Issue (6): 116-124.doi: 10.3969/j.issn.1001-2400.2013.06.021

• Original Articles • Previous Articles     Next Articles

Improved behavior-based malware detection algorithm  with AdaBoost

CAO Ying;LIU Jiachen;MIAO Qiguang;GAO Lin   

  1. (School of Computer Science and Technology, Xidian Univ., Xi'an  710071, China)
  • Received:2012-08-08 Online:2013-12-20 Published:2014-01-10
  • Contact: CAO Ying E-mail:yingcao@stu.xidian.edu.cn

Abstract:

We present a new algorithm for abstracting features of a program from its API calls, network packages and static analysis characteristics. API calls are aggregated by a low level data dependence analysis to form the abstract behaviors.Network packages and static analysis characteristics are directly utilized as discrete value features.All of these abstract features are then embedded in a high dimension vector space. Besides, we further design a new behavior-based malware classification algorithm, which advances the AdaBoost boosted decision tree algorithm. Firstly, the new algorithm optimizes an anti-noise loss function to lower the probability of the noise data to train the next classifier, and thus improves the anti-noise ability of the AdaBoost algorithm. Secondly, to improve the algorithm's performance in multi-class classif bication problem, a vote vector is adopted to combine base classifiers, which discriminates the accuracy with which a classifier classifies samples from different classes.

Key words: malware, behavior abstraction, classification, decision tree, AdaBoost, loss function

CLC Number: 

  • TP339