Journal of Xidian University ›› 2020, Vol. 47 ›› Issue (1): 37-43.doi: 10.19665/j.issn1001-2400.2020.01.006

Previous Articles     Next Articles

Improved algorithm for detection of the malicious domain name based on the convolutional neural network

YANG Luhui1,LIU Guangjie1,2,ZHAI Jiangtao2,LIU Weiwei1,BAI Huiwen1,DAI Yuewei1,2   

  1. 1. School of Automation, Nanjing University of Science & Technology, Nanjing 210094, China
    2. School of Electronic & Information Engineering, Nanjing University of Information Science & Technology, Nanjing 210044, China
  • Received:2019-09-06 Online:2020-02-20 Published:2020-03-19

Abstract:

Aiming at the problem that the existing detection methods are not efficient in detecting the malicious domain name generated by the algorithm, especially the detection rate of several types of malicious domain names that are difficult to detect is low, an improved algorithm for detection of the malicious domain name based on the convolutional neural network is proposed. Based on the existing convolutional neural network model, this algorithm adds convolutional branches to extract deeper character-level features, so that both shallow and deep character-level features of malicious domain names could be extracted and fused simultaneously. A focal loss function is introduced as a loss function to solve the problem of sample imbalance caused by difficulty and quantity, which is used to improve the detection accuracy of hard-to-detect samples. The average detection accuracy of the improved algorithm for 20 types of malicious domain names is 97.62%, that is, 0.94% higher than that of the original algorithm, and the detection accuracy of four hard-to-detect domain names is increased by 3.71%, 4.6%, 11.18% and 17.8%, respectively. Experimental results show that the improved algorithm can effectively improve the detection accuracy of malicious domain names, especially for some hard-to-detect domain names.

Key words: convolutional neural network, domain generation algorithms, deep learning, information security

CLC Number: 

  • TP309