Journal of Xidian University ›› 2023, Vol. 50 ›› Issue (4): 206-214.doi: 10.19665/j.issn1001-2400.2023.04.020

• Special Issue on Cyberspace Security • Previous Articles     Next Articles

Dark web author alignment based on attention augmented convolutional networks

YANG Yanyan1(),DU Yanhui1(),LIU Hongmeng2(),ZHAO Jiapeng2(),SHI Jinqiao2(),WANG Xuebin3()   

  1. 1. Department of Information Technology and Cyber Security,People’s Public Security University of China,Beijing 100038,China
    2. School of Cyber Space Security,Beijing University of Posts and Telecommunications,Beijing 100876,China
    3. Institute of Information Engineering,Chinese Academy of Sciences,Beijing 100080,China
  • Received:2023-01-21 Online:2023-08-20 Published:2023-10-17
  • Contact: Jiapeng ZHAO E-mail:53996587@qq.com;duyanhui@ppsuc.edu.cn;cs2lhm@bupt.edu.cn;zhaojiapeng@bupt.edu.cn;shijinqiao@bupt.edu.cn;wangxuebin@iie.ac.cn

Abstract:

Dark network users engage in a large number of illegal and criminal activities in the underground market.The anonymity of the dark network brings great convenience to the communication between users of the dark network,but great difficulties to the police.In recent years,the deep neural network has been widely successful in various fields,and more and more researchers have begun to use the neural network to identify anonymous network text authors.In order to better align users in the dark web and find more different users with the same identity,we use the neural network method to identify and align users in the dark web.However,the existing methods focus mainly on the short text and are not good at dealing with the global and long sequence information.In this paper,we propose a self-attention mechanism to enhance the convolution operator and use long sequence information to strengthen the user representation,named DACN.DACN starts from the text content,and multiple account associations are carried out for anonymous dark web users to aggregate information from multiple anonymous accounts,proving mores clues for obtaining the users’true identity.Our recent analysis involves conducting a thorough assessment of two distinct dark web market forums,whereby we evaluate our methodology in comparison to the current state-of-the-art techniques.Experimental results show that our approach is remarkably effective,with a demonstrated average mean retrieval ranking (MRR) enhancement of 2.9% and 3.6%,as well as an improved Recall@10 of 2.3% and 3.0%.This evaluation offers robust evidence of the efficacy of our approach in dark web market forums.

Key words: text embedding, attention mechanism, convolutional networks, long sequence information

CLC Number: 

  • TP18