Journal of Xidian University ›› 2019, Vol. 46 ›› Issue (2): 83-88.doi: 10.19665/j.issn1001-2400.2019.02.014

Previous Articles     Next Articles

User interesting mining method in the heterogeneous social network

TU Shouzhong1,YAN Zhou2,WEI Lingwei2,ZHU Xiaoyan1   

  1. 1. School of Computer Science and Technology, Tsinghua Univ., Beijing 100084, China
    2. Institute of Information Engineering, Chinese Academy of Sciences, Beijing 100093, China
  • Received:2018-08-21 Online:2019-04-20 Published:2019-04-20

Abstract:

Due to great advances in the mobile Internet, the Social Network Service (SNS) has become an indispensable service. The development of current mainstream social media shows a trend that social service and information service are combined and interwork to provide a better experience. Meanwhile there is increasing polarization among users. The heterogeneous features, say, the combination of information content and sociality as well as the polarization of user roles, present challenges to traditional research in social media. Some studies of social media are mainly based on the equal position among nodes or similar relations. If the algorithms brought about by these studies are applied directly to the networks where the users are highly polarized, the results may be distorted or even be quite different from the fact. A new model for interests mining based on social relations is proposed in this paper. Dealing with the polarization in social media, we incorporate matrix factorization and the label propagation algorithm to treat information disseminators and average users, respectively, in order to discover interests of average users in a large-scale heterogeneous network. The validness of the model and the performance and advantages of the algorithm are tested and verified in Zhihu datasets. Experiments show that the maximum increase in the recall of the proposed method, compared with the baseline, is 42%.

Key words: heterogeneous network, social networks, interest model, non-negative matrix factorization, label Propagation

CLC Number: 

  • TP391.1