Journal of Xidian University ›› 2023, Vol. 50 ›› Issue (6): 219-236.doi: 10.19665/j.issn1001-2400.20230207

• Cyberspace Security • Previous Articles     Next Articles

Histogram publishing algorithm for degree distribution via shuffled differential privacy

DING Hongfa1,2,3(),FU Peiwang1(),PENG Changgen2(),LONG Shigong2(),WU Ningbo1()   

  1. 1. College of Information,Guizhou Key Laboratory of Big Data Statistical Analysis,Guizhou University of Finance and Economics,Guiyang 550025,China
    2. State Key Laboratory of Pubic Big Data,Guizhou University,Guiyang 550025,China
    3. Guian Science and Technology Industry Development Co.,Ltd.,Guiyang 550025,China
  • Received:2022-11-01 Online:2023-12-20 Published:2024-01-22

Abstract:

At present,the existing histogram publishing algorithms based on centralized or local differential privacy for graph data degree distribution can neither balance the privacy and utility of published data,nor preserve the identity privacy of end users.To solve this problem,a histogram publishing algorithm for degree distribution via shuffled differential privacy(SDP) is proposed under the framework of Encode-Shuffle-Analyze.First,a privacy preserving framework for histogram publishing of degree distribution is designed based on shuffled differential privacy.In this framework,the noisy impact that the encoder brings to distributed users is reduced by employing interactive user grouping,the shuffler and the square wave noise mechanism,while adding noise via local differential privacy.The noisy histogram of degree distribution is reconciled via the maximum likelihood estimation at the analyzer end,thus improving the utility of published data.Second,specific algorithms are proposed for concreting distributed user grouping,adding shuffled differential privacy noise and reconciling the noisy data,respectively.Furthermore,it is proved that these algorithms meet the requirement of(ε,σ)-SDP.Experiments and comparisons illustrate that the proposed algorithms can preserve the privacy of distributed users,and that the data utility is improved more than 26% with metrics in terms of L1 distance,H distance and MSE in comparison with the existing related algorithms.The proposed algorithms also perform with a low overhead and stable data utility,and are suitable for publishing and sharing the histogram of degree distribution for different scales of graph data.

Key words: privacy-preserving techniques, graph structures, shuffled differential privacy, degree distribution histogram publishing, data utility

CLC Number: 

  • TN918