论文标题
RDP-GAN:基于Rényi-Differential的生成对抗网络
RDP-GAN: A Rényi-Differential Privacy based Generative Adversarial Network
论文作者
论文摘要
由于其令人印象深刻的生成具有高隐私保护的现实样本的能力,最近引起了越来越多的关注。如果没有直接与培训示例进行交互,则可以完全使用生成模型来估计原始数据集的基础分布,而判别模型可以通过将标签值与培训示例进行比较来检查生成的样品的质量。但是,当Gans应用于敏感或私人培训示例(例如医疗或财务记录)时,仍然有可能泄露个人的敏感和私人信息。为了减轻此信息泄漏并构建私人gan,在这项工作中,我们提出了一个rényi-diffitiental私人gan(RDP-GAN),该私人私人gan(RDP-GAN)通过仔细添加训练过程中损失功能的价值的随机声音,在GAN中实现差异隐私(DP)。此外,我们得出了在亚采样方法和累积迭代下总隐私损失的分析结果,这表明了其对隐私预算分配的有效性。此外,为了减轻注入噪声带来的负面影响,我们通过添加自适应噪声调谐步骤来增强所提出的算法,这将根据测试精度改变添加噪声的体积。通过广泛的实验结果,我们验证了所提出的算法可以达到更好的隐私水平,同时与基于训练梯度的噪声扰动基于基准的DP-GAN方案相比,在产生高质量的样品中。
Generative adversarial network (GAN) has attracted increasing attention recently owing to its impressive ability to generate realistic samples with high privacy protection. Without directly interactive with training examples, the generative model can be fully used to estimate the underlying distribution of an original dataset while the discriminative model can examine the quality of the generated samples by comparing the label values with the training examples. However, when GANs are applied on sensitive or private training examples, such as medical or financial records, it is still probable to divulge individuals' sensitive and private information. To mitigate this information leakage and construct a private GAN, in this work we propose a Rényi-differentially private-GAN (RDP-GAN), which achieves differential privacy (DP) in a GAN by carefully adding random noises on the value of the loss function during training. Moreover, we derive the analytical results of the total privacy loss under the subsampling method and cumulated iterations, which show its effectiveness on the privacy budget allocation. In addition, in order to mitigate the negative impact brought by the injecting noise, we enhance the proposed algorithm by adding an adaptive noise tuning step, which will change the volume of added noise according to the testing accuracy. Through extensive experimental results, we verify that the proposed algorithm can achieve a better privacy level while producing high-quality samples compared with a benchmark DP-GAN scheme based on noise perturbation on training gradients.