论文标题

私人K-均值聚类,并保证融合

Differentially Private k-Means Clustering with Guaranteed Convergence

论文作者

Lu, Zhigang, Shen, Hong

论文摘要

迭代聚类算法有助于我们学习数据背后的见解。不幸的是,这可能会使对手可以推断出具有一些背景知识的个人的隐私。在最坏的情况下,对手知道任意迭代的质心和n个项目中N-1的信息。为了保护个人隐私免受这种推理攻击,在交互式设置中广泛研究了迭代聚类算法的差异隐私(DP)。但是,现有的交互式差异性私有聚类算法遇到了一个非共流问题,即,如果没有预定义的迭代,这些算法可能不会终止。这个问题严重影响了差异私有算法的聚类质量和效率。为了解决这个问题,在本文中,我们在交互式设置中提出了一个新颖的私有聚类框架,该框架控制着质心在迭代上的运动方向,以确保通过在选定区域中注入DP噪声来确保收敛。我们证明,在预期的情况下,我们的框架下的算法最多收敛于劳埃德算法的迭代。我们对现实世界数据集进行实验评估,以表明我们的算法优于具有保证的收敛性和更好的聚类质量的交互式差异性私有聚类算法的最先进,以满足相同的DP要求。

Iterative clustering algorithms help us to learn the insights behind the data. Unfortunately, this may allow adversaries to infer the privacy of individuals with some background knowledge. In the worst case, the adversaries know the centroids of an arbitrary iteration and the information of n-1 out of n items. To protect individual privacy against such an inference attack, preserving differential privacy (DP) for the iterative clustering algorithms has been extensively studied in the interactive settings. However, existing interactive differentially private clustering algorithms suffer from a non-convergence problem, i.e., these algorithms may not terminate without a predefined number of iterations. This problem severely impacts the clustering quality and the efficiency of a differentially private algorithm. To resolve this problem, in this paper, we propose a novel differentially private clustering framework in the interactive settings which controls the orientation of the movement of the centroids over the iterations to ensure the convergence by injecting DP noise in a selected area. We prove that, in the expected case, algorithm under our framework converges in at most twice the iterations of Lloyd's algorithm. We perform experimental evaluations on real-world datasets to show that our algorithm outperforms the state-of-the-art of the interactive differentially private clustering algorithms with guaranteed convergence and better clustering quality to meet the same DP requirement.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源