Paper Title

Bilateral Dependency Optimization: Defending Against Model-inversion Attacks

Authors

Xiong Peng, Feng Liu, Jingfeng Zhang, Long Lan, Junjie Ye, Tongliang Liu, Bo Han

Abstract

Using only a well-trained classifier, model-inversion (MI) attacks can recover the data used to train that classifier, leaking the privacy of the training data. To defend against MI attacks, previous work uses a unilateral dependency optimization strategy, i.e., minimizing the dependency between inputs (i.e., features) and outputs (i.e., labels) while training the classifier. However, such minimization conflicts with minimizing the supervised loss, which aims to maximize the dependency between inputs and outputs, causing an explicit trade-off between model robustness against MI attacks and model utility on classification tasks. In this paper, we aim to minimize the dependency between the latent representations and the inputs while maximizing the dependency between the latent representations and the outputs, a strategy we name bilateral dependency optimization (BiDO). In particular, we use the dependency constraints as a universally applicable regularizer in addition to the losses commonly used for deep neural networks (e.g., cross-entropy); the regularizer can be instantiated with appropriate dependency criteria according to the task. To verify the efficacy of our strategy, we propose two implementations of BiDO using two different dependency measures: BiDO with constrained covariance (BiDO-COCO) and BiDO with the Hilbert-Schmidt Independence Criterion (BiDO-HSIC). Experiments show that BiDO achieves state-of-the-art defense performance across a variety of datasets, classifiers, and MI attacks while suffering only a minor classification-accuracy drop compared to the well-trained classifier with no defense, which opens a novel path for defending against MI attacks.
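The BiDO-HSIC variant described above can be sketched numerically. Below is a minimal NumPy illustration of a biased empirical HSIC estimator and a BiDO-style penalty that discourages dependency between inputs `x` and latent representations `z` while encouraging dependency between `z` and labels `y`. The Gaussian kernel, the bandwidth `sigma`, and the weights `lam_x` / `lam_y` are illustrative assumptions, not the paper's actual settings.

```python
import numpy as np

def gaussian_kernel(X, sigma=1.0):
    # RBF kernel matrix from pairwise squared Euclidean distances.
    sq = np.sum(X ** 2, axis=1, keepdims=True)
    d2 = sq + sq.T - 2.0 * X @ X.T
    return np.exp(-d2 / (2.0 * sigma ** 2))

def hsic(X, Y, sigma=1.0):
    # Biased empirical HSIC estimator: tr(K H L H) / (n - 1)^2,
    # where H = I - (1/n) 11^T centers the kernel matrices.
    n = X.shape[0]
    K = gaussian_kernel(X, sigma)
    L = gaussian_kernel(Y, sigma)
    H = np.eye(n) - np.ones((n, n)) / n
    return np.trace(K @ H @ L @ H) / (n - 1) ** 2

def bido_hsic_penalty(x, z, y, lam_x=0.05, lam_y=0.5, sigma=1.0):
    # BiDO-style regularizer, added to the usual supervised loss:
    # penalize dep(x, z), reward dep(z, y). Weights are hypothetical.
    return lam_x * hsic(x, z, sigma) - lam_y * hsic(z, y, sigma)
```

In training, this penalty would be added to the cross-entropy loss for each monitored layer's representation; larger `lam_x` pushes the representations toward independence from the inputs, which is what frustrates model inversion.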
