马尔可夫链蒙特卡洛的机器学习：学习需要忘记什么

论文标题

马尔可夫链蒙特卡洛的机器学习：学习需要忘记什么

Markov Chain Monte Carlo-Based Machine Unlearning: Unlearning What Needs to be Forgotten

论文作者

Nguyen, Quoc Phong, Oikawa, Ryutaro, Divakaran, Dinil Mon, Chan, Mun Choon, Low, Bryan Kian Hsiang

论文摘要

随着机器学习（ML）模型在许多实际应用中变得越来越流行，需要解决模型维护需要解决的实际挑战。一个这样的挑战是“撤消”用于训练模型的数据集的特定子集的效果。该特定子集可能包含攻击者注入的恶意或对抗数据，这会影响模型性能。另一个原因可能是服务提供商需要删除与特定用户有关的数据以尊重用户的隐私。在这两种情况下，问题都是从训练有素的模型中“删除”培训数据的特定子集，而不会产生从头开始验证整个模型的昂贵程序。为了实现这一目标，本文介绍了马尔可夫链的基于蒙特卡洛的机器（MCU）算法。 MCU有助于从培训数据集子集中有效，有效地学习训练有素的模型。此外，我们证明，使用MCU，我们能够解释培训数据集子集对模型预测的影响。因此，MCU可用于检查数据子集以识别要删除的对抗数据。同样，MCU可以用来从训练有素的ML模型中删除用户个人数据的血统，从而维护用户的“被遗忘的权利”。我们经验评估了我们提出的MCU算法在现实世界网络钓鱼和糖尿病数据集上的性能。结果表明，MCU可以通过有效删除培训数据集的效果并胜过利用其余数据集的现有算法来实现理想的性能。

As the use of machine learning (ML) models is becoming increasingly popular in many real-world applications, there are practical challenges that need to be addressed for model maintenance. One such challenge is to 'undo' the effect of a specific subset of dataset used for training a model. This specific subset may contain malicious or adversarial data injected by an attacker, which affects the model performance. Another reason may be the need for a service provider to remove data pertaining to a specific user to respect the user's privacy. In both cases, the problem is to 'unlearn' a specific subset of the training data from a trained model without incurring the costly procedure of retraining the whole model from scratch. Towards this goal, this paper presents a Markov chain Monte Carlo-based machine unlearning (MCU) algorithm. MCU helps to effectively and efficiently unlearn a trained model from subsets of training dataset. Furthermore, we show that with MCU, we are able to explain the effect of a subset of a training dataset on the model prediction. Thus, MCU is useful for examining subsets of data to identify the adversarial data to be removed. Similarly, MCU can be used to erase the lineage of a user's personal data from trained ML models, thus upholding a user's "right to be forgotten". We empirically evaluate the performance of our proposed MCU algorithm on real-world phishing and diabetes datasets. Results show that MCU can achieve a desirable performance by efficiently removing the effect of a subset of training dataset and outperform an existing algorithm that utilizes the remaining dataset.

下载PDF全文

下载文献需遵守相关版权规定

论文标题