与线性梯度开销的强大协作学习

论文标题

与线性梯度开销的强大协作学习

Robust Collaborative Learning with Linear Gradient Overhead

论文作者

Farhadkhani, Sadegh, Guerraoui, Rachid, Gupta, Nirupam, Hoang, Lê Nguyên, Pinot, Rafael, Stephan, John

论文摘要

协作学习算法（例如分布式SGD（或D-SGD））容易出现故障机器，由于软件或硬件错误，中毒数据或恶意行为，可能会偏离其处方算法。尽管已经提出了许多解决方案来增强D-SGD对此类机器的鲁棒性，但先前的工作要么诉诸强大的假设（可信服务的服务器，均匀数据，特定的噪声模型），要么施加梯度计算成本，该梯度计算成本比D-SGD高的几个数量级。我们提出了MONNA，这是一种新算法，在标准假设下证明（a）非常健壮，并且（b）具有梯度计算开销，该开销在有故障机器的分数中是线性的，该计算机的分数是紧密的。本质上，MONNA分别使用Polyak的局部梯度动量进行局部更新和最近的邻平均（NNA）进行全局混合。尽管MONNA相当简单，但其分析却更具挑战性，并依赖于可能具有独立兴趣的两个关键要素。具体而言，我们介绍了$（α，λ）$ - 还原的混合标准，以分析非故障机器的非线性混合，并提出一种控制动量和模型漂移之间张力的方法。我们通过对图像分类的实验来验证我们的理论，并在https://github.com/lpd-epfl/robust-collaborative-learning上提供代码。

Collaborative learning algorithms, such as distributed SGD (or D-SGD), are prone to faulty machines that may deviate from their prescribed algorithm because of software or hardware bugs, poisoned data or malicious behaviors. While many solutions have been proposed to enhance the robustness of D-SGD to such machines, previous works either resort to strong assumptions (trusted server, homogeneous data, specific noise model) or impose a gradient computational cost that is several orders of magnitude higher than that of D-SGD. We present MoNNA, a new algorithm that (a) is provably robust under standard assumptions and (b) has a gradient computation overhead that is linear in the fraction of faulty machines, which is conjectured to be tight. Essentially, MoNNA uses Polyak's momentum of local gradients for local updates and nearest-neighbor averaging (NNA) for global mixing, respectively. While MoNNA is rather simple to implement, its analysis has been more challenging and relies on two key elements that may be of independent interest. Specifically, we introduce the mixing criterion of $(α, λ)$-reduction to analyze the non-linear mixing of non-faulty machines, and present a way to control the tension between the momentum and the model drifts. We validate our theory by experiments on image classification and make our code available at https://github.com/LPD-EPFL/robust-collaborative-learning.

下载PDF全文

下载文献需遵守相关版权规定

论文标题