使用NSGA-II优化联邦学习中的沟通开销

论文标题

使用NSGA-II优化联邦学习中的沟通开销

Optimising Communication Overhead in Federated Learning Using NSGA-II

论文作者

Morell, José Ángel, Dahi, Zakaria Abdelmoiz, Chicano, Francisco, Luque, Gabriel, Alba, Enrique

论文摘要

联合学习是一种培训范式，根据该范式，使用在边缘设备上运行并确保数据隐私的本地模型对基于服务器的模型进行了合作培训。这些设备交换了引起大量通信负载的信息，这危害了功能效率。减少这种间接费用的困难在于实现这一目标而不降低模型的效率（矛盾的关系）。为此，许多作品研究了训练前/中/后培训模型和通信回合的压缩，尽管它们共同促进了通信过载。我们的工作旨在通过（i）将其建模为多目标问题，以及（ii）应用多目标优化算法（NSGA-II）来优化联合学习中的通信开销。据作者所知，这是\ texttt {（i）}探讨进化计算可以为解决此类问题带来的加载项，而\ texttt {（ii）}认为神经元和设备的功能。我们通过模拟使用4个奴隶的服务器/客户端体系结构来执行实验。我们分别研究了卷积和完全连接的神经网络，分别为12和3层，分别为887,530和33,400个重量。我们对包含70,000张图像的\ texttt {mnist}数据集进行了验证。实验表明，我们的建议可以将通信减少99％，并保持与使用100％通信的FedAvg算法获得的准确性相等。

Federated learning is a training paradigm according to which a server-based model is cooperatively trained using local models running on edge devices and ensuring data privacy. These devices exchange information that induces a substantial communication load, which jeopardises the functioning efficiency. The difficulty of reducing this overhead stands in achieving this without decreasing the model's efficiency (contradictory relation). To do so, many works investigated the compression of the pre/mid/post-trained models and the communication rounds, separately, although they jointly contribute to the communication overload. Our work aims at optimising communication overhead in federated learning by (I) modelling it as a multi-objective problem and (II) applying a multi-objective optimization algorithm (NSGA-II) to solve it. To the best of the author's knowledge, this is the first work that \texttt{(I)} explores the add-in that evolutionary computation could bring for solving such a problem, and \texttt{(II)} considers both the neuron and devices features together. We perform the experimentation by simulating a server/client architecture with 4 slaves. We investigate both convolutional and fully-connected neural networks with 12 and 3 layers, 887,530 and 33,400 weights, respectively. We conducted the validation on the \texttt{MNIST} dataset containing 70,000 images. The experiments have shown that our proposal could reduce communication by 99% and maintain an accuracy equal to the one obtained by the FedAvg Algorithm that uses 100% of communications.

下载PDF全文

下载文献需遵守相关版权规定

论文标题