在合作任务中基准测试多代理深入学习算法

论文标题

在合作任务中基准测试多代理深入学习算法

Benchmarking Multi-Agent Deep Reinforcement Learning Algorithms in Cooperative Tasks

论文作者

Papoudakis, Georgios, Christianos, Filippos, Schäfer, Lukas, Albrecht, Stefano V.

论文摘要

多代理深入强化学习（MARL）遭受缺乏常用的评估任务和标准，从而使方法之间的比较变得困难。在这项工作中，我们在各种合作的多代理学习任务中提供了三种不同类别的MARL算法（独立学习，集中的多代理政策梯度，价值分解）的系统评估和比较。我们的实验是跨不同学习任务的算法预期性能的参考，我们提供了有关不同学习方法有效性的见解。我们开放源ePymarl，它扩展了Pymarl代码库以包含其他算法，并允许灵活地配置算法实现详细信息，例如参数共享。最后，我们开放了两个用于多机构研究的环境，这些环境侧重于稀疏奖励下的协调。

Multi-agent deep reinforcement learning (MARL) suffers from a lack of commonly-used evaluation tasks and criteria, making comparisons between approaches difficult. In this work, we provide a systematic evaluation and comparison of three different classes of MARL algorithms (independent learning, centralised multi-agent policy gradient, value decomposition) in a diverse range of cooperative multi-agent learning tasks. Our experiments serve as a reference for the expected performance of algorithms across different learning tasks, and we provide insights regarding the effectiveness of different learning approaches. We open-source EPyMARL, which extends the PyMARL codebase to include additional algorithms and allow for flexible configuration of algorithm implementation details such as parameter sharing. Finally, we open-source two environments for multi-agent research which focus on coordination under sparse rewards.

下载PDF全文

下载文献需遵守相关版权规定

论文标题