论文标题

在合作任务中基准测试多代理深入学习算法

Benchmarking Multi-Agent Deep Reinforcement Learning Algorithms in Cooperative Tasks

论文作者

Papoudakis, Georgios, Christianos, Filippos, Schäfer, Lukas, Albrecht, Stefano V.

论文摘要

多代理深入强化学习(MARL)遭受缺乏常用的评估任务和标准,从而使方法之间的比较变得困难。在这项工作中,我们在各种合作的多代理学习任务中提供了三种不同类别的MARL算法(独立学习,集中的多代理政策梯度,价值分解)的系统评估和比较。我们的实验是跨不同学习任务的算法预期性能的参考,我们提供了有关不同学习方法有效性的见解。我们开放源ePymarl,它扩展了Pymarl代码库以包含其他算法,并允许灵活地配置算法实现详细信息,例如参数共享。最后,我们开放了两个用于多机构研究的环境,这些环境侧重于稀疏奖励下的协调。

Multi-agent deep reinforcement learning (MARL) suffers from a lack of commonly-used evaluation tasks and criteria, making comparisons between approaches difficult. In this work, we provide a systematic evaluation and comparison of three different classes of MARL algorithms (independent learning, centralised multi-agent policy gradient, value decomposition) in a diverse range of cooperative multi-agent learning tasks. Our experiments serve as a reference for the expected performance of algorithms across different learning tasks, and we provide insights regarding the effectiveness of different learning approaches. We open-source EPyMARL, which extends the PyMARL codebase to include additional algorithms and allow for flexible configuration of algorithm implementation details such as parameter sharing. Finally, we open-source two environments for multi-agent research which focus on coordination under sparse rewards.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源