通过深入的强化学习玩完整的MOBA游戏

论文标题

通过深入的强化学习玩完整的MOBA游戏

Towards Playing Full MOBA Games with Deep Reinforcement Learning

论文作者

Ye, Deheng, Chen, Guibin, Zhang, Wen, Chen, Sheng, Yuan, Bo, Liu, Bo, Chen, Jia, Liu, Zhao, Qiu, Fuhao, Yu, Hongsheng, Yin, Yinyuting, Shi, Bei, Wang, Liang, Shi, Tengfei, Fu, Qiang, Yang, Wei, Huang, Lanxiao, Liu, Wei

论文摘要

MOBA Games，例如国王荣誉，英雄联盟和Dota 2，对AI系统（例如多代理，巨大的国家行动空间，复杂的行动控制等）构成了巨大的挑战。开发AI用于玩MOBA Games的AI引起了很多关注。但是，现有工作在处理代理组合（即阵容的爆炸式爆炸）上，在扩展英雄池时，如果Openai的Dota AI将游戏限制在只有17位英雄的池中时，就无法处理阵容的原始游戏复杂性。结果，无限制的完整MOBA游戏远非被任何现有的AI系统掌握。在本文中，我们提出了一个MOBA AI学习范式，该范式从方法论上可以通过深入的强化学习来玩完整的MOBA游戏。具体而言，我们开发了新颖和现有的学习技术的结合，包括课程自我播放学习，政策蒸馏，非政策改编，多头价值估计以及蒙特 - 卡洛树搜索，培训和扮演大量英雄，并熟练地处理可伸缩性问题。经受了一款受欢迎的MOBA游戏King的荣誉测试，我们展示了如何构建可以击败顶级电子竞技运动员的超人AI特工。文献中首次对MOBA AI代理的大规模性能测试证明了我们AI的优势。

MOBA games, e.g., Honor of Kings, League of Legends, and Dota 2, pose grand challenges to AI systems such as multi-agent, enormous state-action space, complex action control, etc. Developing AI for playing MOBA games has raised much attention accordingly. However, existing work falls short in handling the raw game complexity caused by the explosion of agent combinations, i.e., lineups, when expanding the hero pool in case that OpenAI's Dota AI limits the play to a pool of only 17 heroes. As a result, full MOBA games without restrictions are far from being mastered by any existing AI system. In this paper, we propose a MOBA AI learning paradigm that methodologically enables playing full MOBA games with deep reinforcement learning. Specifically, we develop a combination of novel and existing learning techniques, including curriculum self-play learning, policy distillation, off-policy adaption, multi-head value estimation, and Monte-Carlo tree-search, in training and playing a large pool of heroes, meanwhile addressing the scalability issue skillfully. Tested on Honor of Kings, a popular MOBA game, we show how to build superhuman AI agents that can defeat top esports players. The superiority of our AI is demonstrated by the first large-scale performance test of MOBA AI agent in the literature.

下载PDF全文

下载文献需遵守相关版权规定

论文标题