论文标题

广义嵌套推出策略改编

Generalized Nested Rollout Policy Adaptation

论文作者

Cazenave, Tristan

论文摘要

嵌套推出策略适应(NRPA)是单播放器游戏的蒙特卡洛搜索算法。在本文中,我们建议以温度和偏见概括NRPA,并从理论上分析算法。广义算法称为GNRPA。实验表明,对于不同的应用程序域的NRPA,它在NRPA上有所改善:SameGame和Time Windows的旅行推销员问题。

Nested Rollout Policy Adaptation (NRPA) is a Monte Carlo search algorithm for single player games. In this paper we propose to generalize NRPA with a temperature and a bias and to analyze theoretically the algorithms. The generalized algorithm is named GNRPA. Experiments show it improves on NRPA for different application domains: SameGame and the Traveling Salesman Problem with Time Windows.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源