DMCA：使用注意力和沟通的密集多代理导航

论文标题

DMCA：使用注意力和沟通的密集多代理导航

DMCA: Dense Multi-agent Navigation using Attention and Communication

论文作者

Arul, Senthil Hariharan, Bedi, Amrit Singh, Manocha, Dinesh

论文摘要

在分散的多机器人导航中，确保有限的环境意识的安全有效运动仍然是一个挑战。尽管机器人传统上根据本地观察结果进行导航，但这种方法在复杂的环境中摇摆不定。一个可能的解决方案是通过主体间的沟通来增强对世界的理解，但是仅信息广播的效率不足。在这项工作中，我们通过同时学习分散的多机器人碰撞避免和选择性的代理间交流来解决这个问题。我们使用多头自我发场机制，该机制将相邻机器人的可观察信息编码为简洁而固定的观察载体，从而处理不同数量的邻居。我们的方法着重于通过选择性通信改善导航性能。我们将通信选择作为链接预测问题，在该问题中，网络决定基于可观察的状态信息与特定邻居建立通信链接的必要性。传达的信息增强了邻居的观察，并有助于选择适当的导航计划。通过训练网络端到端，我们同时学习观察编码，通信选择和导航组件的最佳权重。我们通过在密集和充满挑战的环境中实现安全有效的导航来展示我们的方法的好处。对各种基于学习的基线的比较评估证明了我们的出色导航性能，从而在复杂的评估方案中取得了令人印象深刻的成功率。

In decentralized multi-robot navigation, ensuring safe and efficient movement with limited environmental awareness remains a challenge. While robots traditionally navigate based on local observations, this approach falters in complex environments. A possible solution is to enhance understanding of the world through inter-agent communication, but mere information broadcasting falls short in efficiency. In this work, we address this problem by simultaneously learning decentralized multi-robot collision avoidance and selective inter-agent communication. We use a multi-head self-attention mechanism that encodes observable information from neighboring robots into a concise and fixed-length observation vector, thereby handling varying numbers of neighbors. Our method focuses on improving navigation performance through selective communication. We cast the communication selection as a link prediction problem, where the network determines the necessity of establishing a communication link with a specific neighbor based on the observable state information. The communicated information enhances the neighbor's observation and aids in selecting an appropriate navigation plan. By training the network end-to-end, we concurrently learn the optimal weights for the observation encoder, communication selection, and navigation components. We showcase the benefits of our approach by achieving safe and efficient navigation among multiple robots, even in dense and challenging environments. Comparative evaluations against various learning-based and model-based baselines demonstrate our superior navigation performance, resulting in an impressive improvement of up to 24% in success rate within complex evaluation scenarios.

下载PDF全文

下载文献需遵守相关版权规定

论文标题