论文标题

Wayformer:通过简单有效的注意网络进行运动预测

Wayformer: Motion Forecasting via Simple & Efficient Attention Networks

论文作者

Nayakanti, Nigamaa, Al-Rfou, Rami, Zhou, Aurick, Goel, Kratarth, Refaat, Khaled S., Sapp, Benjamin

论文摘要

自动驾驶的运动预测是一项艰巨的任务,因为复杂的驾驶场景导致静态和动态输入的异质组合。这是一个开放的问题,如何最好地代表和融合有关道路几何,车道连接,交通信号灯状态以及动态代理集的历史及其相互作用的历史。为了模拟这一不同的输入功能集,许多提出的方法旨在设计具有多种模态模块的同样复杂系统。这会导致难以以严格的方式进行扩展,扩展或调整的系统来取消质量和效率。在本文中,我们介绍了Wayformer,这是一个基于注意力的运动架构,用于运动预测,简单而均匀。 Wayformer提供了一个紧凑的模型描述,该描述由基于注意力的场景编码和解码器组成。在场景编码器中,我们研究了输入方式的早期,晚和等级融合的选择。对于每种融合类型,我们通过分解的注意力或潜在的查询关注来探索策略来折衷效率和质量。我们表明,尽管早期融合的构建简单,但不仅是情感不可知论,而且在Waymo Open MotionDataSet(WOMD)和Argoverse排行榜上都取得了最新的结果,证明了我们设计理念的有效性

Motion forecasting for autonomous driving is a challenging task because complex driving scenarios result in a heterogeneous mix of static and dynamic inputs. It is an open problem how best to represent and fuse information about road geometry, lane connectivity, time-varying traffic light state, and history of a dynamic set of agents and their interactions into an effective encoding. To model this diverse set of input features, many approaches proposed to design an equally complex system with a diverse set of modality specific modules. This results in systems that are difficult to scale, extend, or tune in rigorous ways to trade off quality and efficiency. In this paper, we present Wayformer, a family of attention based architectures for motion forecasting that are simple and homogeneous. Wayformer offers a compact model description consisting of an attention based scene encoder and a decoder. In the scene encoder we study the choice of early, late and hierarchical fusion of the input modalities. For each fusion type we explore strategies to tradeoff efficiency and quality via factorized attention or latent query attention. We show that early fusion, despite its simplicity of construction, is not only modality agnostic but also achieves state-of-the-art results on both Waymo Open MotionDataset (WOMD) and Argoverse leaderboards, demonstrating the effectiveness of our design philosophy

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源