事后见解为20/20：利用过去的遍历来帮助3D感知

论文标题

事后见解为20/20：利用过去的遍历来帮助3D感知

Hindsight is 20/20: Leveraging Past Traversals to Aid 3D Perception

论文作者

You, Yurong, Luo, Katie Z, Chen, Xiangyu, Chen, Junan, Chao, Wei-Lun, Sun, Wen, Hariharan, Bharath, Campbell, Mark, Weinberger, Kilian Q.

论文摘要

自动驾驶汽车必须准确地检测汽车，行人和其他交通参与者才能安全运行。小型，遥远或高度遮挡的物体特别具有挑战性，因为LiDar Point云中的信息有限。为了应对这一挑战，我们利用过去的有价值的信息：尤其是在同一场景的过去遍历中收集的数据。我们认为，这些过去通常被丢弃的过去数据提供了丰富的上下文信息，以消除上述具有挑战性的案例。为此，我们提出了一个新颖的端到端可训练的事后视觉框架，以从过去的遍历中提取这些上下文信息，并将其存储在易于播种的数据结构中，然后可以利用该结构来帮助同一场景的未来3D对象检测。我们表明，该框架与大多数现代3D检测体系结构兼容，并且可以显着提高其在多个自动驾驶数据集上的平均精度，而在具有挑战性的情况下最著名的是超过300％。

Self-driving cars must detect vehicles, pedestrians, and other traffic participants accurately to operate safely. Small, far-away, or highly occluded objects are particularly challenging because there is limited information in the LiDAR point clouds for detecting them. To address this challenge, we leverage valuable information from the past: in particular, data collected in past traversals of the same scene. We posit that these past data, which are typically discarded, provide rich contextual information for disambiguating the above-mentioned challenging cases. To this end, we propose a novel, end-to-end trainable Hindsight framework to extract this contextual information from past traversals and store it in an easy-to-query data structure, which can then be leveraged to aid future 3D object detection of the same scene. We show that this framework is compatible with most modern 3D detection architectures and can substantially improve their average precision on multiple autonomous driving datasets, most notably by more than 300% on the challenging cases.

下载PDF全文

下载文献需遵守相关版权规定

论文标题