点：3D对象跟踪的时空建模

论文标题

点：3D对象跟踪的时空建模

SpOT: Spatiotemporal Modeling for 3D Object Tracking

论文作者

Stearns, Colton, Rempe, Davis, Li, Jie, Ambrus, Rares, Zakharov, Sergey, Guizilini, Vitor, Yang, Yanchao, Guibas, Leonidas J

论文摘要

3D多目标跟踪旨在随着时间的推移唯一，始终如一地识别所有移动实体。尽管在此设置中提供了丰富的时空信息，但当前的3D跟踪方法主要依赖于抽象的信息和有限的历史记录，例如单帧对象边界框。在这项工作中，我们开发了交通场景的整体表示，该场景利用了现场演员的空间和时间信息。具体而言，我们通过将跟踪对象表示为时空点和边界框的序列来重新将跟踪作为时空问题，并在悠久的时间历史上重新制定。在每个时间戳，我们通过对对象历史记录的完整顺序进行的细化来改善跟踪对象的位置和运动估计。通过共同考虑时间和空间，我们的代表自然地编码了基本的物理先验，例如对象持久性和整个时间的一致性。我们的时空跟踪框架在Waymo和Nuscenes基准测试中实现了最先进的性能。

3D multi-object tracking aims to uniquely and consistently identify all mobile entities through time. Despite the rich spatiotemporal information available in this setting, current 3D tracking methods primarily rely on abstracted information and limited history, e.g. single-frame object bounding boxes. In this work, we develop a holistic representation of traffic scenes that leverages both spatial and temporal information of the actors in the scene. Specifically, we reformulate tracking as a spatiotemporal problem by representing tracked objects as sequences of time-stamped points and bounding boxes over a long temporal history. At each timestamp, we improve the location and motion estimates of our tracked objects through learned refinement over the full sequence of object history. By considering time and space jointly, our representation naturally encodes fundamental physical priors such as object permanence and consistency across time. Our spatiotemporal tracking framework achieves state-of-the-art performance on the Waymo and nuScenes benchmarks.

下载PDF全文

下载文献需遵守相关版权规定

论文标题