论文标题

以观察为中心:重新思考用于鲁棒的多对象跟踪

Observation-Centric SORT: Rethinking SORT for Robust Multi-Object Tracking

论文作者

Cao, Jinkun, Pang, Jiangmiao, Weng, Xinshuo, Khirodkar, Rawal, Kitani, Kris

论文摘要

基于Kalman滤波器(KF)多对象跟踪(MOT)的方法是对象线性移动的假设。尽管该假设在非常短的闭塞时间内是可以接受的,但是长时间的运动线性估计值可能是高度不准确的。此外,当没有可用于更新Kalman滤波器参数的测量值时,标准惯例是信任后验更新的先验状态估计。这导致在闭塞期间积累错误。误差会导致实践中的显着运动方向差异。在这项工作中,我们表明,如果采取适当的护理来固定闭塞过程中累积的噪声,那么基本的卡尔曼过滤器仍然可以获得最新的跟踪性能。我们不仅依靠线性状态估计(即以估计为中心的方法),我们使用对象观测值(即,对象检测器的测量值)在遮挡周期内计算虚拟轨迹来固定在咬合期间滤波器参数的误差积累。这允许更多的时间步骤纠正在阻塞过程中累积的错误。我们命名我们的方法观察为中心(OC-SORT)。它仍然保持简单,在线和实时,但在阻塞和非线性运动过程中提高了鲁棒性。给定现场检测作为输入,OC-SORT在单个CPU上以700+ fps的速度运行。它在多个数据集上实现了最新的,包括Mot17,Mot20,Kitti,Head Tracking,尤其是Dancetrack,其中对象运动是高度非线性的。代码和型号可在\ url {https://github.com/noahcao/oc_sort}上找到。

Kalman filter (KF) based methods for multi-object tracking (MOT) make an assumption that objects move linearly. While this assumption is acceptable for very short periods of occlusion, linear estimates of motion for prolonged time can be highly inaccurate. Moreover, when there is no measurement available to update Kalman filter parameters, the standard convention is to trust the priori state estimations for posteriori update. This leads to the accumulation of errors during a period of occlusion. The error causes significant motion direction variance in practice. In this work, we show that a basic Kalman filter can still obtain state-of-the-art tracking performance if proper care is taken to fix the noise accumulated during occlusion. Instead of relying only on the linear state estimate (i.e., estimation-centric approach), we use object observations (i.e., the measurements by object detector) to compute a virtual trajectory over the occlusion period to fix the error accumulation of filter parameters during the occlusion period. This allows more time steps to correct errors accumulated during occlusion. We name our method Observation-Centric SORT (OC-SORT). It remains Simple, Online, and Real-Time but improves robustness during occlusion and non-linear motion. Given off-the-shelf detections as input, OC-SORT runs at 700+ FPS on a single CPU. It achieves state-of-the-art on multiple datasets, including MOT17, MOT20, KITTI, head tracking, and especially DanceTrack where the object motion is highly non-linear. The code and models are available at \url{https://github.com/noahcao/OC_SORT}.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源