论文标题
自动驾驶的计算机立体声愿景
Computer Stereo Vision for Autonomous Driving
论文作者
论文摘要
作为自主系统的重要组成部分,自动驾驶汽车感知取得了很大的飞跃,并且在并行计算体系结构方面取得了最新的进步。通过使用微型但全功能嵌入式超级计算机,计算机立体声视觉已普遍应用于自动驾驶汽车中,以进行深度感知。计算机立体声视觉的两个关键方面是速度和准确性。它们都是理想的,但属性矛盾,因为具有更好差异精度的算法通常具有更高的计算复杂性。因此,为资源有限的硬件开发计算机立体声视觉算法的主要目的是改善速度和准确性之间的权衡。在本章中,我们介绍了自动驾驶汽车系统计算机立体声愿景的硬件和软件方面。然后,我们讨论四个自主汽车感知任务,包括1)视觉特征检测,描述和匹配,2)3D信息采集,3)对象检测/识别和4)语义图像分割。然后详细介绍了多线程CPU和GPU体系结构的计算机立体视觉和并行计算的原理。
As an important component of autonomous systems, autonomous car perception has had a big leap with recent advances in parallel computing architectures. With the use of tiny but full-feature embedded supercomputers, computer stereo vision has been prevalently applied in autonomous cars for depth perception. The two key aspects of computer stereo vision are speed and accuracy. They are both desirable but conflicting properties, as the algorithms with better disparity accuracy usually have higher computational complexity. Therefore, the main aim of developing a computer stereo vision algorithm for resource-limited hardware is to improve the trade-off between speed and accuracy. In this chapter, we introduce both the hardware and software aspects of computer stereo vision for autonomous car systems. Then, we discuss four autonomous car perception tasks, including 1) visual feature detection, description and matching, 2) 3D information acquisition, 3) object detection/recognition and 4) semantic image segmentation. The principles of computer stereo vision and parallel computing on multi-threading CPU and GPU architectures are then detailed.