有效的RGB-D语义分段用于室内场景分析

论文标题

有效的RGB-D语义分段用于室内场景分析

Efficient RGB-D Semantic Segmentation for Indoor Scene Analysis

论文作者

Seichter, Daniel, Köhler, Mona, Lewandowski, Benjamin, Wengefeld, Tim, Gross, Horst-Michael

论文摘要

彻底分析场景对于在不同环境中起作用的移动机器人至关重要。语义细分可以增强随后的各种任务，例如（语义辅助）人的感知，（语义）自由空间检测，（语义）映射和（语义）导航。在本文中，我们提出了一种有效且可靠的RGB-D分割方法，该方法可以使用NVIDIA Tensorrt高度优化，因此非常适合作为复杂系统的常见初始处理步骤，以在移动机器人上进行场景分析。我们表明，RGB-D分割优于处理RGB图像，并且如果仔细设计网络体系结构，它仍然可以实时执行。我们在常见的室内数据集NYUV2和SUNRGB-D上评估了我们提出的有效场景分析网络（ESANET），并表明我们达到了最先进的性能，同时可以更快地推断推断。此外，我们对户外数据集CityScapes的评估表明，我们的方法也适用于其他应用领域。最后，我们不仅在室内应用方案之一中展示了定性结果。

Analyzing scenes thoroughly is crucial for mobile robots acting in different environments. Semantic segmentation can enhance various subsequent tasks, such as (semantically assisted) person perception, (semantic) free space detection, (semantic) mapping, and (semantic) navigation. In this paper, we propose an efficient and robust RGB-D segmentation approach that can be optimized to a high degree using NVIDIA TensorRT and, thus, is well suited as a common initial processing step in a complex system for scene analysis on mobile robots. We show that RGB-D segmentation is superior to processing RGB images solely and that it can still be performed in real time if the network architecture is carefully designed. We evaluate our proposed Efficient Scene Analysis Network (ESANet) on the common indoor datasets NYUv2 and SUNRGB-D and show that we reach state-of-the-art performance while enabling faster inference. Furthermore, our evaluation on the outdoor dataset Cityscapes shows that our approach is suitable for other areas of application as well. Finally, instead of presenting benchmark results only, we also show qualitative results in one of our indoor application scenarios.

下载PDF全文

下载文献需遵守相关版权规定

论文标题