使用综合和真实数据的全景进行分割

论文标题

使用综合和真实数据的全景进行分割

Panoptic Segmentation using Synthetic and Real Data

论文作者

Quattrocchi, Camillo, Di Mauro, Daniele, Furnari, Antonino, Farinella, Giovanni Maria

论文摘要

能够理解用户和周围环境之间的关系对帮助用户进行工作场所至关重要。例如，了解用户正在通过可穿戴设备收集的图像和视频进行交互的对象，对于告诉工人使用特定对象以提高生产率并防止事故的情况很有用。尽管现代视觉系统可以依靠先进的算法来检测对象检测，语义和全景分段，但这些方法仍然需要大量特定于领域的标记数据，这在工业场景中可能很难获得。在这种观察过程中，我们提出了一条管道，该管道允许从真实环境和真实对象的3D模型中生成合成图像。生成的图像会自动标记，因此毫不费力地获得。利用所提出的管道，我们生成一个包含合成图像的数据集，该数据集自动标记了用于全景分段的数据集。该集合与少数手动标记的真实图像进行微调相辅相成。实验表明，合成图像的使用可以大大减少获得合理的全景分割性能所需的真实图像数量。

Being able to understand the relations between the user and the surrounding environment is instrumental to assist users in a worksite. For instance, understanding which objects a user is interacting with from images and video collected through a wearable device can be useful to inform the worker on the usage of specific objects in order to improve productivity and prevent accidents. Despite modern vision systems can rely on advanced algorithms for object detection, semantic and panoptic segmentation, these methods still require large quantities of domain-specific labeled data, which can be difficult to obtain in industrial scenarios. Motivated by this observation, we propose a pipeline which allows to generate synthetic images from 3D models of real environments and real objects. The generated images are automatically labeled and hence effortless to obtain. Exploiting the proposed pipeline, we generate a dataset comprising synthetic images automatically labeled for panoptic segmentation. This set is complemented by a small number of manually labeled real images for fine-tuning. Experiments show that the use of synthetic images allows to drastically reduce the number of real images needed to obtain reasonable panoptic segmentation performance.

下载PDF全文

下载文献需遵守相关版权规定

论文标题