论文标题

城市数据集扩展的人工假人

Artificial Dummies for Urban Dataset Augmentation

论文作者

Vobecký, Antonín, Hurych, David, Uřičář, Michal, Pérez, Patrick, Šivic, Josef

论文摘要

现有用于训练图像中的行人探测器的数据集遭受外观有限和姿势变化的影响。最具挑战性的情况很少被包括在内,因为由于安全原因,它们太难捕获了,或者它们不太可能发生。在这种罕见情况下,辅助和自动驾驶应用程序的严格安全要求也要求额外的高检测准确性。具有任意外观并嵌入具有不同照明和天气条件的不同背景场景中的人形成人物图像的能力是开发和测试此类应用程序的关键组成部分。本文的贡献是三倍。首先,我们描述了一种增强方法,用于控制包含人的城市场景,从而产生罕见或从未见过的情况。这是通过数据生成器(称为dummynet)具有对姿势,外观和目标背景场景的控制的。其次,拟议的发电机依赖于新颖的网络体系结构和相关的损失,这些损失考虑到了前景人员的细分及其成分,以构成对背景场景。最后,我们证明了我们的dummynet生成的数据改善了各个数据集的几个现有人检测器的性能以及在艰苦的情况下(例如夜间条件),在夜间条件下,只有有限的培训数据可用。在仅提供日常数据的设置中,我们将夜间检测器提高了$ 17 \%$ $ log-a-Log-a-aim-a-aim-a aims率,而仅使用日常数据培训的检测器。

Existing datasets for training pedestrian detectors in images suffer from limited appearance and pose variation. The most challenging scenarios are rarely included because they are too difficult to capture due to safety reasons, or they are very unlikely to happen. The strict safety requirements in assisted and autonomous driving applications call for an extra high detection accuracy also in these rare situations. Having the ability to generate people images in arbitrary poses, with arbitrary appearances and embedded in different background scenes with varying illumination and weather conditions, is a crucial component for the development and testing of such applications. The contributions of this paper are three-fold. First, we describe an augmentation method for controlled synthesis of urban scenes containing people, thus producing rare or never-seen situations. This is achieved with a data generator (called DummyNet) with disentangled control of the pose, the appearance, and the target background scene. Second, the proposed generator relies on novel network architecture and associated loss that takes into account the segmentation of the foreground person and its composition into the background scene. Finally, we demonstrate that the data generated by our DummyNet improve performance of several existing person detectors across various datasets as well as in challenging situations, such as night-time conditions, where only a limited amount of training data is available. In the setup with only day-time data available, we improve the night-time detector by $17\%$ log-average miss rate over the detector trained with the day-time data only.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源