论文标题

KTN:学习多人2d-3d通信的知识转移网络

KTN: Knowledge Transfer Network for Learning Multi-person 2D-3D Correspondences

论文作者

Wang, Xuanhan, Gao, Lianli, Zhou, Yixuan, Song, Jingkuan, Wang, Meng

论文摘要

人类茂密的估计旨在建立人体2D像素与3D人体模板之间的密集对应关系,是使机器能够了解图像中人员的关键技术。由于实际场景很复杂,只有部分注释可用,导致无能为力或错误的估计,它仍然构成了几个挑战。在这项工作中,我们提出了一个新颖的框架,以检测图像中多人的密集。我们指的是知识转移网络(KTN)的拟议方法解决了两个主要问题:1)如何完善图像表示以减轻不完整的估计,以及2)如何减少由低质量培训标签引起的错误估计(即有限的注释和班级失衡标签)。与现有的作品直接传播区域的金字塔特征以进行致密估计,KTN使用了金字塔表示的改进,在此中,它同时保持特征分辨率并抑制背景像素,并且这种策略会大大增加准确性。此外,KTN通过外部知识提高了基于3D的身体解析的能力,在该知识中,它通过结构性的身体知识图,从足够的注释作为基于3D的身体解析器进行了训练。通过这种方式,它大大减少了由低质量注释引起的不利影响。 KTN的有效性是通过其优越的性能与致密coco数据集的最先进方法所证明的。关于代表性任务(例如,人体分割,人体部分分割和关键点检测)和两个流行的致密估计管道(即RCNN和全面趋化框架)的广泛消融研究和实验结果,进一步表明了提议方法的概括性。

Human densepose estimation, aiming at establishing dense correspondences between 2D pixels of human body and 3D human body template, is a key technique in enabling machines to have an understanding of people in images. It still poses several challenges due to practical scenarios where real-world scenes are complex and only partial annotations are available, leading to incompelete or false estimations. In this work, we present a novel framework to detect the densepose of multiple people in an image. The proposed method, which we refer to Knowledge Transfer Network (KTN), tackles two main problems: 1) how to refine image representation for alleviating incomplete estimations, and 2) how to reduce false estimation caused by the low-quality training labels (i.e., limited annotations and class-imbalance labels). Unlike existing works directly propagating the pyramidal features of regions for densepose estimation, the KTN uses a refinement of pyramidal representation, where it simultaneously maintains feature resolution and suppresses background pixels, and this strategy results in a substantial increase in accuracy. Moreover, the KTN enhances the ability of 3D based body parsing with external knowledges, where it casts 2D based body parsers trained from sufficient annotations as a 3D based body parser through a structural body knowledge graph. In this way, it significantly reduces the adverse effects caused by the low-quality annotations. The effectiveness of KTN is demonstrated by its superior performance to the state-of-the-art methods on DensePose-COCO dataset. Extensive ablation studies and experimental results on representative tasks (e.g., human body segmentation, human part segmentation and keypoints detection) and two popular densepose estimation pipelines (i.e., RCNN and fully-convolutional frameworks), further indicate the generalizability of the proposed method.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源