论文标题
ITKD:3D对象检测的基于互换转移的知识蒸馏
itKD: Interchange Transfer-based Knowledge Distillation for 3D Object Detection
论文作者
论文摘要
基于点云的3D对象检测器最近取得了显着的进展。但是,大多数研究仅限于在不考虑计算效率的情况下仅提高其准确性的网络体系结构的发展。在本文中,我们首先提出了一个自动编码器式框架,其中包括基于互换转移的知识蒸馏,包括渠道压缩和减压。为了了解教师网络的地图视图功能,教师和学生网络的功能独立传递了共享的自动编码器;在这里,我们使用压缩表示损失,将学生和教师网络的渠道明智的压缩知识绑定为一种正则化。解压缩的特征以相反的方向传输,以减少交换重建中的间隙。最后,我们提出了头部注意力丧失,以匹配由多头自我发项机制绘制的3D对象检测信息。通过广泛的实验,我们验证了我们的方法可以训练与3D点云检测任务相结合的轻量级模型,并使用众所周知的公共数据集证明了它的优势。例如,Waymo和Nuscenes。
Point-cloud based 3D object detectors recently have achieved remarkable progress. However, most studies are limited to the development of network architectures for improving only their accuracy without consideration of the computational efficiency. In this paper, we first propose an autoencoder-style framework comprising channel-wise compression and decompression via interchange transfer-based knowledge distillation. To learn the map-view feature of a teacher network, the features from teacher and student networks are independently passed through the shared autoencoder; here, we use a compressed representation loss that binds the channel-wised compression knowledge from both student and teacher networks as a kind of regularization. The decompressed features are transferred in opposite directions to reduce the gap in the interchange reconstructions. Lastly, we present an head attention loss to match the 3D object detection information drawn by the multi-head self-attention mechanism. Through extensive experiments, we verify that our method can train the lightweight model that is well-aligned with the 3D point cloud detection task and we demonstrate its superiority using the well-known public datasets; e.g., Waymo and nuScenes.