论文标题
朝正确方向的一个点:空间意识的自我观察的体积表示学习的矢量预测
A Point in the Right Direction: Vector Prediction for Spatially-aware Self-supervised Volumetric Representation Learning
论文作者
论文摘要
高注释成本和密集的3D医学成像任务的标签有限,最近激发了各种3D自我监管的预审计方法,以改善转移学习绩效。但是,尽管这些方法核心在实现有效的3D图像分析方面,这些方法通常缺乏空间意识。更具体地说,位置,规模和方向不仅有用,而且在生成图像作物进行培训时自动可用。然而,迄今为止,尚无任何工作提出借口任务来提炼所有关键的空间特征。为了满足这一需求,我们开发了一种新的自我监督方法矢量,该方法通过两个新颖的借口任务来促进更好的空间理解:向量预测(VP)和以边界为中心的重建(BFR)。副总裁侧重于全球空间概念(即3D贴片的属性),而BFR则解决了最近重建方法的弱点,以学习更有效的局部表示。我们在三个3D医疗图像分割任务上评估了矢量,这通常表明它的表现通常超过了最先进的方法,尤其是在有限的注释设置中。
High annotation costs and limited labels for dense 3D medical imaging tasks have recently motivated an assortment of 3D self-supervised pretraining methods that improve transfer learning performance. However, these methods commonly lack spatial awareness despite its centrality in enabling effective 3D image analysis. More specifically, position, scale, and orientation are not only informative but also automatically available when generating image crops for training. Yet, to date, no work has proposed a pretext task that distills all key spatial features. To fulfill this need, we develop a new self-supervised method, VectorPOSE, which promotes better spatial understanding with two novel pretext tasks: Vector Prediction (VP) and Boundary-Focused Reconstruction (BFR). VP focuses on global spatial concepts (i.e., properties of 3D patches) while BFR addresses weaknesses of recent reconstruction methods to learn more effective local representations. We evaluate VectorPOSE on three 3D medical image segmentation tasks, showing that it often outperforms state-of-the-art methods, especially in limited annotation settings.