论文标题

dumlp-pin:用于设定特征提取的双MLP-DOT-DOT-DOT-PROPODUCT-PROPODUCT INVARINT网络

DuMLP-Pin: A Dual-MLP-dot-product Permutation-invariant Network for Set Feature Extraction

论文作者

Fei, Jiajun, Zhu, Ziyu, Liu, Wenlei, Deng, Zhidong, Li, Mingyang, Deng, Huanjun, Zhang, Shuo

论文摘要

现有的置换不变方法可以根据汇总范围(即全球聚合和局部)分为两类。尽管全球聚合方法,e。 g。,PointNet和Deep Sets,参与更简单的结构,它们的性能比PointNet ++和Point Transformer(例如PointNet ++)的局部聚合更差。如果存在具有简单结构,竞争性能甚至更少的参数的全球聚合方法,那么它仍然是一个开放的问题。在本文中,我们提出了一个基于双MLP点产品的新型全局聚合置换不变网络,称为DUMLP-PIN,该网络能够用于提取集合输入的功能,包括无序或非结构的像素,属性,属性,属性和点云数据集。我们严格地证明,由于给定输入集的基数大于阈值,因此DUMLP-PIN实现的任何置换不变函数都可以分解为两个或多个置换量的函数。我们还表明,在某些条件下,DUMLP-PIN可以看作是具有很强限制的深度集。 DUMLP-PIN的性能在具有不同数据集的几个不同任务上进行了评估。实验结果表明,我们的DUMLP-PIN在像素集和属性集的两个分类问题上取得了最佳结果。在点云分类和零件分割上,DUMLP-PIN的精度非常接近SO-FAR表现最佳的局部聚合方法,其差异仅为1-2%,而所需参数的数量在分类中分别显着降低了85%以上,分别为69%。该代码可在https://github.com/jaronthu/dumlp-pin上公开获取。

Existing permutation-invariant methods can be divided into two categories according to the aggregation scope, i.e. global aggregation and local one. Although the global aggregation methods, e. g., PointNet and Deep Sets, get involved in simpler structures, their performance is poorer than the local aggregation ones like PointNet++ and Point Transformer. It remains an open problem whether there exists a global aggregation method with a simple structure, competitive performance, and even much fewer parameters. In this paper, we propose a novel global aggregation permutation-invariant network based on dual MLP dot-product, called DuMLP-Pin, which is capable of being employed to extract features for set inputs, including unordered or unstructured pixel, attribute, and point cloud data sets. We strictly prove that any permutation-invariant function implemented by DuMLP-Pin can be decomposed into two or more permutation-equivariant ones in a dot-product way as the cardinality of the given input set is greater than a threshold. We also show that the DuMLP-Pin can be viewed as Deep Sets with strong constraints under certain conditions. The performance of DuMLP-Pin is evaluated on several different tasks with diverse data sets. The experimental results demonstrate that our DuMLP-Pin achieves the best results on the two classification problems for pixel sets and attribute sets. On both the point cloud classification and the part segmentation, the accuracy of DuMLP-Pin is very close to the so-far best-performing local aggregation method with only a 1-2% difference, while the number of required parameters is significantly reduced by more than 85% in classification and 69% in segmentation, respectively. The code is publicly available on https://github.com/JaronTHU/DuMLP-Pin.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源