论文标题
重新思考通过对比度学习对半监督节点分类带来的促销
Rethinking the Promotion Brought by Contrastive Learning to Semi-Supervised Node Classification
论文作者
论文摘要
事实证明,图对比度学习(GCL)在促进半监督节点分类(SSNC)的性能方面非常有效。但是,现有的GCL方法通常是从CV或NLP等其他领域转移的,CV或NLP的基本工作机制仍然不足。在这项工作中,我们首先深入探究了GCL在SSNC中的工作机制,并发现GCL提出的促进性的分布严重不均匀:改进主要来自带有带注释的信息较少的子图,这与其他领域的对比学习根本不同。但是,现有的GCL方法通常忽略了带注释的信息的不均匀分布,并将GCL均匀地应用于整个图表。为了解决此问题并进一步改善SSNC中的GCL,我们提出了拓扑信息获得图形对比度学习(TIFA-GCL)框架,该框架考虑了GCL中跨图的注释信息分布。在六个基准图数据集上进行的广泛实验,包括巨大的OGB生产图,表明TIFA-GCL在转导性和电感设置中都可以比现有的GCL方法带来更大的改进。进一步的实验证明了TIFA-GCL的普遍性和解释性。
Graph Contrastive Learning (GCL) has proven highly effective in promoting the performance of Semi-Supervised Node Classification (SSNC). However, existing GCL methods are generally transferred from other fields like CV or NLP, whose underlying working mechanism remains under-explored. In this work, we first deeply probe the working mechanism of GCL in SSNC, and find that the promotion brought by GCL is severely unevenly distributed: the improvement mainly comes from subgraphs with less annotated information, which is fundamentally different from contrastive learning in other fields. However, existing GCL methods generally ignore this uneven distribution of annotated information and apply GCL evenly to the whole graph. To remedy this issue and further improve GCL in SSNC, we propose the Topology InFormation gain-Aware Graph Contrastive Learning (TIFA-GCL) framework that considers the annotated information distribution across graph in GCL. Extensive experiments on six benchmark graph datasets, including the enormous OGB-Products graph, show that TIFA-GCL can bring a larger improvement than existing GCL methods in both transductive and inductive settings. Further experiments demonstrate the generalizability and interpretability of TIFA-GCL.