羽毛信任的鸟类在一起：知道何时通过自适应社区聚集来信任分类器

论文标题

羽毛信任的鸟类在一起：知道何时通过自适应社区聚集来信任分类器

Birds of a Feather Trust Together: Knowing When to Trust a Classifier via Adaptive Neighborhood Aggregation

论文作者

Xiong, Miao, Li, Shen, Feng, Wenjie, Deng, Ailin, Zhang, Jihai, Hooi, Bryan

论文摘要

我们怎么知道分类器的预测何时可以信任？这是一个基本问题，也具有巨大的实际适用性，尤其是在安全至关重要的地区，例如医学和自动驾驶。使用分类器的SoftMax输出作为可信赖性的替代方法的事实上的方法遭受了过度信心的问题。尽管最新的作品遇到了诸如额外的再培训成本和准确性与可信赖性权衡的问题。在这项工作中，我们认为分类器对样本的预测的可信度与两个因素高度相关：样本的邻居信息和分类器的输出。为了结合两全其美的最好，我们设计了一种模型的事后方法，邻居邻居通过自适应邻里聚集来利用这两个基本信息。从理论上讲，我们表明邻居是单跳图卷积网络的广义版本，它继承了强大的建模能力，以捕获每个类中样本之间的不同相似性。我们还将方法扩展到了密切相关检测的紧密相关任务，并提供了理论保证，以绑定假负的束缚。从经验上讲，关于图像和表格基准的广泛实验验证了我们的理论，并表明邻居表现优于其他方法，实现了最先进的可信度性能。

How do we know when the predictions made by a classifier can be trusted? This is a fundamental problem that also has immense practical applicability, especially in safety-critical areas such as medicine and autonomous driving. The de facto approach of using the classifier's softmax outputs as a proxy for trustworthiness suffers from the over-confidence issue; while the most recent works incur problems such as additional retraining cost and accuracy versus trustworthiness trade-off. In this work, we argue that the trustworthiness of a classifier's prediction for a sample is highly associated with two factors: the sample's neighborhood information and the classifier's output. To combine the best of both worlds, we design a model-agnostic post-hoc approach NeighborAgg to leverage the two essential information via an adaptive neighborhood aggregation. Theoretically, we show that NeighborAgg is a generalized version of a one-hop graph convolutional network, inheriting the powerful modeling ability to capture the varying similarity between samples within each class. We also extend our approach to the closely related task of mislabel detection and provide a theoretical coverage guarantee to bound the false negative. Empirically, extensive experiments on image and tabular benchmarks verify our theory and suggest that NeighborAgg outperforms other methods, achieving state-of-the-art trustworthiness performance.

下载PDF全文

下载文献需遵守相关版权规定

论文标题