评估图像域中的特征归因方法

论文标题

评估图像域中的特征归因方法

Evaluating Feature Attribution Methods in the Image Domain

论文作者

Gevaert, Arne, Rousseau, Axel-Jan, Becker, Thijs, Valkenborg, Dirk, De Bie, Tijl, Saeys, Yvan

论文摘要

特征归因地图是一种流行的方法，可以突出显示模型预测的图像中最重要的像素。尽管最近的受欢迎程度和可用方法的增长，但对此类归因地图的客观评估很少关注。在此领域的先前工作的基础上，我们研究了现有的指标，并提出了新的指标变体，以评估归因地图。我们确认了最近的发现，即不同的归因指标似乎可以衡量归因图的不同基础概念，并将此发现扩展到更大的属性指标。我们还发现，一个数据集上的度量结果不一定会推广到其他数据集，并且具有理想理论属性（例如DeepShap）的方法不一定超过计算上更便宜的替代方案。基于这些发现，我们提出了一种通用的基准测试方法，以确定给定用例的理想特征归因方法。归因指标的实现和我们的实验可在线获得。

Feature attribution maps are a popular approach to highlight the most important pixels in an image for a given prediction of a model. Despite a recent growth in popularity and available methods, little attention is given to the objective evaluation of such attribution maps. Building on previous work in this domain, we investigate existing metrics and propose new variants of metrics for the evaluation of attribution maps. We confirm a recent finding that different attribution metrics seem to measure different underlying concepts of attribution maps, and extend this finding to a larger selection of attribution metrics. We also find that metric results on one dataset do not necessarily generalize to other datasets, and methods with desirable theoretical properties such as DeepSHAP do not necessarily outperform computationally cheaper alternatives. Based on these findings, we propose a general benchmarking approach to identify the ideal feature attribution method for a given use case. Implementations of attribution metrics and our experiments are available online.

下载PDF全文

下载文献需遵守相关版权规定

论文标题