论文标题
一种解释是不够的:图像分类的结构化注意图
One Explanation is Not Enough: Structured Attention Graphs for Image Classification
论文作者
论文摘要
注意图是解释图像分类的卷积网络决策的一种流行方式。通常,对于每一个感兴趣的图像,都会产生一个注意图,该图将权重分配给像素对分类的重要性。但是,单个注意力图提供了不完整的理解,因为通常还有许多其他地图可以很好地解释分类。在本文中,我们介绍了结构化的注意图(SAG),该图通过捕获图像区域的不同组合如何影响分类器的置信度,以紧凑地表示图像的注意图集。我们提出了一种计算下垂的方法和一种可视化垂直的方法,以便可以将更深入的见解作为分类器的决策获得。我们进行了一项用户研究,将SAG与传统注意力图的使用进行比较,以回答有关图像分类的反事实问题。我们的结果表明,与基线相比,在回答基于下垂的比较反事实问题时,用户更正确。
Attention maps are a popular way of explaining the decisions of convolutional networks for image classification. Typically, for each image of interest, a single attention map is produced, which assigns weights to pixels based on their importance to the classification. A single attention map, however, provides an incomplete understanding since there are often many other maps that explain a classification equally well. In this paper, we introduce structured attention graphs (SAGs), which compactly represent sets of attention maps for an image by capturing how different combinations of image regions impact a classifier's confidence. We propose an approach to compute SAGs and a visualization for SAGs so that deeper insight can be gained into a classifier's decisions. We conduct a user study comparing the use of SAGs to traditional attention maps for answering counterfactual questions about image classifications. Our results show that the users are more correct when answering comparative counterfactual questions based on SAGs compared to the baselines.