论文标题
在低维表示中解释一组
Explaining Groups of Points in Low-Dimensional Representations
论文作者
论文摘要
数据探索中的一个常见工作流程是学习数据的低维表示,确定该表示中的积分组,并检查组之间的差异以确定它们的代表。我们通过利用学习低维表示的模型来帮助确定组之间的关键差异,将此工作流视为可解释的机器学习问题。为了解决这个问题,我们介绍了一种新的解释,全局反事实解释(GCE)以及我们的算法,及时的全局翻译(TGT),用于计算GCE。 TGT使用压缩传感来识别每对组之间的差异,但将这些成对差异限制为在所有组之间保持一致。从经验上讲,我们证明TGT能够识别出在相对稀疏的同时准确解释该模型的解释,并且这些解释与数据中的真实模式匹配。
A common workflow in data exploration is to learn a low-dimensional representation of the data, identify groups of points in that representation, and examine the differences between the groups to determine what they represent. We treat this workflow as an interpretable machine learning problem by leveraging the model that learned the low-dimensional representation to help identify the key differences between the groups. To solve this problem, we introduce a new type of explanation, a Global Counterfactual Explanation (GCE), and our algorithm, Transitive Global Translations (TGT), for computing GCEs. TGT identifies the differences between each pair of groups using compressed sensing but constrains those pairwise differences to be consistent among all of the groups. Empirically, we demonstrate that TGT is able to identify explanations that accurately explain the model while being relatively sparse, and that these explanations match real patterns in the data.