论文标题
最佳分类森林的数学编程方法
A Mathematical Programming Approach to Optimal Classification Forests
论文作者
论文摘要
本文介绍了加权最佳分类森林(WOCFS),这是一个新的分类器系列,它利用最佳的决策树合奏来得出准确且可解释的分类器。我们提出了一种新型的基于数学优化的方法,该方法同时构建了给定数量的树木,每个树都为特征空间中的观测值提供了预测类。分类规则是通过分配给每个观察结果中最常预测的类别来得出的。我们为问题提供了混合整数线性编程公式(MIP),并提供了几种新型的MIP加强 /缩放技术。我们报告了计算实验的结果,我们得出的结论是,与最新的中小型实例的基于树木的分类方法相比,我们的方法具有相等或卓越的性能。我们还提出了三个现实世界中的案例研究,表明我们的方法在解释性方面具有非常有趣的含义。总体而言,WOCF补充了现有方法,例如购物车,最佳分类树,随机森林和XGBoost。除了其对准确性和可解释性的提高外,我们还看到了针对不同特征变量的不同树木出现的独特特性。这为受过反事实解释而言,这提供了训练有素模型的可解释性和可用性的非平凡改善。因此,尽管WOCF的计算挑战明显,这些WOCF限制了可以用当前MIP有效解决问题的问题的大小,但这是一个重要的研究方向,可以为研究人员带来定性不同的见解,并补充从业人员的工具箱,以解决高级问题问题。
This paper introduces Weighted Optimal Classification Forests (WOCFs), a new family of classifiers that takes advantage of an optimal ensemble of decision trees to derive accurate and interpretable classifiers. We propose a novel mathematical optimization-based methodology which simultaneously constructs a given number of trees, each of them providing a predicted class for the observations in the feature space. The classification rule is derived by assigning to each observation its most frequently predicted class among the trees. We provide a mixed integer linear programming formulation (MIP) for the problem and several novel MIP strengthening / scaling techniques. We report the results of our computational experiments, from which we conclude that our method has equal or superior performance compared with state-of-the-art tree-based classification methods for small to medium-sized instances. We also present three real-world case studies showing that our methodology has very interesting implications in terms of interpretability. Overall, WOCFs complement existing methods such as CART, Optimal Classification Trees, Random Forests and XGBoost. In addition to its Pareto improvement on accuracy and interpretability, we also see unique properties emerging in terms of different trees focusing on different feature variables. This provides nontrivial improvement in interpretability and usability of the trained model in terms of counterfactual explanation. Thus, despite the apparent computational challenge of WOCFs that limit the size of the problems that can be efficiently solved with current MIP, this is an important research direction that can lead to qualitatively different insights for researchers and complement the toolbox of practitioners for high stakes problems.