论文标题
供奉献:关于规则的可解释推理的证明生成
PRover: Proof Generation for Interpretable Reasoning over Rules
论文作者
论文摘要
Clark等人的最新工作。 (2020)表明,通过明确提供自然语言知识的问题,变形金刚可以用作“软定理掠夺”。在我们的工作中,我们通过提出一个可解释的基于变压器的模型来模拟形式定理掠夺,迈向模拟形式定理掠夺者,该模型共同回答规则基础上的二进制问题并生成相应的证据。我们的模型学会了预测与有效约束训练范式中证明图相对应的节点和边缘。在推断期间,生成了满足一组全局约束的有效证明。我们对合成,手工作品和人类参数的规则基础进行实验,以显示质量保证和证明产生的有希望的结果,并具有强烈的概括性能。首先,与RuleTakers相比,供奉献精度为87%,同时保留或提高质量检查的性能(在零拍摄评估方面提高了6%)。其次,当接受需要较低推理深度的问题的培训时,它会明显更好地概括到更高的深度(最大提高15%)。第三,供者仅使用40%的培训数据获得了98%的完美质量质量质量准确性。但是,为需要更高推理深度的问题生成证据变得具有挑战性,而“深度5”的精度下降到65%,这表明将来的工作范围很大。我们的代码和模型可在https://github.com/swarnahub/prover上公开获取
Recent work by Clark et al. (2020) shows that transformers can act as 'soft theorem provers' by answering questions over explicitly provided knowledge in natural language. In our work, we take a step closer to emulating formal theorem provers, by proposing PROVER, an interpretable transformer-based model that jointly answers binary questions over rule-bases and generates the corresponding proofs. Our model learns to predict nodes and edges corresponding to proof graphs in an efficient constrained training paradigm. During inference, a valid proof, satisfying a set of global constraints is generated. We conduct experiments on synthetic, hand-authored, and human-paraphrased rule-bases to show promising results for QA and proof generation, with strong generalization performance. First, PROVER generates proofs with an accuracy of 87%, while retaining or improving performance on the QA task, compared to RuleTakers (up to 6% improvement on zero-shot evaluation). Second, when trained on questions requiring lower depths of reasoning, it generalizes significantly better to higher depths (up to 15% improvement). Third, PROVER obtains near perfect QA accuracy of 98% using only 40% of the training data. However, generating proofs for questions requiring higher depths of reasoning becomes challenging, and the accuracy drops to 65% for 'depth 5', indicating significant scope for future work. Our code and models are publicly available at https://github.com/swarnaHub/PRover