论文标题
与Wikihow有关目标,步骤和时间顺序的推理
Reasoning about Goals, Steps, and Temporal Ordering with WikiHow
论文作者
论文摘要
我们提出了一套关于程序事件之间两种关系的推理任务:目标步骤关系(“学习姿势”是更大的目标的步骤)和步骤临时关系(“购买Yoga Mat”通常先于“学习姿势”)。我们介绍了一个基于Wikihow的数据集,该数据集针对这两个关系,Wikihow是教学方法文章的网站。我们的人类验证测试集是常识性推断的可靠基准,在最先进的变压器模型和人类绩效的表现之间,差距约为10%至20%。我们自动生成的训练集允许模型有效地转移到需要了解程序事件知识的室外任务中,并在赃物,狙击手段上进行了大大改进的表演,并且在零和几乎没有拍摄设置中的故事固定测试。
We propose a suite of reasoning tasks on two types of relations between procedural events: goal-step relations ("learn poses" is a step in the larger goal of "doing yoga") and step-step temporal relations ("buy a yoga mat" typically precedes "learn poses"). We introduce a dataset targeting these two relations based on wikiHow, a website of instructional how-to articles. Our human-validated test set serves as a reliable benchmark for commonsense inference, with a gap of about 10% to 20% between the performance of state-of-the-art transformer models and human performance. Our automatically-generated training set allows models to effectively transfer to out-of-domain tasks requiring knowledge of procedural events, with greatly improved performances on SWAG, Snips, and the Story Cloze Test in zero- and few-shot settings.