通过无监督的事后知识注入实现对话目标

论文标题

通过无监督的事后知识注入实现对话目标

Achieving Conversational Goals with Unsupervised Post-hoc Knowledge Injection

论文作者

Majumder, Bodhisattwa Prasad, Jhamtani, Harsh, Berg-Kirkpatrick, Taylor, McAuley, Julian

论文摘要

当前的神经对话模型的局限性是，它们倾向于在产生的响应中缺乏特异性和信息性，这主要是由于依赖培训数据的培训数据，这些数据涵盖了有限的情况，并且传达了有限的知识。减轻此问题的一种方法是在解码时从外部来源中提取相关知识，并将其纳入对话框响应中。在本文中，我们提出了一种事后知识注入技术，我们首先检索了以对话框历史记录和现有对话框模型的初步响应为条件的一组相关知识片段。我们构建了多个候选响应，使用基于梯度的解码方法将每个检索的摘要分别注入初始响应中，然后使用无监督的排名步骤选择最终响应。我们在面向目标和知识的对话框设置中进行的实验表明，与先前的对话系统的响应相比，人类注释者判断提出方法的输出更具吸引力和信息性。我们进一步表明，知识主张促进了在两个实验环境中实现对话目标方面的成功。

A limitation of current neural dialog models is that they tend to suffer from a lack of specificity and informativeness in generated responses, primarily due to dependence on training data that covers a limited variety of scenarios and conveys limited knowledge. One way to alleviate this issue is to extract relevant knowledge from external sources at decoding time and incorporate it into the dialog response. In this paper, we propose a post-hoc knowledge-injection technique where we first retrieve a diverse set of relevant knowledge snippets conditioned on both the dialog history and an initial response from an existing dialog model. We construct multiple candidate responses, individually injecting each retrieved snippet into the initial response using a gradient-based decoding method, and then select the final response with an unsupervised ranking step. Our experiments in goal-oriented and knowledge-grounded dialog settings demonstrate that human annotators judge the outputs from the proposed method to be more engaging and informative compared to responses from prior dialog systems. We further show that knowledge-augmentation promotes success in achieving conversational goals in both experimental settings.

下载PDF全文

下载文献需遵守相关版权规定

论文标题