论文标题
通过实例加权改善使用互补的最后摄取选择来改善多转响应选择模型
Improving Multi-Turn Response Selection Models with Complementary Last-Utterance Selection by Instance Weighting
论文作者
论文摘要
基于开放域检索的对话系统需要大量的培训数据才能学习其参数。但是,实际上,培训数据的负样本通常是从未经注释的对话数据中随机选择的。生成的训练数据可能包含噪声并影响响应选择模型的性能。为了解决这个困难,我们考虑利用数据资源本身中的基本相关性来得出不同种类的监督信号并减少嘈杂数据的影响。更特别地,我们考虑了一个主要的汇编任务对。给定最后的话语和上下文,主要任务(\ ie我们的焦点)选择了正确的响应,互补任务在给定响应和上下文的情况下选择了最后的话语。关键点是,补充任务的输出用于为主要任务设置实例权重。我们在两个公共数据集中进行了广泛的实验,并在两个数据集中获得了显着改进。我们还研究了多个方面的方法的变体,结果验证了我们方法的有效性。
Open-domain retrieval-based dialogue systems require a considerable amount of training data to learn their parameters. However, in practice, the negative samples of training data are usually selected from an unannotated conversation data set at random. The generated training data is likely to contain noise and affect the performance of the response selection models. To address this difficulty, we consider utilizing the underlying correlation in the data resource itself to derive different kinds of supervision signals and reduce the influence of noisy data. More specially, we consider a main-complementary task pair. The main task (\ie our focus) selects the correct response given the last utterance and context, and the complementary task selects the last utterance given the response and context. The key point is that the output of the complementary task is used to set instance weights for the main task. We conduct extensive experiments in two public datasets and obtain significant improvement in both datasets. We also investigate the variant of our approach in multiple aspects, and the results have verified the effectiveness of our approach.