论文标题

TANET:抽象性对话摘要的线程预处理

TANet: Thread-Aware Pretraining for Abstractive Conversational Summarization

论文作者

Yang, Ze, Wang, Liran, Tian, Zhoujin, Wu, Wei, Li, Zhoujun

论文摘要

尽管预训练的语言模型(PLM)取得了巨大的成功并成为NLP中的里程碑,但抽象性的对话摘要仍然是一项挑战,但研究较少的任务。困难在于两个方面。一个是缺乏大规模的对话摘要数据。另一个是,将现有的预训练模型应用于此任务是棘手的,因为对话中的结构依赖性及其非正式表达等。在这项工作中,我们首先基于Reddit社区中的多人讨论,首先构建了称为RCS的大规模(11m)预处理数据集。然后,我们提出Tanet,这是一个基于线程的变压器网络。与将对话视为一系列句子的现有预训练的模型不同,我们认为话语之间固有的上下文依赖性在理解整个对话时起着至关重要的作用,因此提出了两种新技术,将结构信息纳入我们的模型。首先是线程感知的关注,它是通过考虑说法中的上下文依赖性来计算的。其次,我们应用线程预测损失来预测话语之间的关系。我们在四个真实对话的数据集上评估了我们的模型,涵盖了成绩单,客户服务记录和论坛线程的类型。实验结果表明,就自动评估和人类判断而言,TANET可以实现新的最新。

Although pre-trained language models (PLMs) have achieved great success and become a milestone in NLP, abstractive conversational summarization remains a challenging but less studied task. The difficulty lies in two aspects. One is the lack of large-scale conversational summary data. Another is that applying the existing pre-trained models to this task is tricky because of the structural dependence within the conversation and its informal expression, etc. In this work, we first build a large-scale (11M) pretraining dataset called RCS, based on the multi-person discussions in the Reddit community. We then present TANet, a thread-aware Transformer-based network. Unlike the existing pre-trained models that treat a conversation as a sequence of sentences, we argue that the inherent contextual dependency among the utterances plays an essential role in understanding the entire conversation and thus propose two new techniques to incorporate the structural information into our model. The first is thread-aware attention which is computed by taking into account the contextual dependency within utterances. Second, we apply thread prediction loss to predict the relations between utterances. We evaluate our model on four datasets of real conversations, covering types of meeting transcripts, customer-service records, and forum threads. Experimental results demonstrate that TANET achieves a new state-of-the-art in terms of both automatic evaluation and human judgment.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源