理解和在故事中角色之间对话的基准

论文标题

理解和在故事中角色之间对话的基准

A Benchmark for Understanding and Generating Dialogue between Characters in Stories

论文作者

Yao, Jianzhu, Liu, Ziqi, Guan, Jian, Huang, Minlie

论文摘要

许多古典童话，小说和剧本都利用对话来推进故事情节并建立角色。我们提出了第一个研究，以探索机器是否可以理解和产生故事中的对话，这需要捕获不同角色的特征及其之间的关系。为此，我们提出了两个新任务，包括蒙版的对话生成和对话扬声器的认可，即分别产生丢失的对话转弯并预测说话者进行指定的对话转弯。我们构建了一个新的数据集拨号故事，该数据集由105K中国故事组成，并在图中编织了大量对话以支持评估。我们通过对拨号故事进行自动和手动评估测试现有模型来显示提出的任务的困难。此外，我们建议学习明确的角色表示，以提高这些任务的绩效。广泛的实验和案例研究表明，我们的方法可以产生更连贯和信息丰富的对话，并且比强基础更高的说话者识别精度。

Many classical fairy tales, fiction, and screenplays leverage dialogue to advance story plots and establish characters. We present the first study to explore whether machines can understand and generate dialogue in stories, which requires capturing traits of different characters and the relationships between them. To this end, we propose two new tasks including Masked Dialogue Generation and Dialogue Speaker Recognition, i.e., generating missing dialogue turns and predicting speakers for specified dialogue turns, respectively. We build a new dataset DialStory, which consists of 105k Chinese stories with a large amount of dialogue weaved into the plots to support the evaluation. We show the difficulty of the proposed tasks by testing existing models with automatic and manual evaluation on DialStory. Furthermore, we propose to learn explicit character representations to improve performance on these tasks. Extensive experiments and case studies show that our approach can generate more coherent and informative dialogue, and achieve higher speaker recognition accuracy than strong baselines.

下载PDF全文

下载文献需遵守相关版权规定

论文标题