论文标题

神经语言产生:配方,方法和评估

Neural Language Generation: Formulation, Methods, and Evaluation

论文作者

Garbacea, Cristina, Mei, Qiaozhu

论文摘要

基于神经网络的生成建模的最新进展重新点燃了希望能够与人无缝交流并能够理解自然语言的计算机系统的希望。在满足各种用户需求的多种环境和任务中,已经采用了神经体系结构来生成各种成功程度的文本摘录。值得注意的是,在大规模数据集中训练的高能力深度学习模型即使在缺乏明确的监督信号的情况下,也表明了学习数据模式的无与伦比的能力,开辟了有关生成现实和连贯文本的许多新可能性。尽管自然语言的生成领域正在迅速发展,但仍有许多开放挑战需要解决。在这项调查中,我们正式定义和分类自然语言产生问题。我们回顾了特定的应用程序任务,这些任务是这些通用表述的实例化,其中产生的自然语言具有实际重要性。接下来,我们包括用于生成多种文本的方法和神经体系结构的全面概述。然而,没有标准方法来评估这些生成模型产生的文本质量,这构成了对该领域进步的严重瓶颈。为此,我们还回顾了当前评估自然语言生成系统的方法。我们希望这项调查将提供有关神经自然语言产生的表述,方法和评估的信息概述。

Recent advances in neural network-based generative modeling have reignited the hopes in having computer systems capable of seamlessly conversing with humans and able to understand natural language. Neural architectures have been employed to generate text excerpts to various degrees of success, in a multitude of contexts and tasks that fulfil various user needs. Notably, high capacity deep learning models trained on large scale datasets demonstrate unparalleled abilities to learn patterns in the data even in the lack of explicit supervision signals, opening up a plethora of new possibilities regarding producing realistic and coherent texts. While the field of natural language generation is evolving rapidly, there are still many open challenges to address. In this survey we formally define and categorize the problem of natural language generation. We review particular application tasks that are instantiations of these general formulations, in which generating natural language is of practical importance. Next we include a comprehensive outline of methods and neural architectures employed for generating diverse texts. Nevertheless, there is no standard way to assess the quality of text produced by these generative models, which constitutes a serious bottleneck towards the progress of the field. To this end, we also review current approaches to evaluating natural language generation systems. We hope this survey will provide an informative overview of formulations, methods, and assessments of neural natural language generation.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源