论文标题
自上而下的神经建筑,用于文本级别解析话语的修辞结构
A Top-Down Neural Architecture towards Text-Level Parsing of Discourse Rhetorical Structure
论文作者
论文摘要
由于其在深厚的自然语言理解和各种下游应用中的重要性,因此近年来,文本级解析(DRS)引起了越来越多的关注。但是,所有有关文本话语解析的研究都采用自下而上的方法,这极大地限制了DR对本地信息的决心,并且无法从整体话语的全球信息中受益。在本文中,我们从计算和感知点的观点证明,自上而下的体系结构更适合文本级别的DRS解析。在基础上,我们向文本级别的DRS解析提出了自上而下的神经体系结构。特别是,我们将话语解析作为递归分级排名任务,在该任务中,根据其等级将分级分类为不同的级别,与之相关的基本话语单位(EDU)是相应安排的。这样,我们可以通过带有内部堆栈的编码器来确定完整的DRS作为分层树结构。对英语RST-DT语料库和中国CDTB语料库进行的实验表明,我们提出的自上而下的方法对文本级别的DRS解析的有效性很大。
Due to its great importance in deep natural language understanding and various down-stream applications, text-level parsing of discourse rhetorical structure (DRS) has been drawing more and more attention in recent years. However, all the previous studies on text-level discourse parsing adopt bottom-up approaches, which much limit the DRS determination on local information and fail to well benefit from global information of the overall discourse. In this paper, we justify from both computational and perceptive points-of-view that the top-down architecture is more suitable for text-level DRS parsing. On the basis, we propose a top-down neural architecture toward text-level DRS parsing. In particular, we cast discourse parsing as a recursive split point ranking task, where a split point is classified to different levels according to its rank and the elementary discourse units (EDUs) associated with it are arranged accordingly. In this way, we can determine the complete DRS as a hierarchical tree structure via an encoder-decoder with an internal stack. Experimentation on both the English RST-DT corpus and the Chinese CDTB corpus shows the great effectiveness of our proposed top-down approach towards text-level DRS parsing.