论文标题
通过转换比较:语法和词汇语义的反向工程UCCA
Comparison by Conversion: Reverse-Engineering UCCA from Syntax and Lexical Semantics
论文作者
论文摘要
建立强大的自然语言理解系统将需要清楚地表征各种语言意义表示形式是否相互补充。为了进行系统的比较分析,我们使用两种互补方法评估了来自不同框架的含义表示形式之间的映射:(i)基于规则的转换器,以及(ii)一个有监督的省视解析器,该解析器仅使用来自另一个功能的信息来解析一个框架。我们应用这些方法将Streusle语料库(带有句法和词汇语义注释)转换为UCCA(图形结构的全句子含义表示)。两种方法都产生了令人惊讶的准确目标表示形式,接近完全监督的UCCA解析器质量 - 表明UCCA注释与Streusle注释有部分冗余。尽管框架之间存在这种实质性的融合,但我们发现了几个重要的差异领域。
Building robust natural language understanding systems will require a clear characterization of whether and how various linguistic meaning representations complement each other. To perform a systematic comparative analysis, we evaluate the mapping between meaning representations from different frameworks using two complementary methods: (i) a rule-based converter, and (ii) a supervised delexicalized parser that parses to one framework using only information from the other as features. We apply these methods to convert the STREUSLE corpus (with syntactic and lexical semantic annotations) to UCCA (a graph-structured full-sentence meaning representation). Both methods yield surprisingly accurate target representations, close to fully supervised UCCA parser quality---indicating that UCCA annotations are partially redundant with STREUSLE annotations. Despite this substantial convergence between frameworks, we find several important areas of divergence.