波斯抽象含义表示：注释指南和金标准数据集

论文标题

波斯抽象含义表示：注释指南和金标准数据集

Persian Abstract Meaning Representation: Annotation Guidelines and Gold Standard Dataset

论文作者

Takhshid, Reza, Azin, Tara, Shojaei, Razieh, Bahrani, Mohammad

论文摘要

本文介绍了波斯抽象含义表示（AMR）指南，这是用AMR注释波斯语句子的详细指南，重点介绍了适合波斯独特的句法结构的必要适应性。我们讨论了波斯AMR黄金标准数据集的开发过程，该数据集由按照指南制作的1,562个句子组成。通过检查语言规格和细微差别，这些语言规格和细微差别区分了像波斯语这样的低资源语言的AMR注释，我们阐明了开发普遍意义表示框架的挑战和局限性。本研究中引入的指南和数据集突出了这些挑战，旨在推进该领域。

This paper introduces the Persian Abstract Meaning Representation (AMR) guidelines, a detailed guide for annotating Persian sentences with AMR, focusing on the necessary adaptations to fit Persian's unique syntactic structures. We discuss the development process of a Persian AMR gold standard dataset consisting of 1,562 sentences created following the guidelines. By examining the language specifications and nuances that distinguish AMR annotations of a low-resource language like Persian, we shed light on the challenges and limitations of developing a universal meaning representation framework. The guidelines and the dataset introduced in this study highlight such challenges, aiming to advance the field.

下载PDF全文

下载文献需遵守相关版权规定

论文标题