论文标题

波斯抽象含义表示:注释指南和金标准数据集

Persian Abstract Meaning Representation: Annotation Guidelines and Gold Standard Dataset

论文作者

Takhshid, Reza, Azin, Tara, Shojaei, Razieh, Bahrani, Mohammad

论文摘要

本文介绍了波斯抽象含义表示(AMR)指南,这是用AMR注释波斯语句子的详细指南,重点介绍了适合波斯独特的句法结构的必要适应性。我们讨论了波斯AMR黄金标准数据集的开发过程,该数据集由按照指南制作的1,562个句子组成。通过检查语言规格和细微差别,这些语言规格和细微差别区分了像波斯语这样的低资源语言的AMR注释,我们阐明了开发普遍意义表示框架的挑战和局限性。本研究中引入的指南和数据集突出了这些挑战,旨在推进该领域。

This paper introduces the Persian Abstract Meaning Representation (AMR) guidelines, a detailed guide for annotating Persian sentences with AMR, focusing on the necessary adaptations to fit Persian's unique syntactic structures. We discuss the development process of a Persian AMR gold standard dataset consisting of 1,562 sentences created following the guidelines. By examining the language specifications and nuances that distinguish AMR annotations of a low-resource language like Persian, we shed light on the challenges and limitations of developing a universal meaning representation framework. The guidelines and the dataset introduced in this study highlight such challenges, aiming to advance the field.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源