论文标题

PGD​​P5K:平面几何问题的图表解析数据集

PGDP5K: A Diagram Parsing Dataset for Plane Geometry Problems

论文作者

Hao, Yihan, Zhang, Mingliang, Yin, Fei, Huang, Linlin

论文摘要

图表解析是解决几何问题问题的重要基础,在智能教育和文档图像理解领域引起了越来越多的关注。由于复杂的布局和主要关系之间,平面几何图解析(PGDP)仍然是一项具有挑战性的任务,值得进一步的研究和探索。适当的数据集对于PGDP的研究至关重要。尽管已经提出了一些具有粗略注释的数据集来解决几何问题,但它们的规模很小,要么不公开。粗略的注释也使它们不是很有用。因此,我们提出了一个名为PGDP5K的新的大规模几何图数据集和一种新颖的注释方法。我们的数据集由5000个图样本组成,由16个形状组成,涵盖5种位置关系,22种符号类型和6种文本类型。与以前的数据集不同,我们的PGDP5K数据集在原始层面上标有更细粒度的注释,包括原始类,位置和关系。更重要的是,结合上述注释和几何知识知识,它可以自动和独特地产生可理解的几何命题。我们在PGDP5K和IMP-AGEOMETRY3K数据集上进行了实验,该数据集表明,最新方法(SOTA)方法仅达到66.07%的F1值。这表明PGDP5K为将来的研究带来了挑战。我们的数据集可从http://www.nlpr.ia.ac.cn/databases/casia-pgdp5k/获得。

Diagram parsing is an important foundation for geometry problem solving, attracting increasing attention in the field of intelligent education and document image understanding. Due to the complex layout and between-primitive relationship, plane geometry diagram parsing (PGDP) is still a challenging task deserving further research and exploration. An appropriate dataset is critical for the research of PGDP. Although some datasets with rough annotations have been proposed to solve geometric problems, they are either small in scale or not publicly available. The rough annotations also make them not very useful. Thus, we propose a new large-scale geometry diagram dataset named PGDP5K and a novel annotation method. Our dataset consists of 5000 diagram samples composed of 16 shapes, covering 5 positional relations, 22 symbol types and 6 text types. Different from previous datasets, our PGDP5K dataset is labeled with more fine-grained annotations at primitive level, including primitive classes, locations and relationships. What is more, combined with above annotations and geometric prior knowledge, it can generate intelligible geometric propositions automatically and uniquely. We performed experiments on PGDP5K and IMP-Geometry3K datasets reveal that the state-of-the-art (SOTA) method achieves only 66.07% F1 value. This shows that PGDP5K presents a challenge for future research. Our dataset is available at http://www.nlpr.ia.ac.cn/databases/CASIA-PGDP5K/.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源