RZCR：通过基于自由基推理的零弹性字符识别

论文标题

RZCR：通过基于自由基推理的零弹性字符识别

RZCR: Zero-shot Character Recognition via Radical-based Reasoning

论文作者

Diao, Xiaolei, Shi, Daqian, Tang, Hao, Shen, Qiang, Li, Yanzeng, Wu, Lei, Xu, Hao

论文摘要

长尾效应是一个普遍的问题，它限制了对现实世界数据集中深度学习模型的性能。由于字符使用频率差异，字符图像数据集还受到这种不平衡数据分布的影响。因此，当当前的角色识别方法应用于现实世界时，尤其是对于缺乏训练样本的尾巴类别，例如不常见的字符。在本文中，我们通过称为RZCR提出一个零拍的字符识别框架，以提高尾部中几个样本字符类别的识别性能。具体来说，我们通过根据拼字法分解和重建字符来利用激进分子，字符的图形单位。 RZCR由基于视觉语义融合的激进信息提取器（RIE）和知识图字符推理器（KGR）组成。 Rie的目的是识别候选激进分子及其可能与角色图像的结构关系。然后将结果馈入KGR，以通过知识图推理来识别目标字符。我们在多个数据集上验证我们的方法，RZCR显示出令人鼓舞的实验结果，尤其是在几个样本字符数据集上。

The long-tail effect is a common issue that limits the performance of deep learning models on real-world datasets. Character image datasets are also affected by such unbalanced data distribution due to differences in character usage frequency. Thus, current character recognition methods are limited when applied in the real world, especially for the categories in the tail that lack training samples, e.g., uncommon characters. In this paper, we propose a zero-shot character recognition framework via radical-based reasoning, called RZCR, to improve the recognition performance of few-sample character categories in the tail. Specifically, we exploit radicals, the graphical units of characters, by decomposing and reconstructing characters according to orthography. RZCR consists of a visual semantic fusion-based radical information extractor (RIE) and a knowledge graph character reasoner (KGR). RIE aims to recognize candidate radicals and their possible structural relations from character images in parallel. The results are then fed into KGR to recognize the target character by reasoning with a knowledge graph. We validate our method on multiple datasets, and RZCR shows promising experimental results, especially on few-sample character datasets.

下载PDF全文

下载文献需遵守相关版权规定

论文标题