部分可观测时空混沌系统的无模型预测

论文标题

部分可观测时空混沌系统的无模型预测

Unsupervised Question Answering via Answer Diversifying

论文作者

Nie, Yuxiang, Huang, Heyan, Chi, Zewen, Mao, Xian-Ling

论文摘要

无监督的问题回答是一项有吸引力的任务，因为它在标签数据上的独立性。以前的工作通常使用启发式规则以及预培训的模型来构建数据和训练QA模型。但是，这些作品中的大多数都将命名为实体（NE）视为唯一的答案类型，它忽略了现实世界中答案的高度多样性。为了解决这个问题，我们提出了一种新颖的无监督方法，通过多样化的答案，名为Diverseqa。具体而言，所提出的方法由三个模块组成：数据构建，数据增强和降解过滤器。首先，数据构建模块将提取的命名实体扩展到一个较长的句子构成中，作为构建具有不同答案的质量检查数据集的新答案跨度。其次，数据增强模块通过嵌入级别的对抗训练采用了依赖性数据增强过程。第三，denoising滤波器模块旨在减轻构造数据中的噪声。广泛的实验表明，所提出的方法在五个基准数据集上优于先前的无监督模型，包括skeadv1.1，newsqa，triviaqa，bioasq和Duorc。此外，提出的方法在几个射击学习环境中显示出强大的性能。

Unsupervised question answering is an attractive task due to its independence on labeled data. Previous works usually make use of heuristic rules as well as pre-trained models to construct data and train QA models. However, most of these works regard named entity (NE) as the only answer type, which ignores the high diversity of answers in the real world. To tackle this problem, we propose a novel unsupervised method by diversifying answers, named DiverseQA. Specifically, the proposed method is composed of three modules: data construction, data augmentation and denoising filter. Firstly, the data construction module extends the extracted named entity into a longer sentence constituent as the new answer span to construct a QA dataset with diverse answers. Secondly, the data augmentation module adopts an answer-type dependent data augmentation process via adversarial training in the embedding level. Thirdly, the denoising filter module is designed to alleviate the noise in the constructed data. Extensive experiments show that the proposed method outperforms previous unsupervised models on five benchmark datasets, including SQuADv1.1, NewsQA, TriviaQA, BioASQ, and DuoRC. Besides, the proposed method shows strong performance in the few-shot learning setting.

下载PDF全文

下载文献需遵守相关版权规定

论文标题