论文标题
使用生成对抗网络估算连续值的干预措施的影响
Estimating the Effects of Continuous-valued Interventions using Generative Adversarial Networks
论文作者
论文摘要
尽管对估计观察数据的离散干预措施的影响的问题非常关注,但是在连续值的干预措施(例如与剂量参数相关的处理)中,几乎没有完成的工作。在本文中,我们通过建立修改生成对抗网络(GAN)框架来解决这个问题。我们的模型Scigan具有灵活性,并且能够同时估算几种不同连续干预措施的反事实结果。关键思想是使用明显修改的GAN模型来学习生成反事实结果,然后使用标准监督方法来学习推理模型,能够为新样本估算这些反事实。为了解决转移到持续干预措施所带来的挑战,我们为歧视者提出了一种新颖的架构 - 我们建立了一个层次结构歧视者,该分层歧视器利用了连续干预设置的结构。此外,我们提供了理论上的结果,以支持我们对GAN框架和分层歧视者的使用。在实验部分中,我们介绍了一个新的半合成数据模拟,用于连续干预设置,并证明了对现有基准模型的改进。
While much attention has been given to the problem of estimating the effect of discrete interventions from observational data, relatively little work has been done in the setting of continuous-valued interventions, such as treatments associated with a dosage parameter. In this paper, we tackle this problem by building on a modification of the generative adversarial networks (GANs) framework. Our model, SCIGAN, is flexible and capable of simultaneously estimating counterfactual outcomes for several different continuous interventions. The key idea is to use a significantly modified GAN model to learn to generate counterfactual outcomes, which can then be used to learn an inference model, using standard supervised methods, capable of estimating these counterfactuals for a new sample. To address the challenges presented by shifting to continuous interventions, we propose a novel architecture for our discriminator - we build a hierarchical discriminator that leverages the structure of the continuous intervention setting. Moreover, we provide theoretical results to support our use of the GAN framework and of the hierarchical discriminator. In the experiments section, we introduce a new semi-synthetic data simulation for use in the continuous intervention setting and demonstrate improvements over the existing benchmark models.