迈向因果关系，推断：一种依次诊断的顺序歧视方法

论文标题

迈向因果关系，推断：一种依次诊断的顺序歧视方法

Towards Causality-Aware Inferring: A Sequential Discriminative Approach for Medical Diagnosis

论文作者

Lin, Junfan, Wang, Keze, Chen, Ziliang, Liang, Xiaodan, Lin, Liang

论文摘要

医学诊断助手（MDA）旨在建立一种交互式诊断剂，以依次询问识别疾病的症状。但是，由于用于构建患者模拟器的对话记录是被动收集的，因此某些任务无关的偏见（例如收集者的偏好）可能会恶化数据。这些偏见可能会阻碍诊断剂从模拟器中获取可运输知识。这项工作试图通过利用因果图来识别和解决两个代表性的非因果偏见，即，（i）默认 - 答案偏见和（ii）分布询问偏见，试图解决MDA中的这些关键问题。具体而言，偏见（i）源自患者模拟器，该模拟器试图以一些有偏见的默认答案来回答未记录的查询。因此，由于答案有偏见，诊断剂无法完全证明其优势。为了消除这种偏见，并受到因果图的倾向得分匹配技术的启发，我们提出了一个基于倾向的患者模拟器，以通过从其他记录中获取知识来有效地回答未录制的查询；偏见（ii）本质上与被动收集的数据相同，并且是训练代理人“学习”而不是“记住什么”的关键障碍之一。例如，在训练数据的分布中，如果症状与某种疾病高度相结合，则该特工可能会学会询问该症状以区分该疾病，因此可能不会概括到分布外病例中。为此，我们提出了一种进行性保证剂，其中包括分别考虑症状查询和疾病诊断的双重过程。查询过程是由诊断过程以自上而下的方式驱动的，以查询症状以增强诊断信心。

Medical diagnosis assistant (MDA) aims to build an interactive diagnostic agent to sequentially inquire about symptoms for discriminating diseases. However, since the dialogue records used to build a patient simulator are collected passively, the data might be deteriorated by some task-unrelated biases, such as the preference of the collectors. These biases might hinder the diagnostic agent to capture transportable knowledge from the simulator. This work attempts to address these critical issues in MDA by taking advantage of the causal diagram to identify and resolve two representative non-causal biases, i.e., (i) default-answer bias and (ii) distributional inquiry bias. Specifically, Bias (i) originates from the patient simulator which tries to answer the unrecorded inquiries with some biased default answers. Consequently, the diagnostic agents cannot fully demonstrate their advantages due to the biased answers. To eliminate this bias and inspired by the propensity score matching technique with causal diagram, we propose a propensity-based patient simulator to effectively answer unrecorded inquiry by drawing knowledge from the other records; Bias (ii) inherently comes along with the passively collected data, and is one of the key obstacles for training the agent towards "learning how" rather than "remembering what". For example, within the distribution of training data, if a symptom is highly coupled with a certain disease, the agent might learn to only inquire about that symptom to discriminate that disease, thus might not generalize to the out-of-distribution cases. To this end, we propose a progressive assurance agent, which includes the dual processes accounting for symptom inquiry and disease diagnosis respectively. The inquiry process is driven by the diagnosis process in a top-down manner to inquire about symptoms for enhancing diagnostic confidence.

下载PDF全文

下载文献需遵守相关版权规定

论文标题