电子健康记录的基于概念的模型解释

论文标题

电子健康记录的基于概念的模型解释

Concept-based model explanations for Electronic Health Records

论文作者

Mincu, Diana, Loreaux, Eric, Hou, Shaobo, Baur, Sebastien, Protsyuk, Ivan, Seneviratne, Martin G, Mottram, Anne, Tomasev, Nenad, Karthikesanlingam, Alan, Schrouff, Jessica

论文摘要

复发性神经网络（RNN）通常用于电子健康记录（EHR）不良结果的顺序建模，因为它们可以编码过去的临床状态。与许多任务中的其他建模方法相比，这些深层复发的架构表现出了提高的性能，从而激发了对在临床环境中部署深层模型的兴趣。确保安全模型部署和构建用户信任的关键要素之一是模型解释性。最近引入了使用概念激活向量的测试（TCAV），是一种通过将高级概念与网络梯度进行比较，以提供人为理解的解释。尽管该技术在实际成像应用中显示出令人鼓舞的结果，但尚未应用于结构化的时间输入。为了使TCAV应用于EHR中的顺序预测，我们提出了该方法的扩展到时间序列数据。我们在重症监护室的开放EHR基准上评估了提出的方法，以及我们能够更好地隔离个体效应的合成数据。

Recurrent Neural Networks (RNNs) are often used for sequential modeling of adverse outcomes in electronic health records (EHRs) due to their ability to encode past clinical states. These deep, recurrent architectures have displayed increased performance compared to other modeling approaches in a number of tasks, fueling the interest in deploying deep models in clinical settings. One of the key elements in ensuring safe model deployment and building user trust is model explainability. Testing with Concept Activation Vectors (TCAV) has recently been introduced as a way of providing human-understandable explanations by comparing high-level concepts to the network's gradients. While the technique has shown promising results in real-world imaging applications, it has not been applied to structured temporal inputs. To enable an application of TCAV to sequential predictions in the EHR, we propose an extension of the method to time series data. We evaluate the proposed approach on an open EHR benchmark from the intensive care unit, as well as synthetic data where we are able to better isolate individual effects.

下载PDF全文

下载文献需遵守相关版权规定

论文标题