使用EHRS的标签依赖性和事件引导的可解释疾病风险预测

论文标题

使用EHRS的标签依赖性和事件引导的可解释疾病风险预测

Label-dependent and event-guided interpretable disease risk prediction using EHRs

论文作者

Niu, Shuai, Song, Yunya, Yin, Qing, Guo, Yike, Yang, Xian

论文摘要

电子健康记录（EHRS）包含患者的异质数据，这些数据是从参与患者护理的医疗提供者中收集的，包括医疗笔记，临床事件，实验室测试结果，症状和诊断。在现代医疗保健领域，预测患者是否会根据其EHR遇到任何风险，成为一个有前途的研究领域，其中人工智能（AI）起着关键作用。要使AI模型实际上适用，要求预测结果应既准确又可解释。为了实现这一目标，本文提出了一个依赖标签和事件引导的风险预测模型（LERP），以通过主要从非结构化医疗票据中提取信息来预测多种疾病风险的存在。我们的模型在以下方面列出了。首先，我们采用了一种依赖标签的机制，该机制更加关注与风险标签名称在语义上相似的医学注释中的单词。其次，由于临床事件（例如，治疗和药物）也可以表明患者的健康状况，我们的模型利用了事件中的信息，并利用它们来生成事件引导的医疗票据表示。第三，集成了依赖标签的依赖性和事件引导的表示形式，以做出强大的预测，在这种预测中，人们对医学注释的单词的注意力权重实现了可解释性。为了证明所提出的方法的适用性，我们将其应用于模仿III数据集，该数据集包含从医院收集的现实世界EHR。我们的方法以定量和定性方式进行评估。

Electronic health records (EHRs) contain patients' heterogeneous data that are collected from medical providers involved in the patient's care, including medical notes, clinical events, laboratory test results, symptoms, and diagnoses. In the field of modern healthcare, predicting whether patients would experience any risks based on their EHRs has emerged as a promising research area, in which artificial intelligence (AI) plays a key role. To make AI models practically applicable, it is required that the prediction results should be both accurate and interpretable. To achieve this goal, this paper proposed a label-dependent and event-guided risk prediction model (LERP) to predict the presence of multiple disease risks by mainly extracting information from unstructured medical notes. Our model is featured in the following aspects. First, we adopt a label-dependent mechanism that gives greater attention to words from medical notes that are semantically similar to the names of risk labels. Secondly, as the clinical events (e.g., treatments and drugs) can also indicate the health status of patients, our model utilizes the information from events and uses them to generate an event-guided representation of medical notes. Thirdly, both label-dependent and event-guided representations are integrated to make a robust prediction, in which the interpretability is enabled by the attention weights over words from medical notes. To demonstrate the applicability of the proposed method, we apply it to the MIMIC-III dataset, which contains real-world EHRs collected from hospitals. Our method is evaluated in both quantitative and qualitative ways.

下载PDF全文

下载文献需遵守相关版权规定

论文标题