论文标题

使用EHR数据进行心血管风险预测的多模式学习

Multimodal Learning for Cardiovascular Risk Prediction using EHR Data

论文作者

Bagheri, Ayoub, Groenhof, T. Katrien J., Veldhuis, Wouter B., de Jong, Pim A., Asselbergs, Folkert W., Oberski, Daniel L.

论文摘要

电子健康记录(EHR)包含重要的临床和研究价值的结构化和非结构化数据。已经开发出各种机器学习方法来利用EHR中的信息进行风险预测。但是,这些尝试中的大多数都集中在结构化的EHR字段上,并丢失了非结构化文本中的大量信息。为了利用EHR中捕获的潜在信息,在这项研究中,我们提出了一种多模式复发性神经网络模型,用于心血管风险预测,将医学文本和结构化临床信息整合在一起。提出的多模式双向长期记忆(BILSTM)模型将单词嵌入到经典的临床预测因子上,然后将其应用于最终完全连接的神经网络之前。在实验中,我们比较了使用临床变量和胸部X射线放射学报告的不同深层神经网络(DNN)体系结构的性能。拟议的Bilstm模型对现实世界中患者的数据集或心血管疾病的高风险进行了评估,该模型表明了最先进的表现,并且胜过其他DNN基线架构。

Electronic health records (EHRs) contain structured and unstructured data of significant clinical and research value. Various machine learning approaches have been developed to employ information in EHRs for risk prediction. The majority of these attempts, however, focus on structured EHR fields and lose the vast amount of information in the unstructured texts. To exploit the potential information captured in EHRs, in this study we propose a multimodal recurrent neural network model for cardiovascular risk prediction that integrates both medical texts and structured clinical information. The proposed multimodal bidirectional long short-term memory (BiLSTM) model concatenates word embeddings to classical clinical predictors before applying them to a final fully connected neural network. In the experiments, we compare performance of different deep neural network (DNN) architectures including convolutional neural network and long short-term memory in scenarios of using clinical variables and chest X-ray radiology reports. Evaluated on a data set of real world patients with manifest vascular disease or at high-risk for cardiovascular disease, the proposed BiLSTM model demonstrates state-of-the-art performance and outperforms other DNN baseline architectures.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源