MinOhealth.AI：对加纳，越南和美利坚合众国胸膜积液和心脏肿瘤诊断的深度学习系统的临床评估

论文标题

MinOhealth.AI：对加纳，越南和美利坚合众国胸膜积液和心脏肿瘤诊断的深度学习系统的临床评估

minoHealth.ai: A Clinical Evaluation Of Deep Learning Systems For the Diagnosis of Pleural Effusion and Cardiomegaly In Ghana, Vietnam and the United States of America

论文作者

Akogo, Darlington, Sarkodie, Benjamin Dabo, Samori, Issah Abubakari, Jimah, Bashiru Babatunde, Anim, Dorothea Akosua, Mensah, Yaw Boateng

论文摘要

快速准确的心脏肿瘤和胸腔积液的诊断对于降低死亡率和医疗费用至关重要。人工智能在诊断医疗状况方面已显示出希望。通过这项研究，我们试图评估人工智能（AI）系统开发我的Minohealth AI实验室，将在诊断心脏肿大和胸膜积液的诊断中使用加纳，越南和美国的胸部X射线，以及与在加纳工作中工作相比，AI系统的表现如何。本研究中使用的评估数据集包含从三个数据集随机选择的100张图像。深度学习模型进一步测试了较大的加纳数据集，该数据集包含561（561）个样本。然后在评估数据集上对两个AI系统进行了评估，而我们还将评估数据集中的胸部X射线图像提供给4位放射科医生，并具有5 - 20年的经验，以独立诊断。对于心脏肿大，Minohealth-AI系统在接收器操作特征曲线（AUC-ROC）下为0.9和0.97评分，而单个放射科医生的AUC-ROC范围为0.77至0.87。对于胸腔积液，MinOhealth-AI系统得分为0.97和0.91，而单个放射科医生的得分在0.75至0.86之间。在这两种情况下，表现最佳的AI模型的表现都优于最佳性能放射科医生约10％。我们还评估了MinOhealt-AI系统和放射学家之间的特异性，灵敏度，负预测值（NPV）和正预测价值（PPV）。

A rapid and accurate diagnosis of cardiomegaly and pleural effusion is of the utmost importance to reduce mortality and medical costs. Artificial Intelligence has shown promise in diagnosing medical conditions. With this study, we seek to evaluate how well Artificial Intelligence (AI) systems, developed my minoHealth AI Labs, will perform at diagnosing cardiomegaly and pleural effusion, using chest x-rays from Ghana, Vietnam and the USA, and how well AI systems will perform when compared with radiologists working in Ghana. The evaluation dataset used in this study contained 100 images randomly selected from three datasets. The Deep Learning models were further tested on a larger Ghanaian dataset containing five hundred and sixty one (561) samples. Two AI systems were then evaluated on the evaluation dataset, whilst we also gave the same chest x-ray images within the evaluation dataset to 4 radiologists, with 5 - 20 years experience, to diagnose independently. For cardiomegaly, minoHealth-ai systems scored Area under the Receiver operating characteristic Curve (AUC-ROC) of 0.9 and 0.97 while the AUC-ROC of individual radiologists ranged from 0.77 to 0.87. For pleural effusion, the minoHealth-ai systems scored 0.97 and 0.91 whereas individual radiologists scored between 0.75 and 0.86. On both conditions, the best performing AI model outperforms the best performing radiologist by about 10%. We also evaluate the specificity, sensitivity, negative predictive value (NPV), and positive predictive value (PPV) between the minoHealth-ai systems and radiologists.

下载PDF全文

下载文献需遵守相关版权规定

论文标题