论文标题

肺癌分类器的放射性特征选择

Radiomic feature selection for lung cancer classifiers

论文作者

Shakir, Hina, Rasheed, Haroon, Khan, Tariq Mairaj Rasool

论文摘要

具有定量成像特征集成的机器学习方法最近对肺结节分类引起了很多关注。但是,关于有效特征的分类目的排名方法的文献缺乏研究。此外,还需要评估分类任务所需的最佳功能。在这项研究中,我们研究了被监督和无监督的特征选择技术对计算机断层扫描(CT)图像中结节分类的机器学习方法的影响。研究工作探讨了Naive Bayes和支持向量机(SVM)的分类性能,并接受了2、4、8、12、16和20的培训,这些特征来自受监督和无监督的排名方法。使用经过8个从监督功能排名方法选择的8个放射线特征训练的SVM实现了最佳分类结果,精度为100%。这项研究进一步表明,可以通过训练任何具有更少的放射线特征的SVM或天真的贝叶斯来实现非常好的结节分类。放射线特征从2到20的周期性增加不会改善分类结果,无论是使用监督还是无监督的排名方法进行选择。

Machine learning methods with quantitative imaging features integration have recently gained a lot of attention for lung nodule classification. However, there is a dearth of studies in the literature on effective features ranking methods for classification purpose. Moreover, optimal number of features required for the classification task also needs to be evaluated. In this study, we investigate the impact of supervised and unsupervised feature selection techniques on machine learning methods for nodule classification in Computed Tomography (CT) images. The research work explores the classification performance of Naive Bayes and Support Vector Machine(SVM) when trained with 2, 4, 8, 12, 16 and 20 highly ranked features from supervised and unsupervised ranking approaches. The best classification results were achieved using SVM trained with 8 radiomic features selected from supervised feature ranking methods and the accuracy was 100%. The study further revealed that very good nodule classification can be achieved by training any of the SVM or Naive Bayes with a fewer radiomic features. A periodic increment in the number of radiomic features from 2 to 20 did not improve the classification results whether the selection was made using supervised or unsupervised ranking approaches.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源