美文网首页影像组学学习笔记
影像组学学习笔记(37)-机器学习模型判断脑卒中发病时间(文献报

影像组学学习笔记(37)-机器学习模型判断脑卒中发病时间(文献报

作者: 北欧森林 | 来源:发表于2021-01-31 06:12 被阅读0次

本笔记来源于B站Up主: 有Li 的影像组学系列教学视频
本节(37)主要介绍一篇文献,关于 机器学习模型判断脑卒中发病时间

文献题目:Machine Learning Approach to Identify Stroke Within 4.5 Hours, 2019年12月发表在 Stoke 上。

研究的目的为:

to investigate the ability of machine learning techniques analyzing diffusion weighted imaging (DWI) and fluid-attenuated inversion recovery (FLAIR) MRI to identify patients within the recommended time window for thrombolysis.

由于专业的不同,重点学习一下这篇文献的方法学部分及相应的结果描述吧!

方法(Methods)
  1. 首先是研究的整体流程图


    literature_1.png
  2. 图像主要处理流程为:
    A. Infarct regions were automatically segmented on the ADC maps by applying normalized absolute thresholding.
    B. A quantile curve of ADC intensities within the brain mask was constructed for each ADC map of each subject, and an intersection point between 2 tangent lines with maximum and minimum differential coeffcients was identifed on each quantile-intensity curve. (这句没有看懂,大概意思是交代自动分割阈值是如何选择的)
    C. The ADC maps were normalized
    D. Each normalized ADC map was thresholded at the optimal absolute value of 0.845.
    E. FLAIR images were coregistered onto ADC maps.


    image.png
  3. Generation of Ratio Maps

Ratio maps were constructed by reflecting the image around the ftted midsagittal plane, resulting in quantitative comparisons of the relative signals of the infarct regions and the contralateral side.

  1. Extraction of Imaging Features

... extracted from the registered FLAIR images and FLAIR ratio maps included intensity, gradient, and texture information...

image.png
  1. Machine Learning
    交代了使用的机器学习算法,以及它们各自的特点。

...cluster analysis including LR, and modern classifcation theories including SVM which
needs feature selection and RF which needs not feature selection

  1. Human Visual Assessment of DWI-FLAIR Mismatch
    描述了人类视觉评价的过程,以及mismatch的定义

A DWI-FLAIR mismatch was defned as the presence of a visible acute ischemic lesion on DWI but no traceable parenchymal hyperintensity in the corresponding region on FLAIR imaging

7.统计方法描述:
A. 研究者使用了单因素t检验对89个特征进行筛选,并使用了Bonferroni correction对p值进行了校正。(这里研究者把p值设在了0.2,如果是常规的0.05,然后再除以89,会非常小);如果校正后特征数目小于5个,则根据p值排序,选择前5个。
B. 机器学习模型和人类视觉的表现差异。
C. Youden index来决定cutoff 值(但是后文好像没有交代结果)。

结果(Results):
  1. 首先对分组情况(training set, test set)进行了描述,基线资料、target的分布等。自然,两组之间没有什么统计学差异。
  2. 单因素分析进行特征筛选,共筛选到34个特征用于ML建模(LR, SVM,RF)。
  3. 每个类别中表现最优的用于测试集中进行评价。尽管RF的AUROC最大,但是三者之间并没有统计学差异(如图)


    image.png

    和人类视觉判断相比,ML模型在sensitivity和NPV上表现优良,但在specificity和PPV上略逊一筹。


    image.png

拓展:
在仔细研读文献的统计方法部分时,发现在机器学习领域,有很多统计方法和我们平时用的不太一样。

The Bonferroni correction compensates for that increase by testing each individual hypothesis at a significance level of α/m, where α is the desired overall alpha level and m is the number of hypotheses. For example, if a trial is testing m=20 hypotheses with a desired α=0.05, then the Bonferroni correction would test each individual hypothesis at α=0.05/20=0.0025.(from wikipedia)

  1. Bonferroni correction 可以称作是“最简单粗暴有效”的校正方法,它拒绝了所有的假阳性结果发生的可能性,通过对p值的阈值进行校正来实现消除假阳性结果,但是该方法比较保守,容易错误地接受零假设。用的更多的是Holm's-Bonferroni,比前者要更容易发现显著差异。(知乎@123456@Sichao Song)

Sensitivity and specifcity were compared using McNemar tests, and positive predictive value and negative predictive value were compared using the Generalized Score Statistics method, as appropriate.

  1. McNemar test 和卡方检验的区别:
  • The McNemar is not testing for independence, but consistency in responses across two variables.
  • Although Chi-Square tests can be used for larger tables, McNemar tests can only be used for a 2×2 table.
  1. 这里的the Generalized Score Statistics method应该和 Generalized score tests 是一回事(没有找到相应的中文名)。它属于半参数方法,不受总体分布影响;最常用于重复测量数据。

Positive and negative predictive values are important measures of a medical diagnostic test performance. We consider testing equality of two positive or two negative predictive values within a paired design in which all patients receive two diagnostic tests. The existing statistical tests for testing equality of predictive values are either Wald tests based on the multinomial distribution or the empirical Wald and generalized score tests within the generalized estimating equations (GEE) framework. As presented in the literature, these test statistics have considerably complex formulas without clear intuitive insight. (ResearchGate@Andrzej S Kosinski)

参考资料:
如何计算McNemar检验,比较两种机器学习分类器
The Difference Between a Chi-Square Test and a McNemar Test

相关文章

  • 影像组学学习笔记(37)-机器学习模型判断脑卒中发病时间(文献报

    本笔记来源于B站Up主: 有Li[https://space.bilibili.com/542601735?fro...

  • 入门

    了解机器学习 标签需要通过机器学习模型判断出的结果 特征机器学习模型进行判断的条件(可以是很多的变量) 模型机器学...

  • 【机器学习六】决策树集成

      集成(ensemble)是合并多个机器学习模型来构建更强大模型的方法。在机器学习文献中有许多模型都属于这一类,...

  • 决策树集成

    集成(ensemble)是合并多个机器学习模型来构建更强大模型的方法。在机器学习文献中有许多模型都属于这一类,但已...

  • C#机器学习之判断日报是否合格

    简单来说机器学习的核心步骤在于“获取学习数据;选择机器算法;定型模型;评估模型,预测模型结果”,下面本人就以判断日...

  • Spark机器学习笔记

    Spark机器学习笔记 机器学习 机器学习是数据通过算法构建出模型并对模型进行评估,评估的性能如果达到要求就拿这个...

  • 影像组学笔记摘抄(1)

    以下内容是本人学习影像组学、机器学习知识过程中摘录的“金句”或者一些结论性的东西。由于时间不一,有些摘录注明了原始...

  • [机器学习入门] 李宏毅机器学习笔记-25(Sturctured

    [机器学习入门] 李宏毅机器学习笔记-25(Sturctured Linear Model;结构化预测-线性模型)...

  • 0-机器学习启蒙-导学

    机器学习-导学 机器学习工资高。 教学方法: 理论结合实际。 回归模型 分类模型 聚类模型 推荐系统 在大家把理论...

  • 1. 概述

    1 机器学习机器学习是一种统计学方法、计算机利用已有数据,得出某种模型,再利用此模型预测结果。【机器学习过程】 【...

网友评论

    本文标题:影像组学学习笔记(37)-机器学习模型判断脑卒中发病时间(文献报

    本文链接:https://www.haomeiwen.com/subject/lyxrzktx.html