论文标题

改进的广义射频估计器以解决依赖的协变量和失败时间结果错误

Improved Generalized Raking Estimators to Address Dependent Covariate and Failure-Time Outcome Error

论文作者

Oh, Eric J., Shepherd, Bryan E., Lumley, Thomas, Shaw, Pamela A.

论文摘要

使用电子健康记录(EHR)数据进行推断的生物医学研究通常由于测量误差而造成偏见。 EHR数据中存在的测量误差通常很复杂,由协变量中未知功能形式的误差和结果组成,这可能取决于。为了解决此类错误产生的偏差,最近已提出了广义耙子作为强大的方法,该方法可产生一致的估计,而无需对误差结构进行建模。我们提供了为什么这些先前提出的耙开估计器的理由,预计涉及事件指标分类的失败时间结果设置效率低下。我们提出了利用多个插补的耙开估计器,以将目标变量或辅助变量归为提高效率。我们还考虑了依赖结果的抽样设计,并研究了它们对耙估估计器的效率的影响,无论是否有多次插补。我们提出了一项广泛的数值研究,以检查各种测量误差设置中提出的估计器的性能。然后,我们将提出的方法应用于我们的激励环境中,在这些环境中,我们寻求通过范德比尔特综合护理诊所的电子健康记录数据来分析观察人群中的艾滋病毒结果。

Biomedical studies that use electronic health records (EHR) data for inference are often subject to bias due to measurement error. The measurement error present in EHR data is typically complex, consisting of errors of unknown functional form in covariates and the outcome, which can be dependent. To address the bias resulting from such errors, generalized raking has recently been proposed as a robust method that yields consistent estimates without the need to model the error structure. We provide rationale for why these previously proposed raking estimators can be expected to be inefficient in failure-time outcome settings involving misclassification of the event indicator. We propose raking estimators that utilize multiple imputation, to impute either the target variables or auxiliary variables, to improve the efficiency. We also consider outcome-dependent sampling designs and investigate their impact on the efficiency of the raking estimators, either with or without multiple imputation. We present an extensive numerical study to examine the performance of the proposed estimators across various measurement error settings. We then apply the proposed methods to our motivating setting, in which we seek to analyze HIV outcomes in an observational cohort with electronic health records data from the Vanderbilt Comprehensive Care Clinic.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源