论文标题

可重现的研究:回顾性

Reproducible Research: A Retrospective

论文作者

Peng, Roger D., Hicks, Stephanie C.

论文摘要

在过去的几十年中,计算技术的快速进步刺激了科学中的两种非凡现象:大规模和高通量数据收集以及创建和实施复杂的统计算法以进行数据分析。这两种现象共同在科学发现方面带来了巨大进步,但也引起了两个严重的问题,一个相对较新,一个非常熟悉。现代数据分析的复杂性提出了有关分析可重复性的问题,这意味着独立分析师使用原始数据和分析技术重新创建原始作者所主张的结果的能力。虽然看似直接的概念,但分析的可重复性通常会因缺乏分析中使用的数据和计算机代码的可用性而挫败。更普遍的关注点是科学发现的可复制性,这涉及通过完全独立的研究确认科学主张的频率。尽管复制性和可复制性的概念是相关的,但值得注意的是,它们专注于完全不同的目标并解决了科学进步的不同方面。在这篇综述中,我们将讨论可再现研究的起源,表征了公共卫生研究中再现性的当前状态,并将重生与当前对科学发现的可复制性的关注联系起来。最后,我们描述了未来公共卫生研究的可重复性和可复制性的前进道路。

Rapid advances in computing technology over the past few decades have spurred two extraordinary phenomena in science: large-scale and high-throughput data collection coupled with the creation and implementation of complex statistical algorithms for data analysis. Together, these two phenomena have brought about tremendous advances in scientific discovery but have also raised two serious concerns, one relatively new and one quite familiar. The complexity of modern data analyses raises questions about the reproducibility of the analyses, meaning the ability of independent analysts to re-create the results claimed by the original authors using the original data and analysis techniques. While seemingly a straightforward concept, reproducibility of analyses is typically thwarted by the lack of availability of the data and computer code that were used in the analyses. A much more general concern is the replicability of scientific findings, which concerns the frequency with which scientific claims are confirmed by completely independent investigations. While the concepts of reproduciblity and replicability are related, it is worth noting that they are focused on quite different goals and address different aspects of scientific progress. In this review, we will discuss the origins of reproducible research, characterize the current status of reproduciblity in public health research, and connect reproduciblity to current concerns about replicability of scientific findings. Finally, we describe a path forward for improving both the reproducibility and replicability of public health research in the future.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源