论文标题
检测具有差异MDL变化统计的变化迹象,以进行共同分析
Detecting Change Signs with Differential MDL Change Statistics for COVID-19 Pandemic Analysis
论文作者
论文摘要
我们关注的是从数据流中检测更改及其迹象的问题。例如,当给定一个地区的Covid-19病例的时间序列时,我们可能会通过检测案件变化的迹象来提高爆发的预警信号。我们提出了一种解决这个问题的新方法。关键思想是采用新的信息理论概念,我们称之为差分最小描述长度变化统计(D-MDL)来衡量变化符号的得分。我们首先给出了D-MDL的基本理论。然后,我们使用合成数据集证明其有效性。我们将其应用于COVID-19-19的流行病的预警信号。我们从经验上证明,D-MDL能够提高事件的预警信号,例如大幅增加/减少病例。值得注意的是,在37个研究国家中,大约64美元的案件大幅增加的事件中,我们的方法可以在事件发生之前平均检测到近六天,从而购买相当长的时间来做出回应。我们将警告信号与基本复制号$ r0 $和社会距离的时机联系起来。结果表明,我们的方法可以有效地监测$ r0 $的动态,并确认社会距离在一个地区占据流行方面的有效性。我们得出的结论是,从数据科学的角度来看,我们的方法是对大流行分析的有前途的方法。实验软件可在https://github.com/ibarakikenyukishi/differential-mdl-change-statistics获得。可以在https://ibarakikenyukishi.github.io/d-mdl-html/index.html上获得在线检测系统
We are concerned with the issue of detecting changes and their signs from a data stream. For example, when given time series of COVID-19 cases in a region, we may raise early warning signals of outbreaks by detecting signs of changes in the cases. We propose a novel methodology to address this issue. The key idea is to employ a new information-theoretic notion, which we call the differential minimum description length change statistics (D-MDL), for measuring the scores of change sign. We first give a fundamental theory for D-MDL. We then demonstrate its effectiveness using synthetic datasets. We apply it to detecting early warning signals of the COVID-19 epidemic. We empirically demonstrate that D-MDL is able to raise early warning signals of events such as significant increase/decrease of cases. Remarkably, for about $64\%$ of the events of significant increase of cases in 37 studied countries, our method can detect warning signals as early as nearly six days on average before the events, buying considerably long time for making responses. We further relate the warning signals to the basic reproduction number $R0$ and the timing of social distancing. The results showed that our method can effectively monitor the dynamics of $R0$, and confirmed the effectiveness of social distancing at containing the epidemic in a region. We conclude that our method is a promising approach to the pandemic analysis from a data science viewpoint. The software for the experiments is available at https://github.com/IbarakikenYukishi/differential-mdl-change-statistics. An online detection system is available at https://ibarakikenyukishi.github.io/d-mdl-html/index.html