论文标题

在大数据时代构建合成人群

Constructing synthetic populations in the age of big data

论文作者

Nicolaie, M. A., Fussenich, Koen, Ameling, Caroline, Boshuizen, Hendriek C.

论文摘要

为了使用微观模拟开发公共卫生干预模型,需要有关居民的广泛个人信息,例如社会人口统计学,经济和健康数据。数据机密性是此类数据的重要特征,而数据应支持现实的情况。此类数据的收集只有在有安全的环境中才有可能,而不直接用于外部微型模拟模型。本文的目的是通过基于有关整个荷兰人口的健康和社会经济决定因素的机密数据来预测单个特征来说明构建合成数据的方法。

To develop public health intervention models using microsimulations, extensive personal information about inhabitants is needed, such as socio-demographic, economic and health figures. Data confidentiality is an essential characteristic of such data, while the data should support realistic scenarios. Collection of such data is possible only in secured environments and not directly available for external micro-simulation models. The aim of this paper is to illustrate a method for construction of synthetic data by predicting individual features through models based on confidential data on health and socio-economic determinants of the entire Dutch population.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源