论文标题
Covault:一个安全的分析平台
CoVault: A Secure Analytics Platform
论文作者
论文摘要
个人数据的分析(例如个人的流动性,财务和健康数据)对社会有很大的好处。这些数据已经由当今智能手机,应用程序和服务收集,但是到目前为止,自由社会尚未使其用于大规模分析。可以说,这至少部分是由于缺乏分析平台,该平台可以通过透明的技术手段(理想情况下具有分散的信任)来保护数据,执行源策略,处理数百万个不同的数据源,以及对数十亿个具有可接受查询等待时间的记录进行查询。为了弥合这一差距,我们提出了一个名为Covault的分析平台,该平台将安全的多方计算(MPC)与基于信任的信任授权(TEE)授权,以便在数据中心内部的个人贡献的加密数据中执行批准的查询,以实现上述属性。我们表明,尽管MPC的成本很高,但Covault尺度表现良好。例如,Covault可以使用每20,000人的核心对连续地处理与8000万人口(约11.85b数据记录/天)的流行病分析相关的数据。与最先进的MPC平台相比,Covault可以处理7至100倍以上的查询,以及许多来源和大数据的比例。
Analytics on personal data, such as individuals' mobility, financial, and health data can be of significant benefit to society. Such data is already collected by smartphones, apps and services today, but liberal societies have so far refrained from making it available for large-scale analytics. Arguably, this is due at least in part to the lack of an analytics platform that can secure data through transparent, technical means (ideally with decentralized trust), enforce source policies, handle millions of distinct data sources, and run queries on billions of records with acceptable query latencies. To bridge this gap, we present an analytics platform called CoVault which combines secure multi-party computation (MPC) with trusted execution environment (TEE)-based delegation of trust to be able execute approved queries on encrypted data contributed by individuals within a datacenter to achieve the above properties. We show that CoVault scales well despite the high cost of MPC. For example, CoVault can process data relevant to epidemic analytics for a country of 80M people (about 11.85B data records/day) on a continuous basis using a core pair for every 20,000 people. Compared to a state-of-the-art MPC-based platform, CoVault can process queries between 7 to over 100 times faster, as well as scale to many sources and big data.