实验设计网络：用于在网络限制下服务异质学习者的范式

论文标题

实验设计网络：用于在网络限制下服务异质学习者的范式

Experimental Design Networks: A Paradigm for Serving Heterogeneous Learners under Networking Constraints

论文作者

Liu, Yuezhou, Li, Yuanyuan, Su, Lili, Yeh, Edmund, Ioannidis, Stratis

论文摘要

边缘计算功能的重大进展使学习能够在地理上不同的位置进行。通常，这些学习任务中所需的培训数据不仅是异质的，而且不是本地产生的。在本文中，我们提出了一个实验设计网络范式，其中学习者节点可能通过消耗网络上数据源节点生成的数据流而训练可能不同的贝叶斯线性回归模型。我们将此问题提出为社会福利优化问题，其中全局目标被定义为个别学习者的实验设计目标之和，而决策变量是受网络约束的数据传输策略。我们首先表明，假设泊松数据流，全局目标是连续的DR-Submodular函数。然后，我们提出了一种Frank-Wolfe型算法，该算法将解决方案从最佳的1-1/E因子输出。我们的算法包含一个新颖的梯度估计分量，该梯度估计成分是根据泊松尾界和采样精心设计的。最后，我们通过广泛的实验补充了理论发现。我们的数值评估表明，所提出的算法在最大化全球目标和受过训练的模型的质量方面都优于几种基线算法。

Significant advances in edge computing capabilities enable learning to occur at geographically diverse locations. In general, the training data needed in those learning tasks are not only heterogeneous but also not fully generated locally. In this paper, we propose an experimental design network paradigm, wherein learner nodes train possibly different Bayesian linear regression models via consuming data streams generated by data source nodes over a network. We formulate this problem as a social welfare optimization problem in which the global objective is defined as the sum of experimental design objectives of individual learners, and the decision variables are the data transmission strategies subject to network constraints. We first show that, assuming Poisson data streams, the global objective is a continuous DR-submodular function. We then propose a Frank-Wolfe type algorithm that outputs a solution within a 1-1/e factor from the optimal. Our algorithm contains a novel gradient estimation component which is carefully designed based on Poisson tail bounds and sampling. Finally, we complement our theoretical findings through extensive experiments. Our numerical evaluation shows that the proposed algorithm outperforms several baseline algorithms both in maximizing the global objective and in the quality of the trained models.

下载PDF全文

下载文献需遵守相关版权规定

论文标题