论文标题
帕累托有效的采集功能,用于成本吸引的贝叶斯优化
Pareto-efficient Acquisition Functions for Cost-Aware Bayesian Optimization
论文作者
论文摘要
贝叶斯优化(BO)是一种优化昂贵的黑盒功能的流行方法。它在隐式假设中有效地调整了机器学习算法,即高参数评估的成本大致相同。实际上,在时间,美元或能源方面,评估不同的超参数的成本都可以跨越几个数量级的差异。尽管已经提出了许多启发式方法来使BO成本了解,但这些启发式方法均未被证明是坚固的。在这项工作中,我们从帕累托效率方面重新调整了成本吸引力的BO,并引入了Pareto Front,这是一种数学对象,使我们能够强调常用的采集功能的缺点。基于此,我们提出了一种新型的帕累托效率改编的预期改进。在144个现实世界的黑框功能优化问题上,我们表明,帕累托有效的采集功能极大地超过了以前的解决方案,提高了高达50%的速度,同时提供了对成本准确性折衷的更好控制。我们还重新审视了高斯流程成本模型的共同选择,这表明简单,低变化的成本模型有效地预测了培训时间。
Bayesian optimization (BO) is a popular method to optimize expensive black-box functions. It efficiently tunes machine learning algorithms under the implicit assumption that hyperparameter evaluations cost approximately the same. In reality, the cost of evaluating different hyperparameters, be it in terms of time, dollars or energy, can span several orders of magnitude of difference. While a number of heuristics have been proposed to make BO cost-aware, none of these have been proven to work robustly. In this work, we reformulate cost-aware BO in terms of Pareto efficiency and introduce the cost Pareto Front, a mathematical object allowing us to highlight the shortcomings of commonly used acquisition functions. Based on this, we propose a novel Pareto-efficient adaptation of the expected improvement. On 144 real-world black-box function optimization problems we show that our Pareto-efficient acquisition functions significantly outperform previous solutions, bringing up to 50% speed-ups while providing finer control over the cost-accuracy trade-off. We also revisit the common choice of Gaussian process cost models, showing that simple, low-variance cost models predict training times effectively.