摊销推理中的概括差距

论文标题

摊销推理中的概括差距

Generalization Gap in Amortized Inference

论文作者

Zhang, Mingtian, Hayes, Peter, Barber, David

论文摘要

基于可能性的概率模型将其推广到看不见的数据的能力对于许多机器学习应用（例如无损压缩）至关重要。在这项工作中，我们研究了流行的概率模型类别的概括 - 变异自动编码器（VAE）。我们讨论了影响VAE的两个概括差距，并表明过度拟合通常由摊销推理主导。基于这一观察结果，我们提出了一个新的培训目标，以改善摊销推断的概括。我们演示了我们的方法如何在图像建模和无损压缩的背景下提高性能。

The ability of likelihood-based probabilistic models to generalize to unseen data is central to many machine learning applications such as lossless compression. In this work, we study the generalization of a popular class of probabilistic model - the Variational Auto-Encoder (VAE). We discuss the two generalization gaps that affect VAEs and show that overfitting is usually dominated by amortized inference. Based on this observation, we propose a new training objective that improves the generalization of amortized inference. We demonstrate how our method can improve performance in the context of image modeling and lossless compression.

下载PDF全文

下载文献需遵守相关版权规定

论文标题