休息：通过基于RL的空间转换来改善黑匣子模型的性能

论文标题

休息：通过基于RL的空间转换来改善黑匣子模型的性能

REST: Performance Improvement of a Black Box Model via RL-based Spatial Transformation

论文作者

Kim, Jae Myung, Kim, Hyungjin, Park, Chanwoo, Lee, Jungwoo

论文摘要

近年来，深度神经网络（DNN）已成为一个高度活跃的研究领域，并在各种计算机视觉任务上取得了显着的成就。然而，已知DNN通常会过度自信但对分布样本的预测不正确，这可能是现实部署的主要障碍，因为与多样化的现实世界样本相比，培训数据集始终受到限制。因此，当我们在实践中构建DNN模型时，为训练时间和测试时间之间的分配变化提供了鲁棒性的保证。此外，在许多情况下，深度学习模型被部署为黑匣子，并且已经针对培训数据集进行了优化的性能，因此更改黑匣子本身可能会导致性能退化。我们在这里研究了给出黑框图像分类器的特定条件下的几何变换的鲁棒性。我们提出了一个额外的学习者，\ emph {增强空间变换学习者（REST）}，将扭曲的输入数据转换为Black-box模型被视为分布的样本。我们的工作旨在通过在任何黑匣子的面前添加一个REST模块来提高鲁棒性，并仅训练REST模块，而无需以端到端的方式重新训练原始的黑匣子模型，即我们尝试将现实世界中的数据转换为训练分布，而黑框模型的性能最适合。我们使用从黑框模型获得的置信分数来确定是否从分布中得出转换后的输入。我们从经验上表明，我们的方法在对几何变换和样品效率上的概括方面具有优势。

In recent years, deep neural networks (DNN) have become a highly active area of research, and shown remarkable achievements on a variety of computer vision tasks. DNNs, however, are known to often make overconfident yet incorrect predictions on out-of-distribution samples, which can be a major obstacle to real-world deployments because the training dataset is always limited compared to diverse real-world samples. Thus, it is fundamental to provide guarantees of robustness to the distribution shift between training and test time when we construct DNN models in practice. Moreover, in many cases, the deep learning models are deployed as black boxes and the performance has been already optimized for a training dataset, thus changing the black box itself can lead to performance degradation. We here study the robustness to the geometric transformations in a specific condition where the black-box image classifier is given. We propose an additional learner, \emph{REinforcement Spatial Transform learner (REST)}, that transforms the warped input data into samples regarded as in-distribution by the black-box models. Our work aims to improve the robustness by adding a REST module in front of any black boxes and training only the REST module without retraining the original black box model in an end-to-end manner, i.e. we try to convert the real-world data into training distribution which the performance of the black-box model is best suited for. We use a confidence score that is obtained from the black-box model to determine whether the transformed input is drawn from in-distribution. We empirically show that our method has an advantage in generalization to geometric transformations and sample efficiency.

下载PDF全文

下载文献需遵守相关版权规定

论文标题