论文标题
残留网络的有效抗氧化方法
An Effective Anti-Aliasing Approach for Residual Networks
论文作者
论文摘要
传统上,频域中的图像预处理在计算机视觉中起着至关重要的作用,甚至在深度学习的早期就成为标准管道的一部分。但是,随着大型数据集的出现,许多从业人员得出的结论是,这是不必要的,因为相信这些先验可以从数据本身中学到。频率混叠是当子采样任何信号(例如图像或特征图)导致子采样输出中的失真时可能发生的现象。我们表明,我们可以通过放置不可训练的模糊过滤器并在关键位置使用平滑的激活功能来减轻这种效果,尤其是在网络缺乏学习能力的情况下。这些简单的架构变化导致在Imagenet-C上的自然损坏[10]和Meta-Dataset上的自然损坏[17]的自然损坏[17]中的图像分类中的分布概括进行了实质性改善,而无需引入其他可训练的参数,也没有使用开源代码bases的默认超级参数。
Image pre-processing in the frequency domain has traditionally played a vital role in computer vision and was even part of the standard pipeline in the early days of deep learning. However, with the advent of large datasets, many practitioners concluded that this was unnecessary due to the belief that these priors can be learned from the data itself. Frequency aliasing is a phenomenon that may occur when sub-sampling any signal, such as an image or feature map, causing distortion in the sub-sampled output. We show that we can mitigate this effect by placing non-trainable blur filters and using smooth activation functions at key locations, particularly where networks lack the capacity to learn them. These simple architectural changes lead to substantial improvements in out-of-distribution generalization on both image classification under natural corruptions on ImageNet-C [10] and few-shot learning on Meta-Dataset [17], without introducing additional trainable parameters and using the default hyper-parameters of open source codebases.