论文标题
T-SEA:基于转移对象检测的自我启动攻击
T-SEA: Transfer-based Self-Ensemble Attack on Object Detection
论文作者
论文摘要
与基于查询的黑盒攻击相比,基于转移的黑框攻击不需要任何攻击模型的信息,这确保了他们的保密性。但是,大多数现有基于转移的方法都依赖于结合多个模型来提高攻击转移性,这是时间和资源密集的,更不用说在同一任务上获得多种模型的困难。为了解决这一限制,在这项工作中,我们专注于对对象检测的基于单模传输的黑框攻击,仅利用一种模型来实现对多个黑色框检测器的高转移性对抗性攻击。具体而言,我们首先要观察现有方法的补丁优化过程,并通过稍微调整其训练策略提出增强的攻击框架。然后,我们使用常规模型优化将补丁优化类似,在输入数据,攻击模型和对抗贴片上提出了一系列自我汇总方法,以有效利用有限的信息并防止贴片过度拟合。实验结果表明,可以使用多种经典的基本攻击方法(例如PGD和MIM)应用所提出的框架,以极大地提高多个主流检测器上良好优化的斑块的黑盒可传递性,同时提高白盒性能。我们的代码可在https://github.com/vdigpku/t-sea上找到。
Compared to query-based black-box attacks, transfer-based black-box attacks do not require any information of the attacked models, which ensures their secrecy. However, most existing transfer-based approaches rely on ensembling multiple models to boost the attack transferability, which is time- and resource-intensive, not to mention the difficulty of obtaining diverse models on the same task. To address this limitation, in this work, we focus on the single-model transfer-based black-box attack on object detection, utilizing only one model to achieve a high-transferability adversarial attack on multiple black-box detectors. Specifically, we first make observations on the patch optimization process of the existing method and propose an enhanced attack framework by slightly adjusting its training strategies. Then, we analogize patch optimization with regular model optimization, proposing a series of self-ensemble approaches on the input data, the attacked model, and the adversarial patch to efficiently make use of the limited information and prevent the patch from overfitting. The experimental results show that the proposed framework can be applied with multiple classical base attack methods (e.g., PGD and MIM) to greatly improve the black-box transferability of the well-optimized patch on multiple mainstream detectors, meanwhile boosting white-box performance. Our code is available at https://github.com/VDIGPKU/T-SEA.