发现针对语义图像合成的特定类GAN控件

论文标题

发现针对语义图像合成的特定类GAN控件

Discovering Class-Specific GAN Controls for Semantic Image Synthesis

论文作者

Schönfeld, Edgar, Borges, Julio, Sushko, Vadim, Schiele, Bernt, Khoreva, Anna

论文摘要

先前的工作已广泛研究了无条件图像合成的gan的潜在空间结构，从而通过无监督的可解释潜在方向实现了生成的图像的全局编辑。但是，发现有条件gan的语义图像合成（SIS）的潜在方向尚未得到探索。在这项工作中，我们专门专注于解决这一差距。我们提出了一种新型优化方法，用于在预算的SIS模型的潜在空间中找到空间分离的类别特异性方向。我们表明，我们方法发现的潜在方向可以有效地控制语义类别的局部外观，例如，彼此独立地改变其内部结构，纹理或颜色。对各种数据集发现的GAN控件的视觉检查和定量评估表明，我们的方法发现了一套各种独特和语义上有意义的潜在特定编辑的潜在方向。

Prior work has extensively studied the latent space structure of GANs for unconditional image synthesis, enabling global editing of generated images by the unsupervised discovery of interpretable latent directions. However, the discovery of latent directions for conditional GANs for semantic image synthesis (SIS) has remained unexplored. In this work, we specifically focus on addressing this gap. We propose a novel optimization method for finding spatially disentangled class-specific directions in the latent space of pretrained SIS models. We show that the latent directions found by our method can effectively control the local appearance of semantic classes, e.g., changing their internal structure, texture or color independently from each other. Visual inspection and quantitative evaluation of the discovered GAN controls on various datasets demonstrate that our method discovers a diverse set of unique and semantically meaningful latent directions for class-specific edits.

下载PDF全文

下载文献需遵守相关版权规定

论文标题