论文标题
壮举:注意力编辑
FEAT: Face Editing with Attention
论文作者
论文摘要
最近,使用预审预告剂的发电机的潜在空间已被证明是基于GAN的面部操纵的有效手段。这种方法的成功在很大程度上依赖于发电机潜在空间轴的先天分离。但是,面部操纵通常只打算仅影响当地区域,而通用发电机则往往没有必要的空间分离。在本文中,我们建立在StyleGan Generator的基础上,并提出了一种方法,该方法明确鼓励面对操纵,通过合并学习的注意力图来专注于预期的区域。在编辑图像的一代中,注意图是指引导原始特征与修改后的膜之间混合的掩模。潜在空间编辑的指南是通过使用剪辑来实现的,该剪辑最近被证明对文本驱动的编辑有效。我们进行广泛的实验,并表明我们的方法只能通过参与相关区域来根据文本描述执行脱离且可控的面部操作。定性和定量实验结果都证明了我们的面部区域编辑方法比替代方法的优越性。
Employing the latent space of pretrained generators has recently been shown to be an effective means for GAN-based face manipulation. The success of this approach heavily relies on the innate disentanglement of the latent space axes of the generator. However, face manipulation often intends to affect local regions only, while common generators do not tend to have the necessary spatial disentanglement. In this paper, we build on the StyleGAN generator, and present a method that explicitly encourages face manipulation to focus on the intended regions by incorporating learned attention maps. During the generation of the edited image, the attention map serves as a mask that guides a blending between the original features and the modified ones. The guidance for the latent space edits is achieved by employing CLIP, which has recently been shown to be effective for text-driven edits. We perform extensive experiments and show that our method can perform disentangled and controllable face manipulations based on text descriptions by attending to the relevant regions only. Both qualitative and quantitative experimental results demonstrate the superiority of our method for facial region editing over alternative methods.