论文标题
Transeditor:基于变压器的双空间GAN,用于高度可控的面部编辑
TransEditor: Transformer-Based Dual-Space GAN for Highly Controllable Facial Editing
论文作者
论文摘要
诸如Stylegan之类的最新进展促进了可控的面部编辑的增长。为了解决其属性在一个潜在空间中解耦的核心挑战,已经尝试采用双空间gan以更好地解开样式和内容表示形式。但是,这些方法仍然无法获得具有高可控性的合理编辑结果,尤其是对于复杂属性。在这项研究中,我们强调了双空GAN中相互作用的重要性,以进行更可控制的编辑。我们提出了Transeditor,这是一种基于变压器的新型框架,以增强这种相互作用。此外,我们开发了一种新的双空间编辑和反转策略,以提供额外的编辑灵活性。广泛的实验证明了所提出的框架在图像质量和编辑能力方面的优越性,这表明了跨性别者对高度可控的面部编辑的有效性。
Recent advances like StyleGAN have promoted the growth of controllable facial editing. To address its core challenge of attribute decoupling in a single latent space, attempts have been made to adopt dual-space GAN for better disentanglement of style and content representations. Nonetheless, these methods are still incompetent to obtain plausible editing results with high controllability, especially for complicated attributes. In this study, we highlight the importance of interaction in a dual-space GAN for more controllable editing. We propose TransEditor, a novel Transformer-based framework to enhance such interaction. Besides, we develop a new dual-space editing and inversion strategy to provide additional editing flexibility. Extensive experiments demonstrate the superiority of the proposed framework in image quality and editing capability, suggesting the effectiveness of TransEditor for highly controllable facial editing.