论文标题
部分可观测时空混沌系统的无模型预测
Human Image Generation: A Comprehensive Survey
论文作者
论文摘要
图像和视频综合已成为计算机视觉和机器学习社区中的一个盛开的话题,以及由于其巨大的学术和应用程序价值,以及深层生成模型的发展。许多研究人员致力于将高保真人类图像综合为日常生活中最常见的对象类别之一,在这些对象类别中,基于各种模型,任务设置和应用程序进行了大量研究。因此,有必要对这些有关人类形象产生的这些变体方法进行全面概述。在本文中,我们将人类图像生成技术分为三个范式,即数据驱动的方法,知识引导的方法和混合方法。对于每个范式,都会呈现最具代表性的模型和相应的变体,其中用模型体系结构概述了不同方法的优点和特征。此外,总结了文献中主要的公共人类形象数据集和评估指标。此外,由于具有广泛的应用潜力,涵盖了合成人类图像的典型下游用法。最后,讨论了人类形象产生的挑战和潜在机会,以阐明未来的研究。
Image and video synthesis has become a blooming topic in computer vision and machine learning communities along with the developments of deep generative models, due to its great academic and application value. Many researchers have been devoted to synthesizing high-fidelity human images as one of the most commonly seen object categories in daily lives, where a large number of studies are performed based on various models, task settings and applications. Thus, it is necessary to give a comprehensive overview on these variant methods on human image generation. In this paper, we divide human image generation techniques into three paradigms, i.e., data-driven methods, knowledge-guided methods and hybrid methods. For each paradigm, the most representative models and the corresponding variants are presented, where the advantages and characteristics of different methods are summarized in terms of model architectures. Besides, the main public human image datasets and evaluation metrics in the literature are summarized. Furthermore, due to the wide application potentials, the typical downstream usages of synthesized human images are covered. Finally, the challenges and potential opportunities of human image generation are discussed to shed light on future research.