论文标题

Tailornet:预测3D的衣服是人类姿势,形状和服装风格的函数

TailorNet: Predicting Clothing in 3D as a Function of Human Pose, Shape and Garment Style

论文作者

Patel, Chaitanya, Liao, Zhouyingcheng, Pons-Moll, Gerard

论文摘要

在本文中,我们提出了一种神经模型,该神经模型可预测3D的衣服变形,这是三个因素的函数:姿势,形状和样式(服装几何),同时保留皱纹细节。这超出了先前的模型,该模型要么特定于一种样式和形状,要么推广到不同形状,尽管样式特定,但仍会产生平滑的结果。我们的假设是(甚至是非线性的)示例组合(即使是细小的锻炼)等高频组成部分,这使得与三个因素共同学习。我们技术的核心是将变形分解为高频和低频分量。虽然低频组件是通过具有MLP的姿势,形状和样式参数预测的,但高频组件是用形状式特定姿势模型的混合物预测的。混合物的重量是用狭窄的带宽内核计算的,以确保仅将具有相似高频模式的预测组合在一起。样式变化是通过在规范的姿势中计算变形子空间来获得的,该子空间满足了物理约束,例如渗透 - 渗透和悬垂在身体上。 Tailornet提供了3D服装,这些服装保留了从基于物理的模拟(PBS)中学到的皱纹,同时运行的速度快1000倍以上。与PBS相比,Tailornet易于使用且完全可区分,这对于计算机视觉算法至关重要。几个实验表明,尾部的产生比先前的工作更现实的结果,甚至在Amass数据集的序列上产生了时间相干变形,尽管接受了其他数据集的静态姿势训练。为了刺激这一方向的进一步研究,我们将制作一个由55800帧组成的数据集,以及我们在https://virtualhumans.mpi-inf.mpg.de/tailornet上公开提供的模型。

In this paper, we present TailorNet, a neural model which predicts clothing deformation in 3D as a function of three factors: pose, shape and style (garment geometry), while retaining wrinkle detail. This goes beyond prior models, which are either specific to one style and shape, or generalize to different shapes producing smooth results, despite being style specific. Our hypothesis is that (even non-linear) combinations of examples smooth out high frequency components such as fine-wrinkles, which makes learning the three factors jointly hard. At the heart of our technique is a decomposition of deformation into a high frequency and a low frequency component. While the low-frequency component is predicted from pose, shape and style parameters with an MLP, the high-frequency component is predicted with a mixture of shape-style specific pose models. The weights of the mixture are computed with a narrow bandwidth kernel to guarantee that only predictions with similar high-frequency patterns are combined. The style variation is obtained by computing, in a canonical pose, a subspace of deformation, which satisfies physical constraints such as inter-penetration, and draping on the body. TailorNet delivers 3D garments which retain the wrinkles from the physics based simulations (PBS) it is learned from, while running more than 1000 times faster. In contrast to PBS, TailorNet is easy to use and fully differentiable, which is crucial for computer vision algorithms. Several experiments demonstrate TailorNet produces more realistic results than prior work, and even generates temporally coherent deformations on sequences of the AMASS dataset, despite being trained on static poses from a different dataset. To stimulate further research in this direction, we will make a dataset consisting of 55800 frames, as well as our model publicly available at https://virtualhumans.mpi-inf.mpg.de/tailornet.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源