人类行为的预测特征3D姿势

论文标题

人类行为的预测特征3D姿势

Forecasting Characteristic 3D Poses of Human Actions

论文作者

Diller, Christian, Funkhouser, Thomas, Dai, Angela

论文摘要

我们提出了预测特征3D姿势的任务：从对一个人的简短序列观察中，可以预测未来的3D姿势，以一种可能的动作定义，特征的姿势，例如，通过观察一个人捡起苹果的人，可以预测吃苹果的人的姿势。人类运动预测的先前工作估计未来以固定时间间隔构成的姿势。尽管易于定义，但这种逐帧的配方会混淆人类行动的时间和故意方面。取而代之的是，我们定义了一个具有语义上有意义的姿势预测任务，该任务将预测的姿势从时间上解脱出来，从目标指导的行为中汲取灵感。为了预测特征姿势，我们提出了一种概率方法，该方法模拟了可能特征姿势的分布中可能的多模式。然后，我们以自回归方式从预测的分布中对未来的假设进行采样，以从关节之间进行模型依赖性。为了评估我们的方法，我们构建了一个手动注释特征3D姿势的数据集。我们对该数据集进行的实验表明，我们提出的概率方法的平均表现要优于最先进的方法26％。

We propose the task of forecasting characteristic 3d poses: from a short sequence observation of a person, predict a future 3d pose of that person in a likely action-defining, characteristic pose -- for instance, from observing a person picking up an apple, predict the pose of the person eating the apple. Prior work on human motion prediction estimates future poses at fixed time intervals. Although easy to define, this frame-by-frame formulation confounds temporal and intentional aspects of human action. Instead, we define a semantically meaningful pose prediction task that decouples the predicted pose from time, taking inspiration from goal-directed behavior. To predict characteristic poses, we propose a probabilistic approach that models the possible multi-modality in the distribution of likely characteristic poses. We then sample future pose hypotheses from the predicted distribution in an autoregressive fashion to model dependencies between joints. To evaluate our method, we construct a dataset of manually annotated characteristic 3d poses. Our experiments with this dataset suggest that our proposed probabilistic approach outperforms state-of-the-art methods by 26% on average.

下载PDF全文

下载文献需遵守相关版权规定

论文标题