论文标题

关于体现药物的动作序列的感觉交通量

On the Sensory Commutativity of Action Sequences for Embodied Agents

论文作者

Caselles-Dupré, Hugo, Garcia-Ortiz, Michael, Filliat, David

论文摘要

对人造代理的感知是AI研究的巨大挑战之一。深度学习和数据驱动的方法成功地在有限的问题上可以使用监督来学习感知,但不能扩展到开放世界。在这种情况下,对于具有第一人称传感器的自主体现的代理,可以端对端学习感知来解决特定任务。然而,文献表明,感知不是纯粹的被动压缩机制,并且在抽象表示的制定中,行动起着重要作用。我们建议根据群体理论的数学形式主义研究这些体现的药物的感知,以便在感知和行动之间建立联系。特别是,我们考虑了连续作用序列相对于这种体现药物感知的感官信息的交换特性。我们介绍了感官通勤概率(SCP)标准,该标准衡量了代理人的自由度影响体现场景中的环境。我们展示了如何在不同环境(包括现实的机器人设置)中计算此标准。我们从经验上说明了如何使用SCP和动作序列的交换性能来学习环境中的对象并提高增强学习中的样本效率。

Perception of artificial agents is one the grand challenges of AI research. Deep Learning and data-driven approaches are successful on constrained problems where perception can be learned using supervision, but do not scale to open-worlds. In such case, for autonomous embodied agents with first-person sensors, perception can be learned end-to-end to solve particular tasks. However, literature shows that perception is not a purely passive compression mechanism, and that actions play an important role in the formulation of abstract representations. We propose to study perception for these embodied agents, under the mathematical formalism of group theory in order to make the link between perception and action. In particular, we consider the commutative properties of continuous action sequences with respect to sensory information perceived by such an embodied agent. We introduce the Sensory Commutativity Probability (SCP) criterion which measures how much an agent's degree of freedom affects the environment in embodied scenarios. We show how to compute this criterion in different environments, including realistic robotic setups. We empirically illustrate how SCP and the commutative properties of action sequences can be used to learn about objects in the environment and improve sample-efficiency in Reinforcement Learning.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源