Paper Title
Pedagogical Demonstrations and Pragmatic Learning in Artificial Tutor-Learner Interactions
Paper Authors
Abstract
When demonstrating a task, human tutors pedagogically modify their behavior, either by "showing" the task rather than just "doing" it (exaggerating the relevant parts of the demonstration) or by giving demonstrations that best disambiguate the communicated goal. Analogously, human learners pragmatically infer the communicative intent of the tutor: they interpret what the tutor is trying to teach them and deduce relevant information for learning. Without such mechanisms, traditional Learning from Demonstration (LfD) algorithms will consider such demonstrations sub-optimal. In this paper, we investigate the implementation of such mechanisms in a tutor-learner setup where both participants are artificial agents in an environment with multiple goals. Using pedagogy from the tutor and pragmatism from the learner, we show substantial improvements over standard learning from demonstrations.