论文标题
智能泵:MHealth的实用汤普森抽样
IntelligentPooling: Practical Thompson Sampling for mHealth
论文作者
论文摘要
在移动健康(MHealth)中,智能设备随着时间的推移反复提供行为治疗,以帮助用户采用并维持健康的行为。强化学习似乎是学习如何最佳做出这些顺序治疗决策的理想选择。但是,在有效地部署在移动医疗环境中,必须克服重大挑战。在这项工作中,我们关注以下挑战:1)在同一语境中的个人对治疗的反应不同。为了应对这些挑战,我们将汤普森采样匪徒概括以开发智能泵。智能工具学习个性化的治疗政策,从而解决了挑战。为了应对第二个挑战,智能泵更新每个用户的个性化程度,同时利用其他用户的可用数据来加快学习的速度。最后,IntellighentPool允许自开始治疗以来用户时间的函数变化,从而解决了挑战三。我们表明,智能泵平均比最新的遗憾低26%。我们证明了这种方法的希望及其在实时临床试验中甚至向一小群用户学习的能力。
In mobile health (mHealth) smart devices deliver behavioral treatments repeatedly over time to a user with the goal of helping the user adopt and maintain healthy behaviors. Reinforcement learning appears ideal for learning how to optimally make these sequential treatment decisions. However, significant challenges must be overcome before reinforcement learning can be effectively deployed in a mobile healthcare setting. In this work we are concerned with the following challenges: 1) individuals who are in the same context can exhibit differential response to treatments 2) only a limited amount of data is available for learning on any one individual, and 3) non-stationary responses to treatment. To address these challenges we generalize Thompson-Sampling bandit algorithms to develop IntelligentPooling. IntelligentPooling learns personalized treatment policies thus addressing challenge one. To address the second challenge, IntelligentPooling updates each user's degree of personalization while making use of available data on other users to speed up learning. Lastly, IntelligentPooling allows responsivity to vary as a function of a user's time since beginning treatment, thus addressing challenge three. We show that IntelligentPooling achieves an average of 26% lower regret than state-of-the-art. We demonstrate the promise of this approach and its ability to learn from even a small group of users in a live clinical trial.