论文标题
快速分布式土匪用于在线推荐系统
Fast Distributed Bandits for Online Recommendation Systems
论文作者
论文摘要
上下文匪徒算法通常用于推荐系统中,其中内容受欢迎程度可能会迅速变化。这些算法基于与他们两者相关的上下文,不断地学习用户和项目之间的潜在映射。学习用户之间的聚类或社交结构的最新建议算法表现出更高的建议准确性。但是,随着环境中用户和项目的数量增加,生成建议所需的时间大大恶化。结果,这些不能在实践中部署。最先进的分布式强盗算法-DCCB-依靠点对点网络在分布式工人之间共享信息。但是,这种方法随着用户数量的增加而不断扩展。此外,它遭受了簇的缓慢发现,导致准确性降解。为了解决上述问题,本文提出了一种称为DistClub的新型分布式匪徒算法。该算法懒惰地以分布式的方式创建了簇,并大大减少了网络数据共享需求,从而实现了高可扩展性。此外,DistClub的发现群体比最先进的算法更快,获得了更好的准确性。对现实世界的基准和合成数据集的评估表明,DistClub平均比DCCB快8.87倍,并且达到14.5%的归一化预测性能。
Contextual bandit algorithms are commonly used in recommender systems, where content popularity can change rapidly. These algorithms continuously learn latent mappings between users and items, based on contexts associated with them both. Recent recommendation algorithms that learn clustering or social structures between users have exhibited higher recommendation accuracy. However, as the number of users and items in the environment increases, the time required to generate recommendations deteriorates significantly. As a result, these cannot be deployed in practice. The state-of-the-art distributed bandit algorithm - DCCB - relies on a peer-to-peer net-work to share information among distributed workers. However, this approach does not scale well with the increasing number of users. Furthermore, it suffers from slow discovery of clusters, resulting in accuracy degradation. To address the above issues, this paper proposes a novel distributed bandit-based algorithm called DistCLUB. This algorithm lazily creates clusters in a distributed manner, and dramatically reduces the network data sharing requirement, achieving high scalability. Additionally, DistCLUB finds clusters much faster, achieving better accuracy than the state-of-the-art algorithm. Evaluation over both real-world benchmarks and synthetic datasets shows that DistCLUB is on average 8.87x faster than DCCB, and achieves 14.5% higher normalized prediction performance.