论文标题
长尾用户的混合兴趣建模
Hybrid Interest Modeling for Long-tailed Users
论文作者
论文摘要
用户行为建模是推荐系统的关键技术。但是,大多数方法都集中在具有大规模互动的首席用户上,因此遇到了数据稀疏问题。几种解决方案整合了辅助信息,例如人口统计特征和产品评论,另一个解决方案是从其他丰富的数据源转移知识。我们认为,当前方法受严格的隐私政策的限制,并且在现实世界应用程序中具有较低的可扩展性,很少有作品考虑长尾用户背后的行为特征。在这项工作中,我们提出了混合利益建模(HIM)网络,以在推荐中学习长尾用户的偏好,既有个性化的兴趣和半个性化的兴趣。为了实现这一目标,我们首先设计用户行为金字塔(UBP)模块,以捕获从稀疏的嘈杂的积极反馈中获得高元素的个性化兴趣。此外,个人交互太稀疏,不足以对用户兴趣进行充分的建模,我们设计了用户行为聚类(UBC)模块,以通过新颖的自我监督学习机制来学习潜在的用户兴趣组,从而从组 - 项目互动数据中捕获了粗粒的半人体兴趣。与最先进的基线相比,公共和工业数据集的广泛实验验证了他的优势。
User behavior modeling is a key technique for recommender systems. However, most methods focus on head users with large-scale interactions and hence suffer from data sparsity issues. Several solutions integrate side information such as demographic features and product reviews, another is to transfer knowledge from other rich data sources. We argue that current methods are limited by the strict privacy policy and have low scalability in real-world applications and few works consider the behavioral characteristics behind long-tailed users. In this work, we propose the Hybrid Interest Modeling (HIM) network to hybrid both personalized interest and semi-personalized interest in learning long-tailed users' preferences in the recommendation. To achieve this, we first design the User Behavior Pyramid (UBP) module to capture the fine-grained personalized interest of high confidence from sparse even noisy positive feedbacks. Moreover, the individual interaction is too sparse and not enough for modeling user interest adequately, we design the User Behavior Clustering (UBC) module to learn latent user interest groups with self-supervised learning mechanism novelly, which capture coarse-grained semi-personalized interest from group-item interaction data. Extensive experiments on both public and industrial datasets verify the superiority of HIM compared with the state-of-the-art baselines.