基于模型的公正学习排名

论文标题

基于模型的公正学习排名

Model-based Unbiased Learning to Rank

论文作者

Luo, Dan, Zou, Lixin, Ai, Qingyao, Chen, Zhiyu, Yin, Dawei, Davison, Brian D.

论文摘要

公正的学习排名（ULTR）学习以偏见的用户反馈数据对文档进行排名是信息检索的众所周知的挑战。无偏学习中的现有方法通常依赖点击建模或反向倾向加权（IPW）。不幸的是，搜索引擎面临着严重的长尾查询分布，在这里，点击建模和IPW都无法很好地处理。单击模型受到数据稀疏问题的影响，因为同一查询文件对在尾部查询上出现有限的时间； IPW患有高方差问题，因为它对小倾向分数值高度敏感。因此，在尾部查询下运行良好的一般性辩护框架迫切需要。为了解决这个问题，我们提出了一个基于模型的无偏学习对框架的框架。具体来说，我们开发了一个通用的上下文感知用户模拟器，以生成未观察到的排名列表以训练排名者的伪单击，这解决了数据稀疏问题。此外，考虑到伪点击和实际点击之间的差异，我们将排名列表的观察视为治疗变量，并以双重稳健的方式与伪标签进一步合并了逆倾向加权。派生的偏差和方差表明，所提出的基于模型的方法比现有方法更强大。最后，在基准数据集（包括模拟数据集和真实点击日志）上进行的大量实验证明了基于模型的方法在各种情况下都始终执行优于最先进的方法。该代码可在https://github.com/rowedenny/multr上找到。

Unbiased Learning to Rank (ULTR) that learns to rank documents with biased user feedback data is a well-known challenge in information retrieval. Existing methods in unbiased learning to rank typically rely on click modeling or inverse propensity weighting (IPW). Unfortunately, the search engines are faced with severe long-tail query distribution, where neither click modeling nor IPW can handle well. Click modeling suffers from data sparsity problem since the same query-document pair appears limited times on tail queries; IPW suffers from high variance problem since it is highly sensitive to small propensity score values. Therefore, a general debiasing framework that works well under tail queries is in desperate need. To address this problem, we propose a model-based unbiased learning-to-rank framework. Specifically, we develop a general context-aware user simulator to generate pseudo clicks for unobserved ranked lists to train rankers, which addresses the data sparsity problem. In addition, considering the discrepancy between pseudo clicks and actual clicks, we take the observation of a ranked list as the treatment variable and further incorporate inverse propensity weighting with pseudo labels in a doubly robust way. The derived bias and variance indicate that the proposed model-based method is more robust than existing methods. Finally, extensive experiments on benchmark datasets, including simulated datasets and real click logs, demonstrate that the proposed model-based method consistently performs outperforms state-of-the-art methods in various scenarios. The code is available at https://github.com/rowedenny/MULTR.

下载PDF全文

下载文献需遵守相关版权规定

论文标题