论文标题
验证用户查询变体的模拟
Validating Simulations of User Query Variants
论文作者
论文摘要
面向系统的IR评估仅限于对真实用户行为的抽象理解。作为解决方案,模拟用户交互提供了一种具有成本效益的方法,可以在没有可用的交互日志时使用更现实的指令来支持以系统为导向的实验。尽管有几种用于模拟点击或结果列表交互的用户模型,但很少有尝试查询模拟的尝试,并且尚未研究这些方法是否可以重现真实查询的属性。在这项工作中,我们借助于TREC测试集合来验证模拟的用户查询变体,以参考针对相应主题进行的真实用户查询。此外,我们引入了一种简单而有效的方法,该方法比确定的方法提供了更好的实际查询复制品。我们的评估框架验证了有关检索性能,主题得分分布的可重复性,共享任务实用程序,努力和效果的可重复性以及与真实用户查询变体相比的模拟。尽管主题评分分布以及经济方面的检索效率和统计特性接近实际查询,但模拟确切的期限匹配和后来的查询重新恢复仍然是一项挑战。
System-oriented IR evaluations are limited to rather abstract understandings of real user behavior. As a solution, simulating user interactions provides a cost-efficient way to support system-oriented experiments with more realistic directives when no interaction logs are available. While there are several user models for simulated clicks or result list interactions, very few attempts have been made towards query simulations, and it has not been investigated if these can reproduce properties of real queries. In this work, we validate simulated user query variants with the help of TREC test collections in reference to real user queries that were made for the corresponding topics. Besides, we introduce a simple yet effective method that gives better reproductions of real queries than the established methods. Our evaluation framework validates the simulations regarding the retrieval performance, reproducibility of topic score distributions, shared task utility, effort and effect, and query term similarity when compared with real user query variants. While the retrieval effectiveness and statistical properties of the topic score distributions as well as economic aspects are close to that of real queries, it is still challenging to simulate exact term matches and later query reformulations.