Parameter-Efficient Prompt Tuning Makes Generalized and Calibrated Neural Text Retrievers

Title

Parameter-Efficient Prompt Tuning Makes Generalized and Calibrated Neural Text Retrievers

Authors

Tam, Weng Lam; Liu, Xiao; Ji, Kaixuan; Xue, Lilong; Zhang, Xingjian; Dong, Yuxiao; Liu, Jiahua; Hu, Maodi; Tang, Jie

Abstract

Prompt tuning attempts to update only a few task-specific parameters in pre-trained models. It has achieved performance comparable to fine-tuning the full parameter set on both language understanding and generation tasks. In this work, we study the problem of prompt tuning for neural text retrievers. We introduce parameter-efficient prompt tuning for text retrieval across in-domain, cross-domain, and cross-topic settings. Through an extensive analysis, we show that the strategy can mitigate two issues faced by fine-tuning-based retrieval methods: parameter inefficiency and weak generalizability. Notably, it can significantly improve the out-of-domain zero-shot generalization of the retrieval models. By updating only 0.1% of the model parameters, the prompt tuning strategy can help retrieval models achieve better generalization performance than traditional methods in which all parameters are updated. Finally, to facilitate research on retrievers' cross-topic generalizability, we curate and release an academic retrieval dataset with 18K query-results pairs in 87 topics, making it the largest topic-specific one to date.
