论文标题
IR_Metadata:IR实验的可扩展元数据模式
ir_metadata: An Extensible Metadata Schema for IR Experiments
论文作者
论文摘要
信息检索(IR)社区具有强大的传统,即使计算文物和资源可用于将来重复使用,从而验证实验结果。除了实际的测试集外,基础运行文件通常是在数据档案中托管的,这是TREC,CLEF或NTCIR等会议的一部分。不幸的是,运行数据本身并未提供有关基础实验的太多信息。例如,如果没有共享任务的网站或运行数据存档的上下文,单个运行文件就不会太多。在其他领域,例如社会科学,最好用元数据注释研究数据。在这项工作中,我们介绍了IR_Metadata-基于Primad模型的TREC运行文件的可扩展元数据架构。我们建议将元数据注释与Primad保持一致,该注释考虑了可能影响可重复性的计算实验的组成部分。此外,我们概述了应在元数据中报告的重要组成部分和信息,并提供了文献的证据。为了证明这些元数据注释的有用性,我们在repro_eval中实施了新功能,该功能支持概述的元数据模式用于可重复性研究的用例。此外,我们策划了一个数据集,该数据集的运行文件是从具有不同实例化的Primad组件实例的实验中得出的,并用相应的元数据注释这些文件。在实验中,我们涵盖了由元数据鉴定并由Primad分类的可重复性实验。通过这项工作,我们使IR研究人员能够注释TREC运行文件,并进一步提高实验工件的重用价值。
The information retrieval (IR) community has a strong tradition of making the computational artifacts and resources available for future reuse, allowing the validation of experimental results. Besides the actual test collections, the underlying run files are often hosted in data archives as part of conferences like TREC, CLEF, or NTCIR. Unfortunately, the run data itself does not provide much information about the underlying experiment. For instance, the single run file is not of much use without the context of the shared task's website or the run data archive. In other domains, like the social sciences, it is good practice to annotate research data with metadata. In this work, we introduce ir_metadata - an extensible metadata schema for TREC run files based on the PRIMAD model. We propose to align the metadata annotations to PRIMAD, which considers components of computational experiments that can affect reproducibility. Furthermore, we outline important components and information that should be reported in the metadata and give evidence from the literature. To demonstrate the usefulness of these metadata annotations, we implement new features in repro_eval that support the outlined metadata schema for the use case of reproducibility studies. Additionally, we curate a dataset with run files derived from experiments with different instantiations of PRIMAD components and annotate these with the corresponding metadata. In the experiments, we cover reproducibility experiments that are identified by the metadata and classified by PRIMAD. With this work, we enable IR researchers to annotate TREC run files and improve the reuse value of experimental artifacts even further.