DOI: 10.1145/3477495.3531738
research-article

ir_metadata: An Extensible Metadata Schema for IR Experiments

Published: 07 July 2022

Abstract

The information retrieval (IR) community has a strong tradition of making computational artifacts and resources available for future reuse, allowing the validation of experimental results. Besides the actual test collections, the underlying run files are often hosted in data archives as part of conferences like TREC, CLEF, or NTCIR. Unfortunately, the run data itself does not provide much information about the underlying experiment. For instance, a single run file is not of much use without the context of the shared task's website or the run data archive. In other domains, like the social sciences, it is good practice to annotate research data with metadata. In this work, we introduce ir_metadata, an extensible metadata schema for TREC run files based on the PRIMAD model. We propose to align the metadata annotations to PRIMAD, which considers the components of computational experiments that can affect reproducibility. Furthermore, we outline important components and information that should be reported in the metadata and give evidence from the literature. To demonstrate the usefulness of these metadata annotations, we implement new features in repro_eval that support the outlined metadata schema for the use case of reproducibility studies. Additionally, we curate a dataset of run files derived from experiments with different instantiations of the PRIMAD components and annotate them with the corresponding metadata. In the experiments, we cover reproducibility experiments that are identified by the metadata and classified by PRIMAD. With this work, we enable IR researchers to annotate TREC run files and further improve the reuse value of experimental artifacts.
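For illustration, the short Python sketch below attaches PRIMAD-aligned metadata to a run file as leading comment lines, similar to a file header. The field names (platform, implementation, method, data), the YAML-like layout, and the file names are placeholders chosen for this example, not the published ir_metadata schema.

# Illustrative sketch: prepend hypothetical PRIMAD-aligned metadata
# (comment lines) to an existing TREC run file. Tag names and layout
# are assumptions for this example, not the exact ir_metadata schema.
HEADER = """\
# platform:
#   software: Anserini
# implementation:
#   source: https://example.org/my-retrieval-code
# method:
#   retrieval_model: BM25
# data:
#   test_collection: robust04
"""

def annotate_run(run_path: str, annotated_path: str) -> None:
    """Write the metadata block followed by the unchanged run rows."""
    with open(run_path) as src, open(annotated_path, "w") as dst:
        dst.write(HEADER)
        dst.write(src.read())

# Minimal demo: create a one-line run file, then annotate it.
with open("input.run", "w") as f:
    f.write("301 Q0 FBIS3-10082 1 7.42 example_tag\n")
annotate_run("input.run", "input.annotated.run")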

Supplementary Material

MP4 File (SIGIR22-rs1684.mp4)
Experimentation in information retrieval (IR) research is an inherently data-driven process that often results in experimental artifacts, so-called run files. We propose making these run files even more valuable by annotating them with metadata to promote the comparability, transparency, and reproducibility of IR experiments. This video introduces the outlined metadata schema and gives an overview of the related resources. From a practical point of view, we propose to add the metadata, similar to a file header, as comments at the beginning of the run file. Furthermore, we align the metadata schema to the PRIMAD model, which provides a conceptual taxonomy for reproducible IR experiments. Besides the metadata schema, we introduce the software support in repro_eval (also with the help of a Colab notebook) and provide annotated runs as a curated dataset hosted in a Zenodo archive. Finally, we show how the metadata facilitates meta-evaluations, using reproducibility studies as a use case.
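To sketch how such header comments could support a meta-evaluation, the snippet below reads the leading comment lines back into a dictionary and groups runs by one metadata field, reusing the annotated file produced in the sketch above. The field names remain illustrative assumptions, and this is not repro_eval's actual interface.

# Sketch: parse hypothetical metadata comments from annotated run files
# and group the runs by one field, e.g. the retrieval method. This is
# illustrative only and does not reproduce the repro_eval API.
from collections import defaultdict

def read_metadata(run_path: str) -> dict:
    """Collect leading '# key: value' comment lines into a flat dict."""
    meta = {}
    with open(run_path) as f:
        for line in f:
            if not line.startswith("#"):
                break  # first non-comment line marks the start of the run data
            entry = line.lstrip("#").strip()
            key, _, value = entry.partition(":")
            if value.strip():  # skip section headers such as 'platform:'
                meta[key.strip()] = value.strip()
    return meta

def group_runs(run_paths, field):
    """Map each value of the given metadata field to the runs carrying it."""
    groups = defaultdict(list)
    for path in run_paths:
        groups[read_metadata(path).get(field, "unknown")].append(path)
    return dict(groups)

# Runs sharing the same value could then be compared in a reproducibility study.
print(group_runs(["input.annotated.run"], "retrieval_model"))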

Published In

SIGIR '22: Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval
July 2022
3569 pages
ISBN:9781450387323
DOI:10.1145/3477495
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Author Tags

  1. information retrieval
  2. metadata
  3. reproducibility

Qualifiers

  • Research-article

Funding Sources

  • Deutsche Forschungsgemeinschaft (DFG)

Conference

SIGIR '22

Acceptance Rates

Overall Acceptance Rate: 792 of 3,983 submissions (20%)

Cited By

  • (2024) ReNeuIR at SIGIR 2024: The Third Workshop on Reaching Efficiency in Neural Information Retrieval. In Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, 3051-3054. https://doi.org/10.1145/3626772.3657994 (10 Jul 2024)
  • (2024) Browsing and Searching Metadata of TREC. In Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, 313-323. https://doi.org/10.1145/3626772.3657873 (10 Jul 2024)
  • (2024) Mission Reproducibility: An Investigation on Reproducibility Issues in Machine Learning and Information Retrieval Research. In 2024 IEEE 20th International Conference on e-Science (e-Science), 1-9. https://doi.org/10.1109/e-Science62913.2024.10678657 (16 Sep 2024)
  • (2023) The Information Retrieval Experiment Platform. In Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2826-2836. https://doi.org/10.1145/3539618.3591888 (19 Jul 2023)
  • (2023) ranxhub: An Online Repository for Information Retrieval Runs. In Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 3210-3214. https://doi.org/10.1145/3539618.3591823 (19 Jul 2023)
