TREC DL 2021
Dataset Information:
Another year of the DL track. From the current description, it looks like it will be the same format as prior years.
Links to Resources:
Dataset ID(s):
(assuming collections remain the same)
msmarco-document/trec-dl-2021
- queries, docs, scoreddocs, (eventually) qrels
msmarco-passage/trec-dl-2021
- queries, docs, scoreddocs, (eventually) qrels
Supported Entities
- docs
- queries
- qrels
- scoreddocs
- docpairs
Additional comments/concerns/ideas/etc.
Still waiting for the release of datasets & potentially new information.
Issue Analytics
- State:
- Created 3 years ago
- Comments:7
Top GitHub Comments
TODO: enable HTTP range requests (via the Range request header) for Azure downloads so that interrupted downloads can resume after network interruptions: https://github.com/microsoft/msmarco/issues/7#issuecomment-880104882
Identified here: https://github.com/allenai/ir_datasets/actions/runs/1064186261
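A rough sketch of what resuming with a Range header could look like, using only the standard library. This is not the ir_datasets download code; `range_header` and `resume_download` are hypothetical names, and it assumes the server answers a ranged request with 206 Partial Content.

```python
import os
import urllib.error
import urllib.request


def range_header(bytes_on_disk):
    """Return the headers that ask the server for everything after the bytes already saved."""
    if bytes_on_disk > 0:
        return {"Range": f"bytes={bytes_on_disk}-"}
    return {}


def resume_download(url, dest, chunk_size=1 << 20):
    """Download url to dest, continuing from a partial file when one exists (hypothetical helper)."""
    have = os.path.getsize(dest) if os.path.exists(dest) else 0
    request = urllib.request.Request(url, headers=range_header(have))
    try:
        response = urllib.request.urlopen(request)
    except urllib.error.HTTPError as err:
        if err.code == 416 and have > 0:
            # Range not satisfiable: assumed here to mean the local file is already complete.
            return
        raise
    with response:
        # 206 means the range was honoured, so append; 200 means the full file was sent, so start over.
        mode = "ab" if response.status == 206 else "wb"
        with open(dest, mode) as out:
            while True:
                chunk = response.read(chunk_size)
                if not chunk:
                    break
                out.write(chunk)
```

Calling `resume_download(url, dest)` again after a dropped connection would pick up where the partial file ends. The existing hash check would still be needed afterwards, since a resumed file can be corrupt if the remote file changed between attempts.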
Looks like there was a problem with the passage qrels and they were re-uploaded: https://github.com/microsoft/msmarco/commit/41b3a684ed8ebd4e753250c3687547a77c62e7dd
Since the change only removed duplicate IDs from the file, it should not affect downstream evaluations. It may affect systems that use the qrels for another purpose, e.g., training.
Need to fix hashes, sizes, tests.
Should tests look for such cases and warn if it occurs?
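One possible shape for such a check, as a sketch only. It assumes the qrels are in the usual four-column TREC format (query_id, iteration, doc_id, relevance) and that "duplicate" means a repeated (query_id, doc_id) pair; `duplicate_qrel_keys` and `warn_on_duplicates` are hypothetical names, not part of ir_datasets.

```python
import warnings
from collections import Counter


def duplicate_qrel_keys(lines):
    """Return the sorted (query_id, doc_id) pairs that appear more than once."""
    counts = Counter()
    for line in lines:
        parts = line.split()
        if len(parts) != 4:
            continue  # skip blank or malformed lines
        query_id, _iteration, doc_id, _relevance = parts
        counts[(query_id, doc_id)] += 1
    return sorted(key for key, n in counts.items() if n > 1)


def warn_on_duplicates(lines):
    """Emit a warning (rather than failing) when duplicated judgments are found."""
    dupes = duplicate_qrel_keys(lines)
    if dupes:
        warnings.warn(
            f"{len(dupes)} duplicated (query_id, doc_id) pairs in qrels, e.g. {dupes[0]}"
        )
    return dupes
```

Warning instead of raising keeps evaluation usable while still flagging the file for anyone who uses the qrels for training.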