question-mark
Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

Incorrect architecture specified for ANCE DPR context encoder model

See original GitHub issue

In the config for your ANCE DPR context encoder in HuggingFace model hub, the architecture is listed as DPRQuestionEncoder but it should be DPRContextEncoder. This causes a failure when using “reflection” to check the correct model class type to use.

See: https://huggingface.co/castorini/ance-dpr-context-multi/blob/main/config.json Related: https://github.com/huggingface/transformers/issues/13670

Issue Analytics

  • State:closed
  • Created 2 years ago
  • Comments:7 (5 by maintainers)

github_iconTop GitHub Comments

1reaction
MXueguangcommented, Sep 21, 2021

It shouldn’t affect results, but I’ll rerun to make sure it works.

1reaction
MXueguangcommented, Sep 21, 2021

Yes, @lintool @joshdevins, thank you for catching the issue, I’ll update the model on hugging face.

Read more comments on GitHub >

github_iconTop Results From Across the Web

DPR AutoModel loading incorrect architecture for ... - GitHub
Loading a DPR context encoder DPRContextEncoder using AutoModel.from_pretrained is actually loading DPRQuestionEncoder instead, and later fails. Steps to ...
Read more >
DPR - Hugging Face
It is used to instantiate the components of the DPR model according to the specified arguments, defining the model component architectures.
Read more >
PyTerrier Documentation
PyTerrier uses some of the functionality of the Java-based Terrier IR platform for indexing and retrieval functionality.
Read more >
arXiv:2212.09114v1 [cs.CL] 18 Dec 2022
In this paper, we propose a curriculum learning approach to bridge the gap between training and inference for passage retrieval using dual-cross ...
Read more >
Passage Retrieval | Papers With Code
Open-domain question answering relies on efficient passage retrieval to select candidate contexts, where traditional sparse vector space models, such as TF-IDF ...
Read more >

github_iconTop Related Medium Post

No results found

github_iconTop Related StackOverflow Question

No results found

github_iconTroubleshoot Live Code

Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free

github_iconTop Related Reddit Thread

No results found

github_iconTop Related Hackernoon Post

No results found

github_iconTop Related Tweet

No results found

github_iconTop Related Dev.to Post

No results found

github_iconTop Related Hashnode Post

No results found