should mBART-large-en-ro have decoder_start_token_id by default?
See original GitHub issueHypothesis: since the argument prepend_bos
is set to “False” in fairseq/examples/README.md, mbart-large-en-ro does not need decoder_start_token_id
.
TODO:
- create branch that deletes
decoder_start_token_id
. Setting it to None in the config might not be enough. - verify that decoder_start_token_id is in fact not being used by setting a breakpoint in
generate
. - run_eval.py on wmt-en-ro/test and see if BLEU is >= 26.46, the score with decoder_start_token_id=250020.
Issue Analytics
- State:
- Created 3 years ago
- Reactions:1
- Comments:7 (7 by maintainers)
Top Results From Across the Web
No results found
Top Related Medium Post
No results found
Top Related StackOverflow Question
No results found
Troubleshoot Live Code
Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free
Top Related Reddit Thread
No results found
Top Related Hackernoon Post
No results found
Top Related Tweet
No results found
Top Related Dev.to Post
No results found
Top Related Hashnode Post
No results found
Gunna close this since the score is now basically the same as fairseq. Thanks for your help!
I think I fixed this another way in #6526 on master
=> {‘bleu’: 26.81}
{‘bleu’: 11.57} (and takes 40 mins!)
in the original fairseq I get 26.83.