GLUE test set predictions
🚀 Feature request
Motivation
The run_glue script is super helpful, but it currently doesn’t produce predictions on the test datasets for the GLUE tasks. I think this would be extremely helpful for a lot of people. I’m sure plenty of people have implemented this functionality themselves, but I haven’t found any such implementation. Since transformers already provides train and dev for GLUE, it would be cool to complete the feature set by providing test set predictions.
Your contribution
I’m personally working on a branch that extends the glue_processors to support the test sets (which are already downloaded by the recommended download_glue.py script). I also update the run_glue.py script to produce the *.tsv files required by the GLUE online submission interface.
I think I’m a couple of days out from completing and testing my implementation. I’m also sure plenty of implementations of this already exist. If there are no other plans in the works to support this, I’m happy to submit a PR.
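To make the *.tsv output concrete, here is a minimal sketch of writing test set predictions to a tab-separated file. It is not the code from my branch or from run_glue.py: the function name write_glue_predictions is hypothetical, and the two-column "index" / "prediction" header is an assumption about what the submission interface expects, so check it against the GLUE instructions before uploading.

```python
import csv


def write_glue_predictions(path, predictions, label_list=None):
    """Write one prediction per row to a tab-separated file.

    Hypothetical helper for illustration only.
    predictions: class ids for classification tasks, or floats for regression.
    label_list:  optional id -> label-name mapping; leave as None for regression.
    """
    with open(path, "w", newline="", encoding="utf-8") as f:
        writer = csv.writer(f, delimiter="\t", lineterminator="\n")
        # Assumed header layout; adjust if the submission format differs.
        writer.writerow(["index", "prediction"])
        for index, pred in enumerate(predictions):
            if label_list is not None:
                # Map the predicted class id to its label name.
                pred = label_list[pred]
            writer.writerow([index, pred])
```

For a classification task this could be called as write_glue_predictions("RTE.tsv", [0, 1, 0], label_list=["entailment", "not_entailment"]), where the file name and labels are again only examples. For a regression task, pass the raw scores and omit label_list.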
Issue Analytics
- Created 4 years ago
- Reactions: 6
- Comments: 7 (3 by maintainers)
@AMChierici make sure you run from master; there’s indeed a mode kwarg now.
@shoarora Thanks for this first PR. I did check yours while merging the other one (to make sure that the indices in the csv parsing, etc. were correct).
@AMChierici I didn’t author #4463, which is what made it to master to enable this feature. I haven’t played with it yet, so sorry I can’t be of more help.