Training Classification Model + tokenizer from Scratch
See original GitHub issue

Describe the bug
I’ve been looking through the repository and noticed that ClassificationModel requires a pre-trained model as part of the args. Is there any chance we can take a model from scratch, train a tokenizer for our dataset, and train the model on classification? Any help is really appreciated; I’ve really enjoyed using the Simple Transformers library for my research!
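For the tokenizer half of the question, here is a minimal sketch using the Hugging Face `tokenizers` library, which Simple Transformers builds on. The corpus path, output directory, and the 52,000 vocabulary size are illustrative assumptions, not values from this issue:

```python
import os

from tokenizers import ByteLevelBPETokenizer

# Train a byte-level BPE tokenizer from scratch on raw text.
tokenizer = ByteLevelBPETokenizer()
tokenizer.train(
    files=["data/corpus.txt"],  # placeholder path to your raw training text
    vocab_size=52_000,
    min_frequency=2,
    special_tokens=["<s>", "<pad>", "</s>", "<unk>", "<mask>"],
)

# Writes vocab.json and merges.txt, the files a RoBERTa-style tokenizer needs.
os.makedirs("custom-tokenizer", exist_ok=True)
tokenizer.save_model("custom-tokenizer")
```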
Issue Analytics
- Created: 3 years ago
- Comments: 10 (3 by maintainers)
Top Results From Across the Web

- How to train a new language model from scratch using ...
  "Train a tokenizer. Let's arbitrarily pick its size to be 52,000. We recommend training a byte-level BPE (rather than, let's say, a WordPiece..." (a sketch based on this approach follows this list)
- Transformers From Scratch: Training a Tokenizer
  "How to train a transformer model from scratch. We learn where to get ... All you need to create a custom tokenizer using..."
- Build a RoBERTa Model from Scratch, by Yulia Nudelman
  "In this article, we will build a pre-trained transformer model FashionBERT using the Hugging Face models. Goal. The goal is to train a..."
- Training a token classification model with fast.ai (YouTube)
  "Last week we covered how to use the tokenizer library to get our data to a state where we can train token classification..."
- Step 3: Prepare Your Data | Machine Learning
  "In the subsequent paragraphs, we will see how to do tokenization and vectorization for sequence models. We will also cover how we can..."
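Putting those pieces together, one possible route (a sketch only; whether a given Simple Transformers version accepts it is not confirmed by this issue) is to build a randomly initialised model around the new tokenizer with plain `transformers`, save both to one directory, and point ClassificationModel at that directory instead of a checkpoint name. All directory names and config values below are illustrative:

```python
from transformers import (
    RobertaConfig,
    RobertaForSequenceClassification,
    RobertaTokenizerFast,
)

# Randomly initialised weights: nothing pre-trained is downloaded here.
config = RobertaConfig(
    vocab_size=52_000,  # must match the tokenizer trained above
    max_position_embeddings=514,
    num_hidden_layers=6,
    num_attention_heads=12,
    type_vocab_size=1,
    num_labels=2,
)
model = RobertaForSequenceClassification(config)

# Load the byte-level BPE files saved earlier and bundle the tokenizer and
# the untrained model into one directory.
tokenizer = RobertaTokenizerFast.from_pretrained(
    "custom-tokenizer", model_max_length=512
)
tokenizer.save_pretrained("scratch-roberta")
model.save_pretrained("scratch-roberta")
```

`ClassificationModel("roberta", "scratch-roberta", num_labels=2)` should then start from the random initialisation on disk rather than downloading a checkpoint, though, as the maintainer's comment below suggests, this is likely to trail a pre-trained starting point.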
Read more >Top Related Medium Post
No results found
Top Related StackOverflow Question
No results found
Troubleshoot Live Code
Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start FreeTop Related Reddit Thread
No results found
Top Related Hackernoon Post
No results found
Top Related Tweet
No results found
Top Related Dev.to Post
No results found
Top Related Hashnode Post
No results found
Top GitHub Comments
I can answer this empirically: option 2 is superior. That is why transfer learning is so powerful.

The philosophical question is whether it’s worth the effort to fine-tune a pre-trained language model on the domain-specific text (i.e., fine-tune the language model itself) before training it on the classification task. In this case, I would suggest skipping the language-model fine-tuning at first and coming back to it if the final results are not satisfactory.
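As a rough sketch of that suggestion (not the maintainer's own code): Simple Transformers also ships a LanguageModelingModel that can fine-tune a checkpoint on domain text before the classification stage. The file names and args below are placeholders, the exact options may vary across versions, and it is assumed here that the fine-tuned weights land in `output_dir`:

```python
from simpletransformers.language_modeling import LanguageModelingModel
from simpletransformers.classification import ClassificationModel

# Step 1 (optional, per the comment above): fine-tune the language model
# itself on domain-specific raw text, one sample per line.
lm = LanguageModelingModel(
    "roberta",
    "roberta-base",
    args={"output_dir": "domain-lm", "overwrite_output_dir": True},
    use_cuda=False,  # set True if a GPU is available
)
lm.train_model("data/domain_corpus.txt")  # placeholder corpus file

# Step 2: train the classifier starting from the fine-tuned weights.
clf = ClassificationModel("roberta", "domain-lm", num_labels=2, use_cuda=False)
```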
This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.