XLM-R tokenizer is None
Environment info
- transformers version: 4.3.2
- Platform: Linux-4.19.112+-x86_64-with-Ubuntu-18.04-bionic
- Python version: 3.6.9
- PyTorch version (GPU?): 1.7.0+cu101 (True)
- Tensorflow version (GPU?): 2.4.1 (True)
Who can help
Information
I am using XLM-R:
The problem arises when using:
- the official example scripts: (give details below)
The task I am working on is:
- my own task or dataset: (give details below)
To reproduce
Steps to reproduce the behaviour:
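from transformers import XLMRobertaModel, XLMRobertaTokenizer

# Load the pretrained tokenizer and model, then print both objects
tokenizer = XLMRobertaTokenizer.from_pretrained('xlm-roberta-base')
model = XLMRobertaModel.from_pretrained('xlm-roberta-base')
print(tokenizer, model)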
Result
The XLM-R tokenizer prints as None, but the model loads correctly.
I am new to this model. Many thanks for your help.
Issue Analytics
- State:
- Created 3 years ago
- Comments:6 (3 by maintainers)
Top GitHub Comments
This is probably because you hadn’t restarted your kernel after installing the sentencepiece dependency!
Not sure how, but it’s working today.
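For anyone hitting the same symptom, the suggestion above can be checked with a short sketch like the one below. It only tests whether the sentencepiece package is visible to the running interpreter, using the standard library. The helper names is_installed and check_tokenizer_dependency are made up for this illustration and are not part of transformers, and the messages are suggestions rather than official diagnostics.

```python
import importlib.util


def is_installed(package_name: str) -> bool:
    """Return True if a top-level package is visible to the current interpreter."""
    return importlib.util.find_spec(package_name) is not None


def check_tokenizer_dependency() -> str:
    """Hypothetical helper: report whether sentencepiece can be found."""
    if not is_installed("sentencepiece"):
        return ("sentencepiece is missing: run `pip install sentencepiece`, "
                "then restart the kernel")
    # The package is present; if transformers was imported before it was
    # installed, the running session may still need a restart.
    return ("sentencepiece is installed: restart the kernel if transformers "
            "was imported before installing it")


if __name__ == "__main__":
    print(check_tokenizer_dependency())
```

If the package is reported as installed and the tokenizer is still None, restarting the notebook kernel and re-running the import is the next thing to try, as the comment above suggests.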