
encode_plus is not in GPT2 Tokenizer

See original GitHub issue

It seems you removed encode_plus; what is its successor? The notebooks all include inputs = tokenizer.encode_plus(text, return_tensors='pt', add_special_tokens=True), which is now wrong and raises an error.
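In current transformers releases (v3 and later), calling the tokenizer object directly is the documented replacement for encode_plus and returns the same kind of dict-like BatchEncoding. A minimal sketch, assuming a recent transformers install and the standard "gpt2" checkpoint:

    from transformers import GPT2Tokenizer

    tokenizer = GPT2Tokenizer.from_pretrained("gpt2")
    text = "The quick brown fox jumps over the lazy dog."

    # tokenizer(...) replaces tokenizer.encode_plus(...) and returns a BatchEncoding
    # with input_ids and attention_mask as PyTorch tensors.
    inputs = tokenizer(text, return_tensors="pt", add_special_tokens=True)
    print(inputs["input_ids"].shape)

Whether encode_plus itself is still available depends on the installed transformers version, which is what the maintainer's debugging suggestion below is probing.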

Issue Analytics

  • State: closed
  • Created 4 years ago
  • Comments:18 (8 by maintainers)

Top GitHub Comments

1 reaction
liygzting commented, Dec 29, 2020

The latest repo is ok to re-run. [attached screenshot: IMG_20201229_202856.jpg]

Thanks 👍

0 reactions
jessevig commented, Dec 26, 2020

I see. And the imports in your version are exactly the same as in the demo notebook? In that case I would recommend adding some debug statements to neuron_view.py to see what is being returned from the model, e.g. at the final line here:

[screenshot of the relevant code in neuron_view.py]
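For illustration only (the identifiers below are hypothetical, not the actual neuron_view.py code), a debug statement of this kind might look like:

    import torch
    from transformers import GPT2Model, GPT2Tokenizer

    tokenizer = GPT2Tokenizer.from_pretrained("gpt2")
    model = GPT2Model.from_pretrained("gpt2", output_attentions=True)

    inputs = tokenizer("Hello world", return_tensors="pt")
    with torch.no_grad():
        output = model(**inputs)

    # Debug: inspect what the model call actually returns before unpacking it
    print(type(output))
    print(output.keys() if hasattr(output, "keys") else len(output))

Printing the type and keys of the returned object makes it obvious whether the installed transformers version returns a tuple or a newer ModelOutput object, which is usually where version mismatches surface.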

Read more comments on GitHub >

Top Results From Across the Web

Tokenizer - Hugging Face
A tokenizer is in charge of preparing the inputs for a model. The library contains tokenizers for all the models. Most of...

What's the difference between tokenizer.encode ... - Stack Overflow
The tokenizer.encode_plus function combines multiple steps for us: ... Tokenize all of the sentences and map the tokens to their word IDs.

What is the difference between batch_encode_plus() and ...
I have read the documentation related to the T5 Transformer model. While using T5Tokenizer I am kind of confused with tokenizing my sentences. Can ...

Understanding the GPT-2 Source Code Part 2 - Medium
description='Pre-encode text files into tokenized training set.', ... I'm not sure why they did not use the fire library here so if anyone ...

github.com-huggingface-transformers_-_2019-09-29_08-40-52
sentence2 = "His findings were not compatible with this research." inputs1 = tokenizer.encode_plus(sentence0, sentence1, add_special_tokens=True ...
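The Stack Overflow results above touch on the difference between encode, encode_plus, and batch_encode_plus. A brief sketch of how those map onto the current API, again assuming transformers v3+ and the "gpt2" checkpoint:

    from transformers import GPT2Tokenizer

    tokenizer = GPT2Tokenizer.from_pretrained("gpt2")

    # encode: a plain list of token IDs
    ids = tokenizer.encode("Hello world")

    # __call__ (formerly encode_plus): a dict-like BatchEncoding with
    # input_ids and attention_mask
    enc = tokenizer("Hello world")

    # Passing a list of strings (formerly batch_encode_plus); GPT-2 has no pad
    # token by default, so reuse the EOS token before padding.
    tokenizer.pad_token = tokenizer.eos_token
    batch = tokenizer(["Hello world", "A second, longer sentence."],
                      padding=True, return_tensors="pt")

    print(ids)
    print(list(enc.keys()))
    print(batch["input_ids"].shape)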
