
past_key_values tuple index out of range error when using text2text-generation pipeline with encoder-decoder model

See original GitHub issue

Environment info

  • transformers version: 4.3.0
  • Platform: Linux-5.4.0-65-generic-x86_64-with-Ubuntu-20.04-focal
  • Python version: 3.7.9
  • PyTorch version (GPU?): 1.7.0 (False)
  • Tensorflow version (GPU?): not installed (NA)
  • Using GPU in script?: No
  • Using distributed or parallel set-up in script?: No

Who can help

@LysandreJik

Information

Model I am using (Bert, XLNet …): an EncoderDecoderModel with a RoBERTa encoder and a RobertaForCausalLM decoder.

The problem arises when using:

  • the official example scripts: (give details below)
  • my own modified scripts: (give details below)

In my real code I am using custom pre-trained models and tokenizers, but the error and behavior are the same as those produced by the demo script below.

The task I am working on is:

  • an official GLUE/SQUaD task: (give the name)
  • my own task or dataset: (give details below)

I am trying to use a pipeline to generate results from an encoder-decoder model that was trained on a custom text2text dataset.

To reproduce

Steps to reproduce the behavior:

You can just run the script below, or:

  1. Load an encoder-decoder model with RoBERTa encoder and decoder
  2. Create a text2text-generation pipeline with an appropriate tokenizer
  3. Use the pipeline to generate a result

import transformers

encoder = transformers.RobertaModel.from_pretrained(pretrained_model_name_or_path='roberta-base')
decoder = transformers.RobertaForCausalLM.from_pretrained(pretrained_model_name_or_path='roberta-base')

encoder_decoder_model = transformers.EncoderDecoderModel(encoder=encoder, decoder=decoder)

tokenizer = transformers.AutoTokenizer.from_pretrained('google/roberta2roberta_L-24_bbc')

text2text = transformers.pipeline('text2text-generation', model=encoder_decoder_model, tokenizer=tokenizer)

output = text2text('This is a test sentence.')

print(output)

Output:

If you want to use `RobertaLMHeadModel` as a standalone, add `is_decoder=True.`
normalizer.cc(51) LOG(INFO) precompiled_charsmap is empty. use identity normalization.
Traceback (most recent call last):
  File "demo.py", line 12, in <module>
    output = text2text('This is a test sentence.')
  File "/home/james/Code/demo/.env/lib/python3.7/site-packages/transformers/pipelines/text2text_generation.py", line 125, in __call__
    **generate_kwargs,
  File "/home/james/Code/demo/.env/lib/python3.7/site-packages/torch/autograd/grad_mode.py", line 26, in decorate_context
    return func(*args, **kwargs)
  File "/home/james/Code/demo/.env/lib/python3.7/site-packages/transformers/generation_utils.py", line 913, in generate
    **model_kwargs,
  File "/home/james/Code/demo/.env/lib/python3.7/site-packages/transformers/generation_utils.py", line 1177, in greedy_search
    output_hidden_states=output_hidden_states,
  File "/home/james/Code/demo/.env/lib/python3.7/site-packages/torch/nn/modules/module.py", line 727, in _call_impl
    result = self.forward(*input, **kwargs)
  File "/home/james/Code/demo/.env/lib/python3.7/site-packages/transformers/models/encoder_decoder/modeling_encoder_decoder.py", line 430, in forward
    **kwargs_decoder,
  File "/home/james/Code/demo/.env/lib/python3.7/site-packages/torch/nn/modules/module.py", line 727, in _call_impl
    result = self.forward(*input, **kwargs)
  File "/home/james/Code/demo/.env/lib/python3.7/site-packages/transformers/models/roberta/modeling_roberta.py", line 937, in forward
    return_dict=return_dict,
  File "/home/james/Code/demo/.env/lib/python3.7/site-packages/torch/nn/modules/module.py", line 727, in _call_impl
    result = self.forward(*input, **kwargs)
  File "/home/james/Code/demo/.env/lib/python3.7/site-packages/transformers/models/roberta/modeling_roberta.py", line 771, in forward
    past_key_values_length = past_key_values[0][0].shape[2] if past_key_values is not None else 0
IndexError: tuple index out of range

Expected behavior

I expect the pipeline to generate an output string.
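A likely culprit, hinted at by the `is_decoder=True` warning in the output above, is that the decoder was loaded without being configured as a decoder, so it lacks the cross-attention and cache layout that `generate()` expects. The sketch below (an assumption about the fix, not a confirmed resolution of this issue) shows the relevant config flags at config level only, with no weights downloaded; in practice, `EncoderDecoderModel.from_encoder_decoder_pretrained('roberta-base', 'roberta-base')` is the documented route that sets these flags on the decoder for you.

```python
from transformers import RobertaConfig, EncoderDecoderConfig

# Hedged sketch: the decoder needs is_decoder=True and add_cross_attention=True
# so that RobertaForCausalLM builds cross-attention layers and returns
# past_key_values in the shape EncoderDecoderModel.generate() expects.
decoder_config = RobertaConfig(is_decoder=True, add_cross_attention=True)
encoder_config = RobertaConfig()

# Combine the two configs the way EncoderDecoderModel does internally.
config = EncoderDecoderConfig.from_encoder_decoder_configs(encoder_config, decoder_config)

assert config.decoder.is_decoder
assert config.decoder.add_cross_attention
```

Loading the decoder in the demo script with `RobertaForCausalLM.from_pretrained('roberta-base', is_decoder=True, add_cross_attention=True)` would apply the same flags to the pretrained weights.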

Issue Analytics

  • State: closed
  • Created: 3 years ago
  • Comments: 8 (5 by maintainers)

Top GitHub Comments

sailist commented, Feb 23, 2022 (2 reactions)

The same exception may also be raised when the model is in train mode; calling model.eval() beforehand may solve the problem. It happened to me when using BartForConditionalGeneration.
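To make the train-vs-eval distinction concrete, here is a minimal plain-PyTorch sketch (not the reporter's model): dropout is active in train mode and a no-op in eval mode, which is why generating through a model accidentally left in train mode can behave unexpectedly.

```python
import torch

# Dropout with p=0.5: in train mode, each element is either zeroed or
# rescaled by 1/(1-p) = 2; in eval mode, dropout is the identity.
drop = torch.nn.Dropout(p=0.5)
x = torch.ones(8)

drop.train()  # training mode: stochastic masking is applied
y_train = drop(x)
assert all(v in (0.0, 2.0) for v in y_train.tolist())

drop.eval()   # eval mode: the module passes inputs through unchanged
assert torch.equal(drop(x), x)
```

Calling `model.eval()` on a full transformers model flips every such submodule into deterministic inference behavior at once.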

github-actions[bot] commented, May 9, 2021 (0 reactions)

This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.

Please note that issues that do not follow the contributing guidelines are likely to be ignored.
