Stage 3 shape/dimension issues
Edit: Okay, I have zero clue why this is happening, but at this point I can only assume it has to do with using CUDA 11 and sparse attention. I've disabled literally everything, including FusedAdam and cpu_offload, and I even reinstalled DeepSpeed, but this issue persists.
Wouldn't be the first time I've had strange errors because of a somehow borked CUDA install, either…
On the other hand, the issue does at least seem closely related to the linked DeepSpeed issue, so I'll leave this up until I know more.
Original post:
Perhaps it has to do with this: https://github.com/microsoft/DeepSpeed/issues/828
When enabling DeepSpeed stage 3 and using their "FusedAdam" optimizer (the fast fused implementation) instead of passing in the normal Adam optimizer, I get the following stack trace:
Traceback (most recent call last):
  File "train_dalle.py", line 331, in <module>
    loss = distr_dalle(text, images, return_loss = True)
  File "/opt/conda/lib/python3.8/site-packages/torch/nn/modules/module.py", line 889, in _call_impl
    result = self.forward(*input, **kwargs)
  File "/opt/conda/lib/python3.8/site-packages/deepspeed/runtime/engine.py", line 914, in forward
    loss = self.module(*inputs, **kwargs)
  File "/opt/conda/lib/python3.8/site-packages/torch/nn/modules/module.py", line 889, in _call_impl
    result = self.forward(*input, **kwargs)
  File "/root/DALLE-pytorch/dalle_pytorch/dalle_pytorch.py", line 459, in forward
    image = self.vae.get_codebook_indices(image)
  File "/opt/conda/lib/python3.8/site-packages/torch/autograd/grad_mode.py", line 27, in decorate_context
    return func(*args, **kwargs)
  File "/root/DALLE-pytorch/dalle_pytorch/vae.py", line 152, in get_codebook_indices
    _, _, [_, _, indices] = self.model.encode(img)
  File "/root/.local/lib/python3.8/site-packages/taming_transformers-0.0.1-py3.8.egg/taming/models/vqgan.py", line 54, in encode
    quant, emb_loss, info = self.quantize(h)
  File "/opt/conda/lib/python3.8/site-packages/torch/nn/modules/module.py", line 889, in _call_impl
    result = self.forward(*input, **kwargs)
  File "/root/.local/lib/python3.8/site-packages/taming_transformers-0.0.1-py3.8.egg/taming/modules/vqvae/quantize.py", line 42, in forward
    torch.sum(self.embedding.weight**2, dim=1) - 2 * \
IndexError: Dimension out of range (expected to be in range of [-1, 0], but got 1)
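A hedged reproduction of the failing line (my assumption, not confirmed anywhere in this thread): under ZeRO stage 3, parameters that have not been gathered on the current rank show up as empty tensors of shape torch.Size([0]), and reducing such a tensor over dim=1 raises exactly the IndexError seen in the traceback above.

```python
import torch

# Stand-in for a partitioned self.embedding.weight under ZeRO stage 3
# (assumption: the partitioned parameter appears as an empty 1-D tensor).
weight = torch.zeros(0)

try:
    # Mirrors quantize.py line 42: torch.sum(self.embedding.weight**2, dim=1)
    torch.sum(weight ** 2, dim=1)
except IndexError as err:
    # A 1-D tensor only has dims [-1, 0], so dim=1 is out of range.
    print(err)
```

If this is what is happening, the VQGAN's codebook weight was partitioned away before `quantize.forward` ran, which would fit the external-parameter discussion below.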
The input_mask that we currently support in our implementation must have the shape (batch_size, 1, 1, sequence_length). This means the mask can differ for different inputs in the batch.
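A minimal sketch of building a mask with that shape (the variable names and lengths here are illustrative, not from DALLE-pytorch): the two singleton dimensions let the per-sequence mask broadcast over attention heads and query positions.

```python
import torch

batch_size, heads, seq_len = 2, 4, 8

# Per-sequence valid lengths: 1 = attend, 0 = padding.
lengths = torch.tensor([8, 5])
mask = (torch.arange(seq_len)[None, :] < lengths[:, None]).float()  # (B, S)

# Reshape to (batch_size, 1, 1, sequence_length) as the comment requires.
input_mask = mask[:, None, None, :]

# Broadcasts against attention scores of shape (B, heads, S, S).
scores = torch.randn(batch_size, heads, seq_len, seq_len)
masked = scores.masked_fill(input_mask == 0, float('-inf'))
```

Because the batch dimension is kept, each input in the batch can carry its own padding pattern, which is the point of the shape requirement above.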
Issue Analytics
- State:
- Created 2 years ago
- Reactions: 1
- Comments: 5 (5 by maintainers)
Top GitHub Comments
May also be related to external parameters in DeepSpeed. I’ve opened an issue asking about these but haven’t gotten an answer yet. (EDIT: They answered.)
Since you’re getting an error, I’m assuming these are problems with external parameters finally showing up. 😃
DeepSpeed 0.3.15 now automatically detects external parameters, but #207 adds manual external-parameter registration anyway.
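For context, a plain-PyTorch sketch of the access pattern that makes a parameter "external" under ZeRO stage 3: a forward pass reuses a submodule's weight after that submodule's own forward has finished (weight tying is the classic case). The module and names below are hypothetical, not taken from DALLE-pytorch.

```python
import torch
import torch.nn as nn

class TiedLM(nn.Module):
    def __init__(self, vocab_size, dim):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, dim)
        self.body = nn.Linear(dim, dim)
        # Before automatic detection, the manual registration added by #207
        # would use DeepSpeed's documented API, roughly:
        #   deepspeed.zero.register_external_parameter(self, self.embed.weight)

    def forward(self, tokens):
        h = self.body(self.embed(tokens))
        # Weight tying: this line touches self.embed.weight outside the
        # Embedding's own forward, which ZeRO-3 must be told about so the
        # partitioned weight gets re-gathered before use.
        return h @ self.embed.weight.t()

model = TiedLM(vocab_size=10, dim=4)
logits = model(torch.tensor([[1, 2, 3]]))  # shape (1, 3, 10)
```

Without registration (or the 0.3.15 auto-detection), the tied weight would be an empty partitioned tensor at that point, producing shape errors like the one in the traceback above.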