
torch._dynamo.exc.Unsupported: dynamic shapes: arange


🐛 Describe the bug

While trying out PyTorch 2 with OpenNMT-py, using these two lines:

    rawmodel = build_model(model_opt, opt, vocabs, checkpoint)
    model = torch.compile(rawmodel, fullgraph=True, backend='nvprims_aten')

Error logs

I am getting this:

from user code:
   File "/home/vincent/nlp/OpenNMT-py/onmt/encoders/transformer.py", line 126, in forward
    mask = ~sequence_mask(src_len).unsqueeze(1)
  File "/home/vincent/nlp/OpenNMT-py/onmt/utils/misc.py", line 58, in sequence_mask
    return (torch.arange(0, max_len, device=lengths.device)

Set torch._dynamo.config.verbose=True for more information


You can suppress this exception and fall back to eager by setting:
    torch._dynamo.config.suppress_errors = True
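As the message says, the failure can be silenced (not fixed) by configuring Dynamo before compiling. A minimal sketch:

```python
import torch._dynamo

# Fall back to eager execution for any frame Dynamo cannot compile,
# as the error message suggests. This hides the failure rather than
# fixing it: the affected code simply will not be compiled.
torch._dynamo.config.suppress_errors = True
```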

Minified repro

No response
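No minified repro was posted. A hand-written sketch of what the failing pattern boils down to, reconstructed from the two traceback lines above (not the exact OpenNMT-py source), might look like this:

```python
import torch

def sequence_mask(lengths, max_len=None):
    # Sketch of the OpenNMT-py helper named in the traceback: the upper
    # bound of arange depends on the data (lengths.max()), and that
    # data-dependent shape is what the dynamic-shapes arange error is about.
    max_len = max_len or lengths.max()
    return (torch.arange(0, max_len, device=lengths.device)
            .unsqueeze(0)
            .lt(lengths.unsqueeze(1)))

lengths = torch.tensor([2, 3, 1])
mask = sequence_mask(lengths)  # eager mode works fine

compiled = torch.compile(sequence_mask, fullgraph=True)
# Calling compiled(lengths) is what raises
# torch._dynamo.exc.Unsupported: dynamic shapes: arange
# on the torch build used in this report.
```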

Issue Analytics

  • State: open
  • Created: 9 months ago
  • Comments: 14 (8 by maintainers)

Top GitHub Comments

1 reaction
vince62s commented, Dec 6, 2022

because of this: https://github.com/pytorch/pytorch/issues/90170

It is not so easy to run the minifier, but I will try.

0 reactions
vince62s commented, Dec 9, 2022

Okay, I managed to tweak Triton to recognize ptxas 7.8 / CUDA 11.8, and inductor mode does not trigger any error. However, nothing seems to be happening: I got a loop with both the TypedStorage warning and:

    [2022-12-09 13:33:41,820] torch._inductor.lowering: [WARNING] using triton random, expect difference from eager

There is no log from my training loop. nvidia-smi shows some activity (both RAM and utilization), but without any other message it is difficult to dig further.

EDIT: I’ll try module by module to see where the problem could be.
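The module-by-module idea can be sketched like this (the model below is a toy stand-in, not OpenNMT-py; `backend="eager"` exercises only Dynamo's tracing, so no working Triton/ptxas toolchain is needed):

```python
import torch
import torch.nn as nn

# Toy stand-in model; in practice this would be the OpenNMT-py model.
model = nn.Sequential(nn.Linear(8, 16), nn.ReLU(), nn.Linear(16, 4))

# Compile each child module separately instead of the whole model,
# so a compilation failure points at the submodule that triggers it.
for name, child in list(model.named_children()):
    setattr(model, name, torch.compile(child, backend="eager"))

out = model(torch.randn(2, 8))
```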


Top Results From Across the Web

TorchDynamo Update 5: Improved Capture & Bigger Graphs
With static shapes, TorchDynamo can constant-propagate this stuff away, however, with dynamic shapes it will break the graph. Zach has some ...
