(Bug) Using 'conv_like' attention causes loss to nosedive too quickly

Edit: here are all four runs: https://wandb.ai/afiaka87/dalle_coco_train/reports/conv_like-sanity-check--Vmlldzo1MzY3MTg

‘conv_like’ seems to still have issues at the moment.

Original post:

@lucidrains I don’t think that was the fix for conv_like. I’m still seeing the behavior from before when I turn it on.

https://wandb.ai/afiaka87/dalle_coco_train/reports/dalle-pytorch-attention-issues-using-full-axial_row-axial_col-conv_like---Vmlldzo1MzY2MjY

That’s a report of my current run. I’m gonna do a re-run with just full_conv again just to make sure it’s not due to the combination somehow.

Edit: okay, just doing a simple 16 depth, 8 heads, 8 batch size run with just ‘conv_like’ attention. Here’s that run: https://wandb.ai/afiaka87/dalle_coco_train/runs/36s8nsn9
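To illustrate why a run with only 'conv_like' helps isolate the problem, here is a minimal sketch. It assumes that the configured attention types are assigned to layers by cycling through the list across the model's depth; that is an assumption about how the combination is applied, not something confirmed in this thread. `layer_schedule` is a hypothetical helper written for this example and is not part of DALLE-pytorch.

```python
from itertools import cycle, islice


def layer_schedule(attn_types, depth):
    """Hypothetical helper: give each of `depth` layers one attention type
    by cycling through `attn_types` (assumed assignment scheme)."""
    return list(islice(cycle(attn_types), depth))


# Combined run: four attention types shared across 16 layers.
mixed = layer_schedule(("full", "axial_row", "axial_col", "conv_like"), 16)

# Isolated run: every one of the 16 layers uses 'conv_like'.
isolated = layer_schedule(("conv_like",), 16)

for name, schedule in (("mixed", mixed), ("isolated", isolated)):
    share = schedule.count("conv_like") / len(schedule)
    print(f"{name}: {share:.0%} of layers use conv_like")
```

Under that assumption, only a quarter of the layers in the combined run use 'conv_like', so a run where every layer uses it separates a problem in that attention type from an interaction between types.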

Issue Analytics

  • State: closed
  • Created 3 years ago
  • Comments: 7 (7 by maintainers)

Top GitHub Comments

afiaka87 commented, Mar 16, 2021 (1 reaction)

@lucidrains Feel free to check that report again. Seems your latest fix did the trick. Nicely done.

lucidrains commented, Mar 16, 2021 (1 reaction)

@afiaka87 Thanks yet again! This really helps speed up debugging. I think I found another 🐛, though by no means could it be the last one, lol: https://github.com/lucidrains/DALLE-pytorch/commit/f68cb213a3a687d7b2f0cc6c7f58148bbb89d6c4
