If gradient_accumulation_steps == 1, there is a division by zero when calculating G_noise and S_noise

https://github.com/kingoflolz/mesh-transformer-jax/blob/22de86e3cabb995ad1005cd90b6a407c0a5f954f/device_train.py#L351
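To illustrate why a single accumulation step can trigger this, here is a minimal sketch. It assumes the standard two-batch-size gradient noise scale estimator; it is not copied from device_train.py. The function name, argument names, and the NaN fallback are hypothetical. With only one accumulation step the small and large batch sizes coincide, so both denominators become zero. Plain Python floats raise ZeroDivisionError in that case, while JAX arrays would instead yield inf or nan.

```python
import math


def noise_scale_terms(grad_norm_small, grad_norm_big,
                      micro_batch, gradient_accumulation_steps):
    """Hypothetical helper: estimate (G_noise, S_noise) from gradient norms
    measured at the micro-batch size and at the accumulated batch size."""
    b_small = micro_batch
    b_big = micro_batch * gradient_accumulation_steps

    # With gradient_accumulation_steps == 1, b_big == b_small and both
    # denominators below are zero. Assumed fallback: report NaN instead.
    if b_big == b_small:
        return math.nan, math.nan

    g_noise = (b_big * grad_norm_big ** 2
               - b_small * grad_norm_small ** 2) / (b_big - b_small)
    s_noise = (grad_norm_small ** 2
               - grad_norm_big ** 2) / (1 / b_small - 1 / b_big)
    return g_noise, s_noise
```

Returning NaN is only one possible design choice; skipping the noise statistics entirely when there is a single accumulation step would work as well.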

Issue Analytics

  • State: closed
  • Created 2 years ago
  • Comments: 7 (4 by maintainers)

Top GitHub Comments

1 reaction
kingoflolz commented, Aug 17, 2021

I’ve found with the v1 model that per_replica_batch == 1 and gradient_accumulation_steps == batch is faster than the other way around.

0 reactions
mgrankin commented, Aug 18, 2021

This sounds like a bug to me, then. Should it be submitted to JAX/XLA?
