Very small gradients causing no weight updates in the model
Thanks for your code. It helped me understand BiDAF in detail.

However, I found that the model's performance never improves: the metric stays the same every epoch. Digging in, I found that the gradients computed during optimization are very small, on the order of 10^-3 to 10^-8.

I can't figure out what's wrong, and I think your code is otherwise easy to understand, so what might the problem be?
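Since the complaint is about tiny gradients, a quick way to confirm where they vanish is to print per-parameter gradient norms right after backpropagation. This is a minimal sketch assuming a PyTorch model; `model`, `loss`, and `optimizer` are generic placeholders, not names taken from this repository.

```python
import torch

def report_grad_norms(model: torch.nn.Module) -> None:
    """Print the gradient norm of every parameter to locate vanishing layers."""
    for name, param in model.named_parameters():
        if param.grad is None:
            print(f"{name}: no gradient")
        else:
            print(f"{name}: grad norm = {param.grad.norm().item():.3e}")

# Usage inside a training step (placeholders, not this repo's loop):
#   loss.backward()
#   report_grad_norms(model)
#   optimizer.step()
```

If the norms shrink sharply from the output layers toward the input layers, that points at vanishing gradients rather than, say, a learning-rate or data problem.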
Issue Analytics
- Created: 6 years ago
- Comments: 31 (19 by maintainers)
Top Results From Across the Web
The Vanishing Gradient Problem: The Problem, Its Causes ...
A small gradient means that the weights and biases of the initial layers will not be updated effectively with each training session.

Vanishing and Exploding Gradients in Neural Network ...
As aforementioned, one primary cause of exploding gradients lies in too large a weight initialization and update, and this is the reason...

A Gentle Introduction to Exploding Gradients in Neural ...
Exploding gradients are a problem where large error gradients accumulate and result in very large updates to neural network model weights...

Vanishing and Exploding Gradients in Deep Neural Networks
This, in turn, causes very large weight updates and causes the gradient descent to diverge. This is known as the exploding gradients problem...

Debugging Neural Networks with PyTorch and W&B Using ...
If your model is not overfitting, it might be that ... with vanishing gradients, the weight updates are very small, while...
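The articles above point at two standard mitigations: scale-aware weight initialization and gradient clipping. Below is a minimal, generic PyTorch sketch of both; the toy model, optimizer, and hyperparameters are illustrative assumptions, not the BiDAF code under discussion.

```python
import torch
import torch.nn as nn

# Toy stand-in model; not the BiDAF architecture from this issue.
model = nn.Sequential(nn.Linear(128, 128), nn.ReLU(), nn.Linear(128, 1))

# Xavier initialization keeps activation/gradient variance roughly constant
# across layers, which counteracts vanishing and exploding gradients.
for module in model.modules():
    if isinstance(module, nn.Linear):
        nn.init.xavier_uniform_(module.weight)
        nn.init.zeros_(module.bias)

optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)

def training_step(x: torch.Tensor, y: torch.Tensor) -> float:
    optimizer.zero_grad()
    loss = nn.functional.mse_loss(model(x), y)
    loss.backward()
    # Clip the global gradient norm so one bad batch cannot blow up the weights.
    torch.nn.utils.clip_grad_norm_(model.parameters(), max_norm=5.0)
    optimizer.step()
    return loss.item()
```

Clipping guards against the exploding case; for the vanishing case described in this issue, initialization, architecture (e.g. highway/residual connections, as BiDAF itself uses), and the loss computation are the usual suspects.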
Top GitHub Comments
Hmm, I am trying to hook the code up to TensorBoard so I can compare against the Keras training log and get a clearer picture. By the way, I have a deadline coming up, so I can't spend all my time on this. But if I make any progress, I will let you know.
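For reference, gradient logging of the kind described in that comment could look like the sketch below, using `torch.utils.tensorboard.SummaryWriter`. This is an assumed setup (the commenter may equally have used tensorboardX or Keras callbacks), not code from the repository.

```python
import torch
from torch.utils.tensorboard import SummaryWriter

writer = SummaryWriter(log_dir="runs/bidaf-debug")  # hypothetical log directory

def log_gradients(model: torch.nn.Module, step: int) -> None:
    """Record per-parameter gradient histograms and norms for TensorBoard."""
    for name, param in model.named_parameters():
        if param.grad is not None:
            writer.add_histogram(f"grad/{name}", param.grad, step)
            writer.add_scalar(f"grad_norm/{name}", param.grad.norm().item(), step)

# Call after loss.backward() each step:
#   log_gradients(model, global_step)
```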
Thanks @jojonki. I wanted to use a modification of BiDAF for transfer learning from span prediction to QA. I have implemented a version of it, but I cannot get it to work as advertised. Thanks for your work.