
Batch normalization


Hello, Chelsea.

The batch normalization documentation says that these update ops are not attached to the TensorFlow graph by default, so there are two ways to force the updates during training:

  • explicitly run the ops collected in tf.GraphKeys.UPDATE_OPS as part of the train step,
  • or set the updates_collections parameter of batch_norm to None.

I don’t see either of those in the code. Maybe I’m missing something.

I haven’t been able to make the first approach work because of the while loop inside map_fn, but the second change is easy and seems to work, although I’m not sure I see any difference in performance.
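For reference, here is a minimal TF 1.x sketch of both options with tf.contrib.layers.batch_norm; the tiny network and names (build_net, x, y) are placeholders for illustration, not the MAML code itself:

```python
import tensorflow as tf

def build_net(x, is_training, force_updates_in_place=False):
    # Option 2: updates_collections=None makes the moving-mean/variance
    # updates run in place as part of the batch_norm op itself.
    updates_collections = None if force_updates_in_place else tf.GraphKeys.UPDATE_OPS
    h = tf.contrib.layers.fully_connected(x, 64, activation_fn=None)
    h = tf.contrib.layers.batch_norm(
        h, is_training=is_training, updates_collections=updates_collections)
    return tf.nn.relu(h)

x = tf.placeholder(tf.float32, [None, 10])
y = tf.placeholder(tf.float32, [None, 1])
hidden = build_net(x, is_training=True)
pred = tf.contrib.layers.fully_connected(hidden, 1, activation_fn=None)
loss = tf.reduce_mean(tf.squared_difference(pred, y))

# Option 1: explicitly make the train op depend on the collected update ops,
# so the moving statistics are refreshed on every training step.
update_ops = tf.get_collection(tf.GraphKeys.UPDATE_OPS)
with tf.control_dependencies(update_ops):
    train_op = tf.train.AdamOptimizer(1e-3).minimize(loss)
```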

Issue Analytics

  • State: closed
  • Created: 6 years ago
  • Comments: 8 (4 by maintainers)

Top GitHub Comments

3 reactions
ghost commented, Jan 12, 2018

@cbfinn, as you mentioned before,

I compute the test-time statistics using the test batch of data, instead of computing the average training statistics

This seems a bit like cheating, especially at test time. In general, we should assume that only one sample at a time is evaluated at test time, and then there is no way to get proper statistics for batch_norm. This means the test-set performance will partially depend on the batch size.
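To illustrate the point, here is a small NumPy sketch of my own (not code from the repository): with a test batch of one, the batch mean equals the sample and the batch variance is zero, so normalizing with test-batch statistics gives a degenerate, batch-size-dependent result.

```python
import numpy as np

def batch_norm_with_batch_stats(x, eps=1e-5):
    # Normalize with the statistics of the given batch, as happens when
    # batch_norm is run in training mode at evaluation time.
    mean = x.mean(axis=0)
    var = x.var(axis=0)
    return (x - mean) / np.sqrt(var + eps)

rng = np.random.RandomState(0)
data = rng.randn(128, 4) * 3.0 + 1.0   # toy activations
single = data[:1]                       # "test batch" of size 1

# Batch of one: mean equals the sample, variance is zero, output collapses to ~0.
print(batch_norm_with_batch_stats(single)[0])
# The same sample normalized inside a batch of 128 gives a different output.
print(batch_norm_with_batch_stats(data)[0])
```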

0 reactions
cbfinn commented, Apr 20, 2018

Yes

On Thu, Apr 19, 2018, 7:35 PM Jackie Loong notifications@github.com wrote:

Hi, I see your approach. If I use the moving average of the statistics by adding the update op to the train ops, do I then need to set train=False when calling the batch_norm function at test time?

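For anyone landing here later, a hedged sketch of that train/test pattern (assuming tf.contrib.layers.batch_norm in TF 1.x; the placeholder-based switch is illustrative, not the MAML code):

```python
import tensorflow as tf

is_training = tf.placeholder(tf.bool, name='is_training')
x = tf.placeholder(tf.float32, [None, 10])

h = tf.contrib.layers.batch_norm(
    x,
    is_training=is_training,                       # True during training, False at test time
    updates_collections=tf.GraphKeys.UPDATE_OPS)   # moving averages updated via UPDATE_OPS

# Training step: feed {is_training: True} and run the UPDATE_OPS together with
# (or as a control dependency of) the train op, so the moving mean/variance
# track the training statistics.
# Test step: feed {is_training: False} so the stored moving statistics are used
# instead of the current batch statistics.
```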

