question-mark
Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

Default training not converging

See original GitHub issue

Running just the train_softmax.py command in the example run.sh script with the sample data doesn’t seem to converge, even at 50 epochs.

Command:

python -u ./code/1-development/train_softmax.py --num_epochs=50 --batch_size=3 --development_dataset_path=data/development_sample_dataset_speaker.hdf5 --train_dir=results/TRAIN_CNN_3D/train_logs

Output:

image

Loss:

image

Learning rate:

image

Issue Analytics

  • State:closed
  • Created 6 years ago
  • Comments:6 (3 by maintainers)

github_iconTop GitHub Comments

1reaction
astorficommented, Oct 10, 2017

Section VI

The output of the last layer (FC5) will be fed to the softmax layer which has the cardinality of N = 511, where N is the number of speakers in the development phase. For the enrollment and evaluation stages, 100 subjects have been used and the speaker utterances are split into two equal parts for two aforementioned phases.

However, I accept it could be explained in a better way in the paper.

About the number of samples per speaker, In my experiments, I had more than 50 per speaker. Perhaps if you take 50 for development and 50 for enrollment/evaluation it can demonstrate a fair experiment. However, your background model may not be that strong for generalization. In any case, you can give it a shot!

0reactions
skandermcommented, Oct 10, 2017

Oops, I might have had an old version of the paper:

image

I’ll have a look at the newest version and re-open if there’s an issue. Thanks for clarifying!

Read more comments on GitHub >

github_iconTop Results From Across the Web

Things to try when Neural Network not Converging
I know that the things to try would vary widely depending on network architecture. But tweaking which parameters (learning rate, momentum, initial weights,...
Read more >
When does a neural network fail to converge?
In simple words, we can think of failure in convergence as a condition where we can't find the convergence point in the learning...
Read more >
training - Weights not converging while cost function has ...
The weights in a model do not need to converge to stop training. One possible explanation is that the model error surface has...
Read more >
When does the problem arise of neural networks not ... - Quora
It can be that the series that elapses the flow diverges - meaning it does not converge by definition - it doesn't close...
Read more >
Training using custom dataset does not converge #920 - GitHub
Training using custom dataset does not converge #920 ... So I left the default anchor ratios since they are valid. I have tried...
Read more >

github_iconTop Related Medium Post

No results found

github_iconTop Related StackOverflow Question

No results found

github_iconTroubleshoot Live Code

Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free

github_iconTop Related Reddit Thread

No results found

github_iconTop Related Hackernoon Post

No results found

github_iconTop Related Tweet

No results found

github_iconTop Related Dev.to Post

No results found

github_iconTop Related Hashnode Post

No results found