question-mark
Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

Can Each Source Be Trained Separately? - Unable to Train Quality Model

See original GitHub issue

❓ Questions

@adefossez

Thank you for your help in https://github.com/facebookresearch/demucs/issues/221 !

I’m running into a new problem.

I was able to get training started by using the following dset settings:

dset:
  wav: C:/Users/Anjok/Desktop/Demucs-Train/test <------ Dataset location
  sources: ['vocals', 'non_vocals']
  samplerate: 44100
  channels: 2
epochs: 320
batch_size: 6 <--------- I'm using an RTX 3090 with 24GB's of V-RAM. This was the largest batch size I was able to use.
weights: [1.,1.] <--------- I had to decrease the number of weights from 4 to 2.

I was able to train a model, but it comes out very poorly and with much distortion. The dataset consists of 1400 tracks and I’ve been able to successfully train a strong vocal model that achieved a 9.703 SDR on AIcrowds testset using the KUIELAB-MDX-Net code, so I know there isn’t an issue with the dataset. I’m wondering if having to decrease the number of weights is causing this issue? I also tried training with the vocal stem and mixture only using the following settings, but the results were even worse.

dset:
  wav: C:/Users/Anjok/Desktop/Demucs-Train/test
  sources: ['vocals']
  samplerate: 44100
  channels: 2
epochs: 320
batch_size: 6 
weights: [1.] <--------- Decreased the number of weights from 2 to 1.

I’m eager to train Demucs using our dataset because I know it’s solid, but I can’t seem to get a good model yet. What would you recommend? Perhaps I need to be using a different settings? I did a test with a small dataset consisting of 4 stem models as well, and the results were very poor. What exact settings were used to train mdx_extra?

Issue Analytics

  • State:open
  • Created 2 years ago
  • Comments:6 (1 by maintainers)

github_iconTop GitHub Comments

3reactions
Anjok07commented, Apr 3, 2022

@Anjok07 These are really fantastic values that you have achieved here, well with the number of data in the dataset.

Could you somehow make your trained model available to the community? I think demucs is a great invention, and I would like it to be even better, because of a larger dataset. Would be really great if this could be done somehow.

I plan on it! It’s going to take time though since the dataset is so big and each epoch takes about 9 hours. I recently had to restart the session from scratch due to a small mistake in my code. It’ll likely be a little over a month before I release it.

2reactions
Anjok07commented, Jun 13, 2022

These models have been released and added to the UVR GUI. Check it out here! - https://github.com/Anjok07/ultimatevocalremovergui/releases/tag/v5.3.0

Read more comments on GitHub >

github_iconTop Results From Across the Web

The Model Performance Mismatch Problem (and what to do ...
1. Model Overfitting. Perhaps the most common cause is that you have overfit the training data. You have hit upon a model, a...
Read more >
To retrain, or not to retrain? Let's get analytical about ML ...
We can see if the model trained on the complete dataset does better than the model trained only on the most recent periods....
Read more >
What Is Training Data? How It's Used in Machine Learning
A training dataset can never be comprehensive and can't teach everything that a model might encounter in the real world. Therefore a test ......
Read more >
A Comprehensive Hands-on Guide to Transfer Learning with ...
Traditional learning is isolated and occurs purely based on specific tasks, datasets and training separate isolated models on them. No knowledge ...
Read more >
How Amazon SageMaker Provides Training Information
SageMaker makes this information available to the Docker container so that your training algorithm can use it. This section explains how we make...
Read more >

github_iconTop Related Medium Post

No results found

github_iconTop Related StackOverflow Question

No results found

github_iconTroubleshoot Live Code

Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free

github_iconTop Related Reddit Thread

No results found

github_iconTop Related Hackernoon Post

No results found

github_iconTop Related Tweet

No results found

github_iconTop Related Dev.to Post

No results found

github_iconTop Related Hashnode Post

No results found