question-mark
Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

Right set of UNet hyperparameters when training DDPM

See original GitHub issue

Hi there ! I am currently training a DDPM model on a custom image dataset following the cool unconditional_image_generation example script.

Since I don’t have the compute to perform comprehensive hyperparameter tuning of my architecture, I was wondering if there are any common intuitions when designing the UNet denoiser : width/length of the residual blocks, number and positions of the attention blocks, etc. with respect to the number of samples in the training set as well as their resolution.

If anyone has a wide experience in training DMs, it would be super cool to share insights here or in a dedicated blog post such as the one discussing the hyperparameters choice when training Dreambooth.

Thank you ! 🤗

Issue Analytics

  • State:closed
  • Created 10 months ago
  • Comments:5 (3 by maintainers)

github_iconTop GitHub Comments

1reaction
patrickvonplatencommented, Nov 18, 2022

Maybe cc @anton-l ?

0reactions
leopoldmaillardcommented, Dec 1, 2022

Hello @anton-l, thank you for your insight !

I also found out that Dhariwal & Nichol discussed hyperparameters tuning of DDPM in their paper Diffusion Models Beat GANs on Image Synthesis.

Will close this for now !

Read more comments on GitHub >

github_iconTop Results From Across the Web

Parameters, Hyperparameters, Machine Learning
Therefore, setting the right hyperparameter values is very important because it directly impacts the performance of the model that will result from them...
Read more >
Trained Model in Supervised Deep Learning is a Conditional ...
We proved that a trained model in supervised deep learning minimizes the condi- tional risk for each input (Theorem 2.1).
Read more >
How diffusion models work: the math from scratch - AI Summer
We will focus on the most prominent one, which is the Denoising Diffusion Probabilistic Models (DDPM) as initialized by Sohl-Dickstein et al ...
Read more >
Keras Tuner: Lessons Learned From Tuning Hyperparameters ...
To select the right set of hyperparameters, we do hyperparameter tuning. Even though tuning might be time- and CPU-consuming, the end result pays...
Read more >
Léopold Maillard leopoldmaillard - GitHub
Right set of UNet hyperparameters when training DDPM. Hi there ! I am currently training a DDPM model on a custom image dataset...
Read more >

github_iconTop Related Medium Post

No results found

github_iconTop Related StackOverflow Question

No results found

github_iconTroubleshoot Live Code

Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free

github_iconTop Related Reddit Thread

No results found

github_iconTop Related Hackernoon Post

No results found

github_iconTop Related Tweet

No results found

github_iconTop Related Dev.to Post

No results found

github_iconTop Related Hashnode Post

No results found