Unconditional synthesis
I'm running this command to generate unconditional samples:
python -m diffwave.inference --fast /path/to/model -o output.wav
I’ve trained for almost 4k epochs on 7k+ sounds. I seem to get the same sound (or a very similar one) regardless of training time.
I have not worked with diffwave before - any tips for debugging this?
Thanks
Issue Analytics
- State:
- Created a year ago
- Comments:5 (1 by maintainers)
Top GitHub Comments
It seems that the DiffWave paper uses
res_channel = 256
for unconditional speech synthesis (but we have 64 in this code), which is why we could not get reasonable sounds.

It shouldn't output silent waveforms. When I trained only briefly, it generated noisy audio.
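As a rough illustration of the residual-channel suggestion above, here is a minimal sketch of overriding that one hyperparameter before retraining. `ModelParams` and its field names (`residual_channels`, `unconditional`) are hypothetical stand-ins, not the repository's confirmed API; check the project's own parameter file for the real names. Only the two values, 64 and 256, come from the comment.

```python
from dataclasses import dataclass, replace


# Hypothetical stand-in for the project's hyperparameter object.
# Field names are assumptions for illustration only.
@dataclass(frozen=True)
class ModelParams:
    residual_channels: int = 64   # width the comment says this code uses
    unconditional: bool = False   # assumed flag for unconditional training


default_params = ModelParams()

# Widen the residual stack to the value the comment attributes to the paper.
paper_like_params = replace(
    default_params,
    residual_channels=256,
    unconditional=True,
)

print(default_params)
print(paper_like_params)
```

A checkpoint trained with 64 channels would not be expected to load into a 256-channel model, so a change like this most likely means training again from scratch.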