Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

Reduce Stable Diffusion memory usage by keeping unet only on GPU.

See original GitHub issue

Is your feature request related to a problem? Please describe. Stable Diffusion is not compute heavy on all its steps. If we keep the diffusion unet on fp16 on GPU and everything else on CPU, we could reduce the GPU usage to 2.2GB while having a non-so-big impact on performance. It should democratize Stable Diffusion even further.

Only other thing that would need to be done is move the tensors from the devices accordingly, but we can use the models device and dtype attributes to make everything work.

Describe the solution you’d like I think what I’m proposing on https://github.com/huggingface/diffusers/pull/537 should be enough.

Describe alternatives you’ve considered Alternative is to use GPUs for the whole process and pay more for it.

Issue Analytics

State:
Created a year ago
Comments:7 (7 by maintainers)

Top GitHub Comments

1reaction

patrickvonplatencommented, Oct 7, 2022

Hey @piEsposito,

I’m wondering whether we could maybe try to just write a community pipeline for this: https://github.com/huggingface/diffusers/tree/main/examples/community

1reaction

piEspositocommented, Oct 5, 2022

I’ve created a feature request on accelerate to enable solving this in a more elegant way. If they let me work on the feature, I can open a PR and then try solving this.