Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

add device map and accelerate to DiffusionPipeline abstraction to reduce memory footprint when loading model

See original GitHub issue

Is your feature request related to a problem? Please describe. As a follow up for #281 we could add the device map and the possibility to load weights using accelerate to DiffusionPipeline abstraction for smaller memory footprint when loading models.

Describe the solution you’d like


from diffusers import DiffusionPipeline



pipeline = DiffusionPipeline.from_pretrained("CompVis/stable-diffusion-v1-3", device_map="auto")

Describe alternatives you’ve considered

Load models by hand every time
Have a larger memory footprint when loading models

Additional context This is a follow up for #281.

I can work on that if you folks would let me.

Issue Analytics

State:
Created a year ago
Comments:8 (5 by maintainers)

Top GitHub Comments

1reaction

piEspositocommented, Oct 14, 2022

@CrazyBoyM, what happens is that, for that feature to work, we need a version of accelerate with https://github.com/huggingface/accelerate/pull/747 merged. This PR was merged 4 days ago and the last release was 7 days ago.

Until they make the next release for accelerate, if you really want to use this feature I suggest you install accelerate from the master branch of the repository: pip install git+https://github.com/huggingface/accelerate.git, which you can revert to the pypi version after the next release.

1reaction

CrazyBoyMcommented, Oct 14, 2022

Hi, friend.I see this PR https://github.com/huggingface/diffusers/pull/361 but when i try this:

self.pipe = StableDiffusionPipeline.from_pretrained(
            self.model_id_or_path,
            revision="fp32", 
            device_map="auto",
            torch_dtype=torch.float32,
            scheduler = DDIMScheduler(
                beta_start=0.00085,
                beta_end=0.012,
                beta_schedule="scaled_linear",
                clip_sample=False,
                set_alpha_to_one=False,
            ),
            # use_auth_token=True,
        )
        self.pipe = self.pipe.to(self.device)

i get a error with:

   set_alpha_to_one=False,
  File "/root/.conda/envs/ai/lib/python3.7/site-packages/diffusers/pipeline_utils.py", line 517, in from_pretrained
    loaded_sub_model = load_method(os.path.join(cached_folder, name), **loading_kwargs)
  File "/root/.conda/envs/ai/lib/python3.7/site-packages/transformers/modeling_utils.py", line 2269, in from_pretrained
    max_memory=max_memory,
  File "/root/.conda/envs/ai/lib/python3.7/site-packages/accelerate/utils/modeling.py", line 480, in infer_auto_device_map
    max_layer_size, max_layer_names = get_max_layer_size(modules_to_treat, module_sizes, no_split_module_classes)
  File "/root/.conda/envs/ai/lib/python3.7/site-packages/accelerate/utils/modeling.py", line 261, in get_max_layer_size
    modules_children = list(module.named_children())
AttributeError: 'Parameter' object has no attribute 'named_children'

can you help me?

Top Results From Across the Web

Stable Diffusion with Diffusers - Hugging Face

In this post, we want to show how to use Stable Diffusion with the Diffusers library, explain how the model works and finally...

Reducing the Memory Footprint of Metal Apps - Apple Developer

Load lower -resolution textures when running on memory-constrained devices. Consider reducing the fidelity of 3D models and compressing per-vertex data. Simplify ...

Optimizing Memory Usage of Textures & Meshes // Unreal ...

Support on Patreon: https://patreon.com/techartaid/Streaming weekly at https://twitch.tv/techartaid/

Estimating GPU Memory Consumption of Deep Learning Models

ABSTRACT. Deep learning (DL) has been increasingly adopted by a variety of software-intensive systems. Developers mainly use GPUs to accel-.

Python mmap: Improved File I/O With Memory Mapping

In this tutorial, you'll learn how to use Python's mmap module to improve your code's performance when you're working with files.