question-mark
Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

[user application] which deepspeed flags are required if any

See original GitHub issue

In the context of integrating DeepSpeed into transformers, I have a question to you wrt the deepspeed cl args

The paradigm you propose is:

deepspeed myprog myargs --deepspeed --deepspeed_config ds_config.js

but as we were discussing elsewhere there are too many cl args in ML apps, and we were considering if it might make sense to collapse -deepspeed and --deepspeed_config into a single cl arg for the transformers trainer and then re-construct it into 2 cl args before it goes into deepspeed.initialize.

context: https://github.com/huggingface/transformers/pull/9211#discussion_r546732810

Do you have a strong feeling that we should keep your proposed convention of 2 cl args facing users, or do you feel that it’s fine to collapse the two? I may be just unaware of something important so I wanted to run this idea by you.

And secondly, do you by chance have any brilliant ideas on how to name a one-to-rule-them-all cl arg so that it reads nice and functionally unambiguous to users? I think --deepspeed_config_file ds_config.js isn’t clear enough since it doesn’t say deepspeed is activated, but I could be wrong.

One of devs also suggests that --deepspeed_config doesn’t make it obvious that a file argument is expected.

I wonder if --deepspeed ds_config.js would do the trick - it’s actually less obvious that it expects a file argument, but it’s unambiguous about it activating deepspeed.

I totally understand that you may not have a strong opinion or want to spend any time on this since it’s just our peculiar desire for succinctness and clarity, but if you do have suggestions I’m all ears.

Thank you.

Issue Analytics

  • State:closed
  • Created 3 years ago
  • Comments:8 (7 by maintainers)

github_iconTop GitHub Comments

1reaction
tjruwasecommented, Dec 24, 2020

@g-karthik Thanks for eliciting this very important clarification regarding the discussion. Just to confirm, there are no breaking changes.

1reaction
stas00commented, Dec 24, 2020

@g-karthik, I think you may have misunderstood the intention/scope of this thread.

I was only asking for an advice of which flags would be best to use in HuggingFace transformers which will soon have DeepSpeed integrated.

I don’t think there are any plans on removing support for any existing flags in the DeepSpeed project itself.

Looking at the title of this Issue, I can see how you could have interpreted this - I have rephrased it so I hope now it won’t raise any alarms for others.

Read more comments on GitHub >

github_iconTop Results From Across the Web

Training Setup — DeepSpeed 0.8.0 documentation
The set of DeepSpeed arguments include the following: 1) –deepspeed: boolean flag to enable DeepSpeed 2) –deepspeed_config <json file path>: path of a...
Read more >
DeepSpeed Configuration JSON
Users need to identify whether the model is decoder or encoder . Currently we only support these two. N/A. hidden_state_order: [str], Users need...
Read more >
DeepSpeed Integration - Hugging Face
If you're using only 1 GPU, here is how you'd have to adjust your training code in the notebook to use DeepSpeed. Copied....
Read more >
PyTorch Lightning vs DeepSpeed vs FSDP vs FFCV vs
PyTorch Lightning has become one of the most widely used deep learning frameworks in the world by allowing users to focus on the...
Read more >
Principal Researcher at Microsoft | WORK180
Are you interested in being part of an exciting open-source library for deep learning systems? The DeepSpeed team is hiring! DeepSpeed (http://deepspeed.ai) ...
Read more >

github_iconTop Related Medium Post

No results found

github_iconTop Related StackOverflow Question

No results found

github_iconTroubleshoot Live Code

Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free

github_iconTop Related Reddit Thread

No results found

github_iconTop Related Hackernoon Post

No results found

github_iconTop Related Tweet

No results found

github_iconTop Related Dev.to Post

No results found

github_iconTop Related Hashnode Post

No results found