Suggestion: Support CPU-only environments/remove CUDA requirement
See original GitHub issueHi,
I’m training neural networks on a small CPU-only workstation, and I wanted to use the DeepSpeedCPUAdam optimizer to accelerate the training, but noticed that CUDA is required:
File "...python3.9/site-packages/deepspeed/ops/op_builder/builder.py", line 37, in installed_cuda_version
assert cuda_home is not None, "CUDA_HOME does not exist, unable to compile CUDA op(s)"
Making CUDA optional would be a great feature. In my own field (climate modeling and weather forecasting), machine learning is an increasingly popular topic but GPU’s and deep learning are often overkill. (Right now, I’m training neural nets with 1-2 hidden layers and 10-20 neurons, which is quite efficient on CPU’s). Furthermore, support for non-NVIDIA GPU’s should be relevant in the longer term.
I realize this is presented as a deep learning library, but dropping the CUDA requirement for features which do not really require it might still be useful. Thanks!
Issue Analytics
- State:
- Created 2 years ago
- Comments:7 (3 by maintainers)
Top Results From Across the Web
Why does Anaconda install pytorch cpuonly when I install cuda?
I ran into a similar problem when I tried to install Pytorch with CUDA 11.1. Although the anaconda site explicitly lists a pre-built...
Read more >CUDA C++ Best Practices Guide
This Best Practices Guide is a manual to help developers obtain the best performance from NVIDIA ® CUDA® GPUs. It presents established parallelization...
Read more >Getting started with PyTorch - IBM
The CPU-only variant is built without CUDA and GPU support. It has a smaller installation size, and omits features that would require a...
Read more >Deep Learning graphic card requirement - Esri Community
Dan's suggestion of installing the pytorch-cpu package is probably the simplest way to avoid PyTorch using CUDA that is older, another option if ......
Read more >建议:支持仅限CPU的环境/删除CUDA要求:Suggestion
Suggestion : Support CPU-only environments/remove CUDA requirement嗨, 我正在train 一个small CPU工作站上的神经网络,我想使用DeepSpeedcpuadam ...
Read more >Top Related Medium Post
No results found
Top Related StackOverflow Question
No results found
Troubleshoot Live Code
Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start FreeTop Related Reddit Thread
No results found
Top Related Hackernoon Post
No results found
Top Related Tweet
No results found
Top Related Dev.to Post
No results found
Top Related Hashnode Post
No results found
Top GitHub Comments
hello, I have the same problem now. Is there a solution?
I’m also having this issue on deepspeed==0.5.10. My model trains fine with an optimizer from
torch.optim
but when usingdeepspeed.ops.adam.FusedAdam
I get: