pip install with building ops succeeds without installing any ops.
See original GitHub issueI’m trying to run deepseed with cpu adam, and after doing:
DS_BUILD_OPS=1 pip install deepspeed
here the output of ds_report
:
--------------------------------------------------
DeepSpeed C++/CUDA extension op report
--------------------------------------------------
NOTE: Ops not installed will be just-in-time (JIT) compiled at
runtime if needed. Op compatibility means that your system
meet the required dependencies to JIT install the op.
--------------------------------------------------
JIT compiled ops requires ninja
ninja .................. [OKAY]
--------------------------------------------------
op name ................ installed .. compatible
--------------------------------------------------
cpu_adam ............... [NO] ....... [OKAY]
fused_adam ............. [NO] ....... [OKAY]
fused_lamb ............. [NO] ....... [OKAY]
/bin/sh: line 1: type: llvm-config-9: not found
/bin/sh: line 1: type: cmake: not found
[WARNING] sparse_attn requires the 'cmake' command, but it does not exist!
sparse_attn ............ [NO] ....... [NO]
transformer ............ [NO] ....... [OKAY]
stochastic_transformer . [NO] ....... [OKAY]
utils .................. [NO] ....... [OKAY]
[WARNING] async_io requires the libraries: ['libaio-dev'] but are missing.
async_io ............... [NO] ....... [NO]
--------------------------------------------------
DeepSpeed general environment info:
torch install path ............... ['/home/sgugger/.pyenv/versions/3.7.9/envs/base/lib/python3.7/site-packages/torch']
torch version .................... 1.8.1+cu102
torch cuda version ............... 10.2
nvcc version ..................... 10.2
deepspeed install path ........... ['/home/sgugger/.pyenv/versions/3.7.9/envs/base/lib/python3.7/site-packages/deepspeed']
deepspeed info ................... 0.3.16, unknown, unknown
deepspeed wheel compiled w. ...... torch 1.8, cuda 10.2
I then cannot run anything using cpu adam (it’s not JITted at runtime as implied by the report), for instance pytest tests/deepspeed/test_deepspeed.py
in the Transformers repo fails.
If you have any insight on how to get those ops working, that would be great!
Issue Analytics
- State:
- Created 2 years ago
- Comments:5 (3 by maintainers)
Top Results From Across the Web
How to install modules with PIP (and fix it when it fails) - Medium
sudo pip install (THE NAME OF THE MODULE). (This will not work in Windows no matter how hard you try.) Now if none...
Read more >ImportError after successful pip installation [duplicate]
I have successfully installed a library with pip install <library-name> . But when I try to import it, python raises ImportError: No module ......
Read more >Hosting Python packages in Azure DevOps - Developer Support
App Dev Manager Herald Gjura demonstrates how to host Python packages using Azure DevOps. Problem to solve. I started to build a solution ......
Read more >pip install through azure devops feed fails
pip install through azure devops feed fails - ... if you include a step to upgrade/downgrade PIP the docker build will succeed. if...
Read more >Installing python deps with pip -r requirements.txt fails, but ...
Thus, pip is unable to build pyopencl , thus can't prepare it for installation and in the end, fails the whole command, installing...
Read more >Top Related Medium Post
No results found
Top Related StackOverflow Question
No results found
Troubleshoot Live Code
Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start FreeTop Related Reddit Thread
No results found
Top Related Hackernoon Post
No results found
Top Related Tweet
No results found
Top Related Dev.to Post
No results found
Top Related Hashnode Post
No results found
Top GitHub Comments
Yes, this gives a clean error as well.
Good. So the prebuild side of things is covered.
Now to the JIT-way failure, what’s relevant log when you start the application the first time and JIT fails?
You will have to uninstall the prebuilt one of course.