zombie processes from dvc pull failures
Please provide information about your setup
DVC version (i.e. dvc --version), platform and method of installation (pip, homebrew, pkg Mac, exe (Windows), DEB (Linux), RPM (Linux))
DVC version: 0.93.0. Platform: DGX2 Kubernetes cluster. Installation: pip.
The base image we use:
FROM nvcr.io/nvidia/pytorch:19.10-py3
When we call dvc pull and the pull fails for any reason (for example, missing credentials, or interrupting it with Ctrl-C), we see a lot of zombie processes start spawning on our pod. They accumulate until they tie up all free resources on the cluster and grind things to a halt. Can someone look into this please? It's affecting a pilot team promoting DVC usage at a major healthcare corporation.
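For context, a zombie is a child process that has exited but whose parent has not yet collected its exit status. The following is a minimal, Linux-only sketch of that mechanism, not DVC's code; the helper names (process_state, wait_for_state, demo) are hypothetical and it assumes /proc is mounted.

```python
import subprocess
import sys
import time


def process_state(pid):
    """Return the one-letter state from /proc/<pid>/status, or None if the PID is gone."""
    try:
        with open(f"/proc/{pid}/status") as status_file:
            for line in status_file:
                if line.startswith("State:"):
                    return line.split()[1]
    except (FileNotFoundError, ProcessLookupError):
        return None
    return None


def wait_for_state(pid, wanted, timeout=5.0):
    """Poll until the process reaches the wanted state or the timeout expires."""
    deadline = time.monotonic() + timeout
    while time.monotonic() < deadline:
        if process_state(pid) == wanted:
            return True
        time.sleep(0.05)
    return False


def demo():
    # Start a child that exits immediately; the parent does not wait for it yet.
    child = subprocess.Popen([sys.executable, "-c", "pass"])
    # Until the parent reaps it, the exited child is reported in state "Z".
    became_zombie = wait_for_state(child.pid, "Z")
    # wait() collects the exit status, which removes the zombie entry.
    child.wait()
    state_after_wait = process_state(child.pid)
    return became_zombie, state_after_wait
```

If a parent repeatedly spawns children and never waits on them, entries like this accumulate, which would match the symptom described above. Whether that is what happens inside dvc pull is not established here.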
Issue Analytics
- State:
- Created 3 years ago
- Reactions:1
- Comments:7 (6 by maintainers)
@ychou85 could you please run
ps -aef | grep dvc
or something similar to get the full dvc commands with arguments, so we can see what is actually running now?
@ychou85 could you please try opting out of the analytics as described here: https://dvc.org/doc/user-guide/analytics ?
@efiop could it also be the updater daemon?
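To narrow down which commands leave zombies behind, the ps output can be filtered by process state rather than by name. This is a rough sketch, assuming a ps that supports the pid, ppid, stat and args output columns (as procps on Linux does); parse_zombies and list_zombies are hypothetical helper names, not part of DVC.

```python
import subprocess


def parse_zombies(ps_output):
    """Return (pid, ppid, command) tuples for rows whose STAT column starts with 'Z'."""
    zombies = []
    for line in ps_output.splitlines()[1:]:  # skip the header row
        fields = line.split(None, 3)
        if len(fields) < 3:
            continue
        pid, ppid, stat = fields[0], fields[1], fields[2]
        command = fields[3] if len(fields) > 3 else ""
        if stat.startswith("Z"):
            zombies.append((int(pid), int(ppid), command))
    return zombies


def list_zombies():
    """Run ps and return the zombie rows it currently reports."""
    output = subprocess.run(
        ["ps", "-eo", "pid,ppid,stat,args"],
        capture_output=True,
        text=True,
        check=True,
    ).stdout
    return parse_zombies(output)


if __name__ == "__main__":
    for pid, ppid, command in list_zombies():
        print(f"zombie pid={pid} parent={ppid} command={command}")
```

The parent PID is the useful part: running ps on that PID shows the full command line of the process that failed to reap its children.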