zombie processes from dvc pull failures

Please provide information about your setup: DVC version (i.e. dvc --version), platform, and method of installation (pip, Homebrew, pkg (Mac), exe (Windows), DEB (Linux), RPM (Linux)).

DVC version: 0.93.0. Platform: DGX2 Kubernetes cluster. Installation: pip.

The base image we use:

FROM nvcr.io/nvidia/pytorch:19.10-py3

When we call dvc pull and the pull fails for any reason (missing credentials, for example, or an interrupt with Ctrl-C), a large number of zombie processes start spawning on our pod. They accumulate until they tie up all free resources on the cluster and grind things to a halt. Can someone look into this, please? It's affecting a pilot team promoting DVC usage at a major healthcare corporation.
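
For context, a zombie is a child process that has exited but whose parent has not yet collected its exit status. The sketch below shows that mechanism in isolation on Linux; it is not a reproduction of what DVC does internally. The helper names (proc_state, demo_zombie) are illustrative, and it assumes a mounted /proc filesystem.

```python
import subprocess
import sys
import time


def proc_state(pid):
    """Return the one-letter state from /proc/<pid>/stat, or None if the process is gone."""
    try:
        with open(f"/proc/{pid}/stat") as f:
            data = f.read()
    except (FileNotFoundError, ProcessLookupError):
        return None
    # The state field is the first token after the parenthesised command name.
    return data.rsplit(")", 1)[1].split()[0]


def demo_zombie():
    """Start a short-lived child, observe its state before and after reaping."""
    # The child exits almost immediately; the parent deliberately does not wait yet.
    child = subprocess.Popen([sys.executable, "-c", "pass"])
    state = None
    for _ in range(200):  # poll for up to roughly 10 seconds
        state = proc_state(child.pid)
        if state == "Z":
            break
        time.sleep(0.05)
    before = state
    child.wait()  # the parent reaps the child, which removes the zombie entry
    after = proc_state(child.pid)
    return before, after


if __name__ == "__main__":
    print(demo_zombie())
```

If a parent never performs the equivalent of child.wait(), or exits and leaves its children to a PID 1 that does not reap them, entries like this stay in the process table.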

Issue Analytics

  • State: closed
  • Created: 3 years ago
  • Reactions: 1
  • Comments: 7 (6 by maintainers)

Top GitHub Comments

shcheklein commented, May 7, 2020 (1 reaction)

@ychou85 could you please run ps -aef | grep dvc or something similar to get the full dvc commands with their arguments, so we can see what is actually running now?
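
To narrow that output down to defunct processes and see which parent is failing to reap them, something like the following could be used. It is a sketch that assumes a ps accepting -eo pid,ppid,stat,args (as procps does); the function names are illustrative and not part of DVC.

```python
import subprocess


def find_zombies(ps_output):
    """Parse `ps -eo pid,ppid,stat,args` output and return the zombie rows."""
    zombies = []
    for line in ps_output.splitlines()[1:]:  # skip the header row
        parts = line.split(None, 3)
        if len(parts) < 3:
            continue
        pid, ppid, stat = parts[0], parts[1], parts[2]
        args = parts[3].strip() if len(parts) > 3 else ""
        # A STAT value beginning with "Z" marks a zombie (defunct) process.
        if stat.startswith("Z"):
            zombies.append({"pid": int(pid), "ppid": int(ppid), "args": args})
    return zombies


def list_zombies():
    """Run ps on the current host or pod and return its zombie processes."""
    result = subprocess.run(
        ["ps", "-eo", "pid,ppid,stat,args"],
        capture_output=True,
        text=True,
        check=True,
    )
    return find_zombies(result.stdout)
```

The ppid of each entry points at the process that should be collecting the exit status, which is usually the more useful thing to inspect.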

shcheklein commented, May 7, 2020 (1 reaction)

@ychou85 could you please try opting out of analytics as described here: https://dvc.org/doc/user-guide/analytics ?

@efiop could it also be the updater daemon?
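
The analytics opt-out mentioned in this comment is a config setting, core.analytics, which the linked guide describes turning off with dvc config. A minimal sketch of applying it from a script follows; the wrapper functions are hypothetical helpers, and whether this setting has any bearing on the zombie processes is exactly what the comment is asking to test.

```python
import shutil
import subprocess


def analytics_opt_out_command(global_scope=True):
    """Build the `dvc config` invocation that sets core.analytics to false."""
    cmd = ["dvc", "config"]
    if global_scope:
        # --global writes to the user-level config instead of the current repo.
        cmd.append("--global")
    cmd += ["core.analytics", "false"]
    return cmd


def disable_dvc_analytics(global_scope=True):
    """Run the opt-out command; return False if dvc is not on PATH."""
    if shutil.which("dvc") is None:
        return False
    subprocess.run(analytics_opt_out_command(global_scope), check=True)
    return True
```

Without --global the setting applies only to the repository the command is run in, so a container image would more likely want the global form.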
