question-mark
Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

`toil kill` doesn't cancel batch system jobs

See original GitHub issue

As noted in https://gitter.im/bd2k-genomics-toil/Lobby?at=618eb0f3fb3dcd4e88818830, toil kill doesn’t seem to be using the normal shared file machinery to find pid.log, and when we reorganized where shared files live, it lost the ability to find it.

We should change toil kill to use the normal shared file read code, so it will always be able to find the files it is looking for, no matter where we store them.

┆Issue is synchronized with this Jira Story ┆friendlyId: TOIL-1087

Issue Analytics

  • State:closed
  • Created 2 years ago
  • Comments:9 (8 by maintainers)

github_iconTop GitHub Comments

1reaction
adamnovakcommented, Dec 9, 2021

We don’t necessarily have a way to find the workers. We could do something like the services do and have a terminate flag file in the job store that gets polled, and make the workers (and leader, for when it’s on another machine) have a thread to poll it and self-terminate.

0reactions
unito-botcommented, Dec 9, 2021

➤ Melaina Legaspi commented:

Lon Blauvelt : “Try to get the PID from the job store and then kill it with a gentle kill. Then kill all the workers. Toil kill should monitor to make sure all the workers are eliminated.“

Read more comments on GitHub >

github_iconTop Results From Across the Web

Toil Workflow Options and Command Line Interface
--stats option. Using this command makes certain that toil does not delete the job store, no matter what other options are specified (i.e....
Read more >
cancel batch tasks - OpenText Forums
The "clean" way to do this is to write a script (or Java program?) that essentially: Assuming you write the script to have...
Read more >
How to terminate a background process? - Unix Stack Exchange
Method #1 - ps. You can use the ps command to find the process ID for this process and then use the PID...
Read more >
linux - Why "jobs -p | kill -n 15" doesn't work? - Server Fault
I'm trying to install RVM (Ruby Version Manager). I was doing rm -r ./.rvm -i command. I wanted to remove .rvm folder and...
Read more >
Batch Jobs — HPC User Guide 1 documentation
At this stage your job has joined the batch system queue – all jobs wait in the queue until the batch system scheduler...
Read more >

github_iconTop Related Medium Post

No results found

github_iconTop Related StackOverflow Question

No results found

github_iconTroubleshoot Live Code

Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free

github_iconTop Related Reddit Thread

No results found

github_iconTop Related Hackernoon Post

No results found

github_iconTop Related Tweet

No results found

github_iconTop Related Dev.to Post

No results found

github_iconTop Related Hashnode Post

No results found