`toil kill` doesn't cancel batch system jobs
As noted in https://gitter.im/bd2k-genomics-toil/Lobby?at=618eb0f3fb3dcd4e88818830, `toil kill` doesn’t seem to be using the normal shared file machinery to find `pid.log`, and when we reorganized where shared files live, it lost the ability to find it.
We should change `toil kill` to use the normal shared file read code, so it will always be able to find the files it is looking for, no matter where we store them.
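A minimal sketch of what reading the PID through the job store could look like, rather than building a filesystem path by hand. The helper name `read_leader_pid` is hypothetical. The method name `read_shared_file_stream` is an assumption here, taken to be a context manager that yields a binary stream; check the job store API of your Toil version before relying on it.

```python
def read_leader_pid(job_store, shared_name="pid.log"):
    """Return the leader PID recorded in a shared file of the job store.

    Assumption: job_store.read_shared_file_stream(name) is a context
    manager yielding a binary stream. The job store decides where the
    file actually lives, so this keeps working if that layout changes.
    """
    with job_store.read_shared_file_stream(shared_name) as stream:
        return int(stream.read().decode("utf-8").strip())
```

Because the lookup goes through the job store object, the caller never needs to know the on-disk location of `pid.log`.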
┆Issue is synchronized with this Jira Story ┆friendlyId: TOIL-1087
Issue Analytics
- State:
- Created 2 years ago
- Comments:9 (8 by maintainers)
Top GitHub Comments
We don’t necessarily have a way to find the workers. We could do something like the services do and have a terminate flag file in the job store that gets polled, and make the workers (and leader, for when it’s on another machine) have a thread to poll it and self-terminate.
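The polling idea could be sketched roughly as follows. This is not Toil code: the function name `start_terminate_watcher` is hypothetical, and a plain local file path stands in for a flag file in the job store.

```python
import os
import threading


def start_terminate_watcher(flag_path, on_terminate, interval=1.0):
    """Poll for a terminate flag file in a background thread.

    flag_path is a stand-in for a flag file in the job store (assumption).
    on_terminate is called once when the flag appears.
    Returns (thread, stop_event); set stop_event to end polling early.
    """
    stop_event = threading.Event()

    def poll():
        while not stop_event.is_set():
            if os.path.exists(flag_path):
                on_terminate()
                return
            # Sleep for the interval, but wake immediately if stopped.
            stop_event.wait(interval)

    thread = threading.Thread(target=poll, daemon=True)
    thread.start()
    return thread, stop_event
```

A worker or leader would start one watcher at startup and pass a callback that shuts the process down; the kill command would then only need to create the flag file.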
➤ Melaina Legaspi commented:
Lon Blauvelt: “Try to get the PID from the job store and then kill it with a gentle kill. Then kill all the workers. Toil kill should monitor to make sure all the workers are eliminated.”
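A rough sketch of the "gentle kill, then monitor" step for a single PID on a POSIX system. The function names are hypothetical, and the sketch covers only one process on the local machine; finding and killing the workers is not addressed here.

```python
import os
import signal
import time


def pid_alive(pid):
    """Return True if a process with this PID exists (POSIX only)."""
    try:
        os.kill(pid, 0)  # signal 0 checks existence without sending anything
    except ProcessLookupError:
        return False
    except PermissionError:
        return True  # the process exists but belongs to another user
    return True


def gentle_kill(pid, timeout=10.0, poll_interval=0.1):
    """Send SIGTERM, then poll until the process is gone or timeout expires.

    Returns True if the process no longer exists, False if it survived.
    Note: an unreaped child of the calling process still counts as alive.
    """
    try:
        os.kill(pid, signal.SIGTERM)
    except ProcessLookupError:
        return True  # already gone
    deadline = time.monotonic() + timeout
    while time.monotonic() < deadline:
        if not pid_alive(pid):
            return True
        time.sleep(poll_interval)
    return not pid_alive(pid)
```

If `gentle_kill` returns False, the caller can decide whether to escalate to a harder signal or report the failure.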