"Terminate" does not stop pipeline run execution
What steps did you take:
Hitting “Terminate” fails to stop pipeline execution if there are steps with a retryStrategy or with running init containers.
What happened:
We are running a pipeline that has roughly 50 parallel step executions. To stop a pipeline run, we have to delete the Kubernetes Workflow resource rather than use the “Terminate” UI button, because the button does not stop the run if:
- There are steps (containers) with a retryStrategy, as they are restarted instead of being terminated
- There are any init containers, as the step won’t be stopped until they finish and the main container takes over
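The workaround mentioned above (deleting the Workflow resource instead of using “Terminate”) could be scripted roughly as follows. This is a minimal sketch, not something from the issue itself: it assumes `kubectl` is on the PATH and configured for the cluster, that the Argo Workflow CRD is installed so `workflow` resolves as a resource type, and that runs live in a `kubeflow` namespace. The function names and the example workflow name are hypothetical.

```python
import shlex
import subprocess
from typing import List


def build_delete_workflow_cmd(workflow_name: str, namespace: str = "kubeflow") -> List[str]:
    """Build the kubectl command that deletes a Workflow resource.

    The default namespace "kubeflow" is an assumption; adjust to your install.
    """
    if not workflow_name:
        raise ValueError("workflow_name must not be empty")
    return ["kubectl", "delete", "workflow", workflow_name, "--namespace", namespace]


def delete_workflow(workflow_name: str, namespace: str = "kubeflow", dry_run: bool = True) -> str:
    """Delete the Workflow, or only return the command line when dry_run is True."""
    cmd = build_delete_workflow_cmd(workflow_name, namespace)
    if not dry_run:
        # Raises CalledProcessError if kubectl exits with a non-zero status.
        subprocess.run(cmd, check=True)
    return shlex.join(cmd)


# Hypothetical workflow name; dry_run=True only shows what would be executed.
print(delete_workflow("my-pipeline-run-abc12"))
```

Keeping `dry_run=True` as the default makes it possible to inspect the command before anything is removed from the cluster.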
What did you expect to happen:
All pipeline steps/pods are terminated the instant the user hits “Terminate”, with no retries, as if the corresponding Workflow had been deleted.
How did you deploy Kubeflow Pipelines (KFP)?
Kubeflow Pipelines is installed as part of the Kubeflow project.
KFP version: Build commit: 743746b
KFP SDK version: 0.5.1
Anything else you would like to add:
Please let me know if additional info is required
/kind bug /area frontend /area backend
Issue Analytics
- State:
- Created 3 years ago
- Reactions: 9
- Comments: 25 (13 by maintainers)
Creating a minimal pipeline would require a lot of GitHub archeology, recurrent scheduling - and would probably not provoke the error, since the Kubeflow cluster is now upgraded.
What worked in the end was to remove the row that described the pipeline run from the MySQL database. This screenshot shows the database:
In the table run_details, delete the row with the same UUID as seen in the Kubeflow UI.
Sure, code is in https://github.com/kubeflow/pipelines/blob/d6a2c23f56943ea8af35b5b2e5f6c6381bfb25ed/backend/src/apiserver/resource/resource_manager.go#L462
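The database workaround described above (removing the run's row from run_details by UUID) might look roughly like the sketch below. To keep it runnable without a cluster, it uses an in-memory SQLite database as a stand-in for the KFP MySQL database; the two-column schema, the `DisplayName` column, and the sample UUIDs are simplifications made up for illustration. Against MySQL, the same parameterized `DELETE` would go through a MySQL driver, whose placeholder syntax may differ.

```python
import sqlite3


def delete_run_row(conn: sqlite3.Connection, run_uuid: str) -> int:
    """Delete the run_details row with the given UUID and return the rows removed."""
    # Parameterized query: the UUID is never interpolated into the SQL string.
    cur = conn.execute("DELETE FROM run_details WHERE UUID = ?", (run_uuid,))
    conn.commit()
    return cur.rowcount


# Stand-in database with a simplified, assumed schema (not the real KFP schema).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE run_details (UUID TEXT PRIMARY KEY, DisplayName TEXT)")
conn.executemany(
    "INSERT INTO run_details VALUES (?, ?)",
    [("run-1", "stuck run"), ("run-2", "healthy run")],
)

# Use the UUID shown for the stuck run in the Kubeflow UI (placeholder here).
deleted = delete_run_row(conn, "run-1")
remaining = [row[0] for row in conn.execute("SELECT UUID FROM run_details")]
print(deleted, remaining)
```

Checking the returned row count is a cheap way to confirm that exactly one run was matched before assuming the stuck run is gone.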